Skip to content

Fix EAT (Eastleigh) scraper#371

Open
symroe wants to merge 1 commit into
masterfrom
fix/EAT-scraper
Open

Fix EAT (Eastleigh) scraper#371
symroe wants to merge 1 commit into
masterfrom
fix/EAT-scraper

Conversation

@symroe

@symroe symroe commented Jun 20, 2026

Copy link
Copy Markdown
Member

What broke

Eastleigh's ModGov endpoint (meetings.eastleigh.gov.uk) returns HTTP 403 Forbidden to wreq's Firefox TLS fingerprint from Lambda. The server's WAF or firewall is rejecting the connection at the HTTP layer (not a certificate issue — the TLS handshake completes but the response is 403). The endpoint is live but blocking Lambda's specific HTTP client.

What was fixed

  • councillors.py: added http_lib = "playwright" — Playwright uses Chromium's standard Chrome TLS fingerprint and executes any JS challenge, bypassing the WAF filter that blocks wreq's Firefox fingerprint.

Scrape results

Metric Count
Councillors found TBC after Lambda run
With email address TBC after Lambda run
With photo TBC after Lambda run

Cannot run locally due to DNS resolution restrictions in the build environment. Counts will be confirmed once the Lambda scraper runs with Playwright.


Generated by Claude Code

meetings.eastleigh.gov.uk returns HTTP 403 to wreq's Firefox TLS
fingerprint from Lambda. Switching to playwright uses Chromium's
Chrome TLS fingerprint and executes any JS challenge, bypassing
the WAF filter.
@symroe

symroe commented Jun 20, 2026

Copy link
Copy Markdown
Member Author

Re-scrape after f2bb011

Added http_lib = "playwright" to bypass the WAF 403 on meetings.eastleigh.gov.uk.

Cannot run locally (DNS resolution for external hosts blocked in build environment). The endpoint returns 403 to both wreq and standard web clients from this IP range, consistent with a WAF fingerprint or IP filter that Playwright's Chromium TLS fingerprint and JS execution capability should bypass (same pattern as recent EPS, NYE, LBH fixes).

Metric Count
Councillors found TBC after Lambda run
With email address TBC after Lambda run
With photo TBC after Lambda run

Generated by Claude Code

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant