US Government Website Scans
This project discovers and catalogs scan data for United States government websites, organized by state and federal domains.
Focus
- URL validation and redirect tracking
- Accessibility statement detection
- Social media, technology, and third-party script scanning
- Lighthouse quality audits
Dataset
- Source CSV imports live in
data/imports/google_sheets/ - TOON seed outputs live in
data/toon-seeds/ - One seed per state plus a federal seed file
Reports
Detailed scan reports are being regenerated for the US-focused dataset.
Use these pages as scan pipelines are refreshed:
scan-progress.mdsocial-media.mdaccessibility-statements.mdtechnology-scanning.mdthird-party-tools.mdlighthouse-scanning.mddomains.md