US Government Website Scans

This project discovers United States government websites and catalogs scan data for them, organized by state and federal domains.

Focus

  • URL validation and redirect tracking
  • Accessibility statement detection
  • Social media, technology, and third-party script scanning
  • Lighthouse quality audits
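
As a rough illustration of the first focus area, the sketch below validates a URL and records each hop of a redirect chain. It is not this project's scanner. The names `is_valid_url`, `track_redirects`, and the `fetch` callable are hypothetical, and `fetch` is stubbed with a small lookup table so the example runs without network access.

```python
from urllib.parse import urljoin, urlparse

# HTTP status codes that signal a redirect.
REDIRECT_CODES = {301, 302, 303, 307, 308}


def is_valid_url(url):
    """Return True for absolute http(s) URLs that include a host."""
    parts = urlparse(url)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def track_redirects(url, fetch, max_hops=10):
    """Follow redirects starting at url and return a list of (url, status) hops.

    fetch is a hypothetical callable: fetch(url) -> (status, location).
    A real scanner would back it with an HTTP client.
    """
    if not is_valid_url(url):
        raise ValueError(f"not an absolute http(s) URL: {url!r}")
    chain = []
    seen = set()
    # Stop on a hop limit or when a URL repeats (redirect loop).
    while len(chain) < max_hops and url not in seen:
        seen.add(url)
        status, location = fetch(url)
        chain.append((url, status))
        if status not in REDIRECT_CODES or not location:
            break
        # Location may be relative, so resolve it against the current URL.
        url = urljoin(url, location)
    return chain


# Stubbed responses standing in for real HTTP requests (made-up data).
FAKE_RESPONSES = {
    "http://example.gov/": (301, "https://example.gov/"),
    "https://example.gov/": (302, "/home"),
    "https://example.gov/home": (200, None),
}

chain = track_redirects("http://example.gov/", lambda u: FAKE_RESPONSES[u])
for hop_url, status in chain:
    print(status, hop_url)
```

Passing `fetch` in as a parameter keeps the chain-following logic separate from the HTTP client, so it can be tested offline.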

Dataset

  • Source CSV imports live in data/imports/google_sheets/
  • TOON seed outputs live in data/toon-seeds/
  • One seed per state plus a federal seed file
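
The "one seed per state plus a federal seed" layout lends itself to a simple coverage check. The sketch below is only an assumption about how that could look: the `.toon` extension, the lowercase file names, and the `missing_seeds` helper are hypothetical and are not taken from this repository.

```python
from pathlib import PurePosixPath


def missing_seeds(seed_files, expected):
    """Return the expected seed names that have no matching file, sorted.

    seed_files: iterable of seed file paths.
    expected: iterable of seed names, compared against each file's stem.
    """
    present = {PurePosixPath(path).stem for path in seed_files}
    return sorted(set(expected) - present)


# Hypothetical file names; the real naming scheme may differ.
seed_files = [
    "data/toon-seeds/alabama.toon",
    "data/toon-seeds/federal.toon",
]
expected = ["alabama", "alaska", "federal"]

missing = missing_seeds(seed_files, expected)
print(missing)
```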

Reports

Detailed scan reports are being regenerated for the US-focused dataset.

Check these pages for updated results as the scan pipelines are refreshed:

  • scan-progress.md
  • social-media.md
  • accessibility-statements.md
  • technology-scanning.md
  • third-party-tools.md
  • lighthouse-scanning.md
  • domains.md