EU Government Website Scans

This project catalogues how European and allied government websites use social media, whether their URLs are reachable, whether they publish accessibility statements, and which technology platforms and third-party JavaScript services power them.

Current Scan Progress

Progress as of 2026-07-23 03:23 UTC

Scan Type	Pages Scanned	Coverage	Avg Age
Combined Reachability	44,977 confirmed reachable	51.3%	—
Social Media	49,503 scanned (44,966 reachable)	56.4%	1.5 days
Technology	4,068 scanned	4.6%	1.4 days
Accessibility Statements	13,297 scanned	15.2%	1.5 days

32 countries with scan data · 44,977 of 87,696 available pages confirmed reachable. See the Scan Progress Report for full details.
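The coverage column appears to be each scan's page count divided by the 87,696 available pages. That is an inference from the published figures, not a documented formula, and the function name below is hypothetical. A minimal sketch of the arithmetic:

```python
# Assumption: coverage = pages scanned / available pages, as a percentage
# rounded to one decimal place. The denominator comes from the summary line.
AVAILABLE_PAGES = 87_696


def coverage_percent(scanned: int, available: int = AVAILABLE_PAGES) -> float:
    """Return scan coverage as a percentage with one decimal place."""
    return round(scanned / available * 100, 1)


if __name__ == "__main__":
    for label, scanned in [
        ("Combined Reachability", 44_977),
        ("Social Media", 49_503),
        ("Technology", 4_068),
        ("Accessibility Statements", 13_297),
    ]:
        print(f"{label}: {coverage_percent(scanned)}%")
```

Run against the four page counts in the table, this reproduces the published percentages.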

Latest Scan Results

Scan Progress Report — The best place to start for overall coverage, scan status, and country-level comparisons across the project.
Social Media — Government use of legacy and open social platforms, with evidence behind the published counts.
Accessibility Statements — Country-by-country evidence showing which pages do and do not publish accessibility statements.
Technology Scanning — Detected CMSs, frameworks, analytics tools, and other software found on government sites.
Third-Party JavaScript — External scripts, services, and hosted dependencies loaded by government pages.
Lighthouse Scanning — Google Lighthouse methodology, workflow details, and page-level quality scores as they are collected.
Government Domains — The tracked source dataset: government domains and page URLs used as the input for scans, grouped by country.
Scan Cycle Pace — Whether each scanner is on pace to finish its target cycle (30 or 60 days), based on recent scan velocity.
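
The pace check described under Scan Cycle Pace can be approximated by projecting how long the remaining pages would take at the recent scan rate. The sketch below is illustrative only: the function and parameter names are hypothetical, and the project's own calculation may weight recent runs differently.

```python
import math


def projected_days_to_finish(remaining_pages: int, pages_per_day: float) -> float:
    """Days needed to scan the remaining pages at the recent velocity."""
    if pages_per_day <= 0:
        return math.inf  # no recent progress, so the cycle never finishes
    return remaining_pages / pages_per_day


def on_pace(total_pages: int, scanned_in_cycle: int, days_elapsed: float,
            recent_pages_per_day: float, cycle_days: int) -> bool:
    """True if the scanner is projected to finish within its target cycle."""
    remaining = max(total_pages - scanned_in_cycle, 0)
    days_left = cycle_days - days_elapsed
    return projected_days_to_finish(remaining, recent_pages_per_day) <= days_left


if __name__ == "__main__":
    # Hypothetical figures: a 30-day cycle, 10 days in, 3,000 pages per day.
    print(on_pace(87_696, 40_000, 10, 3_000, cycle_days=30))
```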

What We Track

Social Media

We check government pages for links to legacy and open social platforms, then classify what was found at page and country level.

See Social Media for platform coverage, tier definitions, and downloadable evidence.
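
Page-level classification of social links can be sketched as a hostname lookup. The two host sets below are assumptions chosen for illustration; the actual platform lists and tier definitions are the ones published on the Social Media page, and the function names are hypothetical.

```python
from typing import Iterable, Optional, Set
from urllib.parse import urlparse

# Hypothetical platform lists for illustration only.
LEGACY_HOSTS = {"facebook.com", "x.com", "twitter.com", "instagram.com",
                "linkedin.com", "youtube.com"}
OPEN_HOSTS = {"mastodon.social", "bsky.app"}


def classify_link(url: str) -> Optional[str]:
    """Return 'legacy', 'open', or None for a single outbound link."""
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    if host in LEGACY_HOSTS:
        return "legacy"
    if host in OPEN_HOSTS:
        return "open"
    return None


def classify_page(links: Iterable[str]) -> Set[str]:
    """Return the set of platform categories linked from one page."""
    found = {classify_link(link) for link in links}
    found.discard(None)
    return found
```

Country-level results would then be an aggregation of these per-page sets. A real scanner would also need to recognise self-hosted instances of open platforms, which a fixed host list cannot do.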

URL Validation

We validate tracked URLs, follow redirects, and monitor persistent failures so the source dataset stays current.

See Scan Progress Report for current validation coverage and country-level results.
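
One way to model this bookkeeping is a per-URL record that is updated after each check. This is a sketch under stated assumptions: the record fields, the status labels, and the threshold of three consecutive failures are all hypothetical, and the HTTP request itself is left out so the logic stays testable offline.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: a URL counts as a persistent failure after this many
# consecutive failed checks. The project's real threshold is not stated.
FAILURE_THRESHOLD = 3


@dataclass
class UrlRecord:
    url: str
    consecutive_failures: int = 0
    status: str = "unknown"


def record_check(rec: UrlRecord, http_status: Optional[int],
                 final_url: Optional[str] = None) -> UrlRecord:
    """Update a record from one check; http_status=None means no response."""
    if http_status is not None and 200 <= http_status < 300:
        rec.consecutive_failures = 0
        if final_url and final_url != rec.url:
            # Keep the dataset current by storing the redirect target.
            rec.url = final_url
            rec.status = "redirected"
        else:
            rec.status = "reachable"
    else:
        rec.consecutive_failures += 1
        if rec.consecutive_failures >= FAILURE_THRESHOLD:
            rec.status = "persistent_failure"
        else:
            rec.status = "failed"
    return rec
```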

Technology Detection

We detect the CMS, framework, analytics, hosting, and other technologies used by government sites.

See Technology Scanning for the detected technologies and country tables.

Third-Party JavaScript

We track externally hosted scripts and services such as analytics tags, consent tools, CDNs, shared JavaScript libraries, and support widgets.

See Third-Party JavaScript for the EU-wide breakdown and evidence exports.
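
Identifying externally hosted scripts comes down to comparing each script's host with the host of the page that loads it. The sketch below uses only the Python standard library and is not the project's scanner; the class and function names are hypothetical. It treats any different hostname, including a sibling subdomain, as third-party, which is a simplifying assumption.

```python
from html.parser import HTMLParser
from typing import List
from urllib.parse import urljoin, urlparse


class ScriptCollector(HTMLParser):
    """Collect the src attribute of every <script> tag."""

    def __init__(self) -> None:
        super().__init__()
        self.sources: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def third_party_script_hosts(page_url: str, html: str) -> List[str]:
    """Return sorted hostnames of scripts not served from the page's host."""
    page_host = urlparse(page_url).hostname
    collector = ScriptCollector()
    collector.feed(html)
    hosts = set()
    for src in collector.sources:
        # urljoin resolves relative and protocol-relative URLs.
        host = urlparse(urljoin(page_url, src)).hostname
        if host and host != page_host:
            hosts.add(host)
    return sorted(hosts)
```

Scripts injected at runtime by other scripts would not appear in static HTML, so a scanner that needs them would have to render the page in a browser.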

Lighthouse Audits

We run Google Lighthouse on each government page and record five quality scores, each on a 0–100 scale: performance, accessibility, best practices, SEO, and PWA compliance.

See Lighthouse Scanning for full details.
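
Lighthouse's JSON report stores each category score as a fraction between 0 and 1 (or null when the category could not be scored), so recording a 0–100 value involves a small conversion. The sketch below assumes the report is already available as a JSON string; the function name is hypothetical, and which categories appear depends on the Lighthouse version and configuration used.

```python
import json
from typing import Dict, Optional


def category_scores(report_json: str) -> Dict[str, Optional[int]]:
    """Map each Lighthouse category id to a 0-100 score, or None if unscored."""
    report = json.loads(report_json)
    scores: Dict[str, Optional[int]] = {}
    for category_id, category in report.get("categories", {}).items():
        raw = category.get("score")
        scores[category_id] = None if raw is None else round(raw * 100)
    return scores


if __name__ == "__main__":
    # Hypothetical, heavily trimmed report for illustration.
    sample = json.dumps({
        "categories": {
            "performance": {"score": 0.93},
            "accessibility": {"score": 1},
            "seo": {"score": None},
        }
    })
    print(category_scores(sample))
```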

Countries Covered

The dataset covers all EU member states plus selected allied nations: United Kingdom, Switzerland, Iceland, Norway, and Canada.

See Government Domains for the full list of source domains and page URLs per country.

How the Scans Work

Scans run automatically on a schedule via GitHub Actions:

Scan	Schedule	Priority
Social Media	Every 3 hours	Highest — confirms reachability and collects social-link data in one pass
Technology Detection	On demand	Medium — run manually for new countries
URL Validation	Every 12 hours	Lowest — lightweight redirect/404 check; skipped for pages already confirmed reachable within 30 days
Lighthouse Audits	Weekly (Sundays 04:00 UTC)	Medium — slow per-URL (~5 s); weekly cadence keeps data fresh without overloading servers
Scan Progress Report	After every scan	—

After each scan run, this site is automatically updated with the latest results.
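
The skip rule for URL Validation in the table above can be expressed as a simple freshness check. This is a minimal sketch, assuming each page stores the timestamp of its last confirmed-reachable result; the function and constant names are hypothetical.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# From the schedule table: pages confirmed reachable within 30 days are skipped.
RECHECK_AFTER = timedelta(days=30)


def needs_validation(last_confirmed: Optional[datetime], now: datetime) -> bool:
    """True if the page has never been confirmed or the confirmation is stale."""
    if last_confirmed is None:
        return True
    return now - last_confirmed >= RECHECK_AFTER


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    print(needs_validation(now - timedelta(days=2), now))   # recently confirmed
    print(needs_validation(now - timedelta(days=45), now))  # stale confirmation
```

Because the Social Media scan confirms reachability as a side effect, a rule like this lets the lower-priority validation job focus on pages the other scanners have not reached recently.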

Accessing Scan Artifacts

Each GitHub Actions scan run uploads its results as a downloadable artifact:

1. Go to GitHub Actions
2. Click the relevant workflow
3. Open a completed run and scroll to the Artifacts section
4. Download the artifact to inspect the database, annotated TOON files, and scan logs

The Scan Progress Report is regenerated automatically, so most visitors should not need the raw artifacts unless they want to inspect the source outputs directly.

Source Code & Data

Scan data is collected by automated workflows and stored as GitHub Actions artifacts. The progress report is regenerated after every scan and committed directly to this site.