GovData
by ethica.design
Data infrastructure

Public records, made queryable

GovData turns fragmented government data, scattered across thousands of incompatible formats, agencies, and eras, into structured, versioned, queryable datasets.

40+ source formats ingested
2.3M records normalised
<4hr from acquisition to query-ready

How it works

Universal ingestion

Automated scrapers and agency partnerships ingest 40+ source formats. Every record is checksummed at intake, so its provenance is always traceable.
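
To make "checksummed at intake" concrete, here is a minimal sketch of one way it could work. This is not GovData's actual implementation: the `IntakeRecord` type, its fields, and both function names are hypothetical, and SHA-256 is an assumed choice of hash.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class IntakeRecord:
    source_url: str   # hypothetical field: where the raw file came from
    sha256: str       # hex digest of the raw bytes exactly as received
    received_at: str  # ISO 8601 UTC timestamp of intake


def checksum_at_intake(raw: bytes, source_url: str) -> IntakeRecord:
    """Hash the untouched payload before any parsing or normalisation."""
    digest = hashlib.sha256(raw).hexdigest()
    now = datetime.now(timezone.utc).isoformat()
    return IntakeRecord(source_url=source_url, sha256=digest, received_at=now)


def verify(raw: bytes, record: IntakeRecord) -> bool:
    """Check that stored bytes still match what was received at intake."""
    return hashlib.sha256(raw).hexdigest() == record.sha256
```

Hashing the raw bytes, rather than the parsed record, means any later normalised version can be traced back to the exact file it came from.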

ML normalisation pipeline

Records are classified, deduplicated, and schematised automatically. Edge cases surface to a human review queue rather than being silently dropped.
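
The routing described above might look roughly like the sketch below. It is deliberately simplified: the `confidence` field stands in for a classifier score, and the 0.8 threshold and the title-plus-date dedup key are illustrative assumptions, not details of the real pipeline.

```python
from dataclasses import dataclass, field


@dataclass
class Triage:
    accepted: list = field(default_factory=list)
    review_queue: list = field(default_factory=list)


def normalise_key(record: dict) -> tuple:
    """Hypothetical dedup key: case- and whitespace-insensitive title plus date."""
    title = " ".join(str(record.get("title", "")).lower().split())
    return (title, record.get("date"))


def triage(records: list[dict], min_confidence: float = 0.8) -> Triage:
    """Deduplicate confident records; queue uncertain ones for human review."""
    out = Triage()
    seen = set()
    for rec in records:
        # A missing score is treated as 0.0, so the record goes to review.
        if rec.get("confidence", 0.0) < min_confidence:
            out.review_queue.append(rec)  # surfaced to a human, never dropped
            continue
        key = normalise_key(rec)
        if key in seen:
            continue  # duplicate of a record that was already accepted
        seen.add(key)
        out.accepted.append(rec)
    return out
```

The only records discarded are duplicates of ones already accepted. Anything the classifier is unsure about lands in `review_queue`.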

Versioned API

Schema-first, fully versioned. Consumers can pin to a release and receive breaking-change notifications before anything shifts under them.
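
As an illustration of pinning, suppose releases are labelled `vMAJOR.MINOR` and a major bump signals a breaking change. That labelling is an assumption made for this sketch, not something documented here, and `Release` and `pending_breaking_changes` are hypothetical names. A consumer could then list the releases it should be warned about like this:

```python
from typing import NamedTuple


class Release(NamedTuple):
    major: int
    minor: int

    @classmethod
    def parse(cls, text: str) -> "Release":
        """Parse a label such as 'v2.3' (the leading 'v' is optional)."""
        major, minor = text.removeprefix("v").split(".")
        return cls(int(major), int(minor))


def pending_breaking_changes(pinned: str, available: list[str]) -> list[Release]:
    """Releases with a higher major version than the pinned one, oldest first."""
    pin = Release.parse(pinned)
    newer = (Release.parse(label) for label in available)
    return sorted(r for r in newer if r.major > pin.major)
```

A consumer pinned to `v2.3` keeps receiving that schema. The returned list is what a notification would announce before the consumer chooses to move.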

Natural language search

The explorer maps plain-language queries to structured filters. Designed for researchers who know what they want and journalists who don't yet.
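
The idea of mapping a query to filters can be shown with a small rule-based stand-in. The real explorer is not described in enough detail to reproduce, so the filter names (`year_from`, `year_to`, `record_type`) and the tiny vocabulary below are purely illustrative.

```python
import re

# Hypothetical vocabulary; a real explorer would derive this from the dataset schema.
RECORD_TYPES = {
    "budget": "budget",
    "budgets": "budget",
    "contract": "contract",
    "contracts": "contract",
}


def parse_query(text: str) -> dict:
    """Map a plain-language query to a dict of structured filters."""
    filters: dict = {}
    lowered = text.lower()

    # A year range such as "2015 to 2019" or "2015-2019".
    span = re.search(r"\b((?:19|20)\d{2})\s*(?:-|to)\s*((?:19|20)\d{2})\b", lowered)
    if span:
        filters["year_from"] = int(span.group(1))
        filters["year_to"] = int(span.group(2))
    else:
        # Otherwise, a single year sets both ends of the range.
        year = re.search(r"\b(?:19|20)\d{2}\b", lowered)
        if year:
            filters["year_from"] = filters["year_to"] = int(year.group(0))

    # The first recognised record-type word wins.
    for word in re.findall(r"[a-z]+", lowered):
        if word in RECORD_TYPES:
            filters["record_type"] = RECORD_TYPES[word]
            break
    return filters
```

For example, `parse_query("school budgets from 2015 to 2019")` yields a year range and a record type. Words the rules do not recognise are simply ignored, so an unrecognised query returns an empty filter set instead of failing.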