GovData
by ethica.design
Data infrastructure
Public records,
made queryable
Fragmented government data, scattered across incompatible formats, agencies, and eras, is transformed into structured, versioned, queryable datasets.
40+
source formats ingested
2.3M
records normalised
<4hr
acquisition to query-ready
How it works
Universal ingestion
Automated scrapers and agency partnerships supply records in 40+ source formats. Every record is checksummed at intake, so its provenance stays traceable.
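As a rough illustration of checksumming at intake, the sketch below hashes the raw bytes of a record and stores the digest alongside where and when it was fetched. The `IntakeRecord` type, the `ingest` and `verify` helpers, and the choice of SHA-256 are assumptions made for this example, not a description of GovData's internals.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class IntakeRecord:
    """Hypothetical intake envelope: raw payload plus provenance fields."""
    source_url: str
    sha256: str
    fetched_at: str
    payload: bytes


def ingest(source_url: str, payload: bytes) -> IntakeRecord:
    # Hash the bytes exactly as received, before any parsing or cleanup,
    # so the digest always refers to the original source material.
    digest = hashlib.sha256(payload).hexdigest()
    fetched_at = datetime.now(timezone.utc).isoformat()
    return IntakeRecord(source_url, digest, fetched_at, payload)


def verify(record: IntakeRecord) -> bool:
    # Recompute the digest later to confirm the stored payload is unchanged.
    return hashlib.sha256(record.payload).hexdigest() == record.sha256


if __name__ == "__main__":
    rec = ingest("https://example.gov/permits.csv", b"id,title\n1,Permit A\n")
    print(rec.sha256[:12], verify(rec))
```

Hashing before normalisation is the design choice that matters here: any later transformation can be traced back to a payload whose integrity is checkable.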
ML normalisation pipeline
Records are classified, deduplicated, and schematised automatically. Edge cases surface to a human review queue rather than being silently dropped.
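The routing idea can be sketched in a few lines: duplicates are counted, confident classifications are accepted, and everything else goes to a review queue instead of being discarded. The `classify` function below is a keyword rule standing in for a real model, and `REVIEW_THRESHOLD`, `dedup_key`, and `PipelineResult` are hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field

REVIEW_THRESHOLD = 0.8  # hypothetical cut-off; a real pipeline would tune this


def classify(record: dict) -> tuple[str, float]:
    # Stand-in for an ML classifier: returns (label, confidence).
    title = record.get("title", "").lower()
    if "permit" in title:
        return "permit", 0.95
    if "budget" in title:
        return "budget", 0.90
    return "unknown", 0.30


def dedup_key(record: dict) -> tuple[str, str]:
    # Normalise case and whitespace so trivially different copies collide.
    title = " ".join(record.get("title", "").lower().split())
    return title, record.get("agency", "").lower()


@dataclass
class PipelineResult:
    accepted: list = field(default_factory=list)
    review_queue: list = field(default_factory=list)
    duplicates: int = 0


def normalise(records: list[dict]) -> PipelineResult:
    result = PipelineResult()
    seen: set[tuple[str, str]] = set()
    for record in records:
        key = dedup_key(record)
        if key in seen:
            result.duplicates += 1
            continue
        seen.add(key)
        label, confidence = classify(record)
        row = {**record, "category": label, "confidence": confidence}
        # Low-confidence rows are queued for a human, never dropped.
        if confidence >= REVIEW_THRESHOLD:
            result.accepted.append(row)
        else:
            result.review_queue.append(row)
    return result
```

Every input record ends up in exactly one of three places (accepted, queued for review, or counted as a duplicate), which makes the "nothing silently dropped" property easy to check.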
Versioned API
Schema-first, fully versioned. Consumers can pin to a release and receive breaking-change notifications before anything shifts under them.
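To make pinning concrete, here is a minimal sketch of how a consumer might compare its pinned release against newly published ones and flag those that would break it. The `v<major>.<minor>` naming and the rule that a higher major number means a breaking change are assumptions borrowed from common versioning practice; the source does not specify GovData's actual scheme.

```python
from dataclasses import dataclass


@dataclass(frozen=True, order=True)
class Release:
    """Hypothetical release identifier of the form v<major>.<minor>."""
    major: int
    minor: int

    @classmethod
    def parse(cls, text: str) -> "Release":
        major, minor = text.removeprefix("v").split(".")
        return cls(int(major), int(minor))


def is_breaking(pinned: Release, candidate: Release) -> bool:
    # Assumption: only a major-version bump changes the schema incompatibly.
    return candidate.major > pinned.major


def pending_breaking_changes(pinned: Release, published: list[Release]) -> list[Release]:
    # Releases a pinned consumer should be warned about before migrating.
    return sorted(r for r in published if is_breaking(pinned, r))


if __name__ == "__main__":
    pinned = Release.parse("v1.2")
    published = [Release.parse(v) for v in ("v1.2", "v1.3", "v2.0")]
    for release in pending_breaking_changes(pinned, published):
        print(f"breaking change ahead: v{release.major}.{release.minor}")
```

A consumer pinned to `v1.2` keeps receiving `v1.2` data; the check above only tells it which newer releases need attention before it moves its pin.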
Natural language search
The explorer maps plain-language queries to structured filters. Designed for researchers who know what they want and journalists who don't yet.
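The mapping from plain language to structured filters can be illustrated with a deliberately simple keyword-and-regex sketch. The explorer itself may use a very different technique; the `RECORD_TYPES` vocabulary, the filter names (`record_type`, `year_from`, `year_to`), and the `parse_query` function are all hypothetical.

```python
import re

# Hypothetical vocabulary mapping query words to a canonical record type.
RECORD_TYPES = {
    "permit": "permit", "permits": "permit",
    "budget": "budget", "budgets": "budget",
    "contract": "contract", "contracts": "contract",
}


def parse_query(query: str) -> dict:
    text = query.lower()
    filters: dict = {}

    # The first recognised record-type word wins.
    for word in re.findall(r"[a-z]+", text):
        if word in RECORD_TYPES:
            filters["record_type"] = RECORD_TYPES[word]
            break

    # "since 2019" sets an open-ended range; otherwise any years mentioned
    # bound the range on both sides.
    since = re.search(r"\bsince (\d{4})\b", text)
    years = [int(y) for y in re.findall(r"\b(?:19|20)\d{2}\b", text)]
    if since:
        filters["year_from"] = int(since.group(1))
    elif years:
        filters["year_from"] = min(years)
        filters["year_to"] = max(years)
    return filters


if __name__ == "__main__":
    print(parse_query("Building permits since 2019"))
    print(parse_query("school budgets between 2015 and 2020"))
```

A query that matches nothing returns an empty filter set, which leaves room for the exploratory use the explorer is designed for: start broad, then narrow.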