Skip to content

Site components

Susan Valente edited this page Apr 17, 2026 · 26 revisions

This is a description of all the components that make up Data.gov.

The public-facing homepage of Data.gov, fondly known as "www" or "main site". Contains resources for the public and agencies about open data. Hosted on cloud.gov Pages.

Custom Python application providing dataset discovery and search across Federal, State, Municipal, university, and tribal datasets. Serves 515,000+ datasets from 120+ publishing organizations.

Automated ingestion service that pulls agency metadata into the catalog on a scheduled basis. Manages harvest sources and job runs for all publishing organizations.

Used by federal agencies to create and manage metadata for their datasets. Generates the agency's data.json file for harvest into the catalog.

Includes resources.data.gov and strategy.data.gov. Hosted on cloud.gov Pages.

api.data.gov

GSA's shared API gateway. Data.gov APIs are accessible via api.data.gov in addition to their direct endpoints.

Clone this wiki locally