We have a Jenkins job that grabs the list of UK gov departments and uploads to PMD so that we can have a URI and a label for each dataset publisher. This needs to be synced into the corresponding Airtable table for linking from the ETL records.
We also have a separate Python/Docker app that pulls JSON info for all stats datasets published on gov.uk via the Whitehall API endpoints. The data is synced into a MariaDB instance running on ons.floop.org.uk. We could sync it up to a gov.uk specific read-only table in Airtable so that BAs can cross reference their ETL records.