Nabu is the central data engineering tool for geoconnex. It is a CLI for:
- crawling remote JSON-LD documents from a remote sitemap and storing them into an S3 bucket
- preparing data for ingestion into a graph database by:
  - validating RDF data against SHACL shapes
  - converting JSON-LD documents from the S3 bucket into N-Quad files
  - enriching N-Quads with additional hydrologic metadata such as mainstem identifiers
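
As a rough illustration of the first step in the list above, the sketch below reads a sitemap and lists the document URLs a crawler would then fetch. It is not Nabu's actual implementation: the `urlSet` type, the `sitemapLocs` function, and the `example.org` URLs are hypothetical names chosen for this example, and it assumes a plain sitemaps.org `<urlset>` document rather than a sitemap index.

```go
package main

import (
	"encoding/xml"
	"fmt"
	"strings"
)

// urlSet mirrors the <urlset> root of a sitemaps.org sitemap.
// Hypothetical type for illustration only; not taken from the Nabu codebase.
type urlSet struct {
	URLs []struct {
		Loc string `xml:"loc"`
	} `xml:"url"`
}

// sitemapLocs returns every <loc> value found in the sitemap document.
func sitemapLocs(doc string) ([]string, error) {
	var set urlSet
	if err := xml.NewDecoder(strings.NewReader(doc)).Decode(&set); err != nil {
		return nil, err
	}
	locs := make([]string, 0, len(set.URLs))
	for _, u := range set.URLs {
		// Sitemaps often pad <loc> with whitespace, so trim it.
		locs = append(locs, strings.TrimSpace(u.Loc))
	}
	return locs, nil
}

func main() {
	// Inline sample sitemap; a real crawl would download this over HTTP.
	const example = `<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.org/features/1</loc></url>
  <url><loc>https://example.org/features/2</loc></url>
</urlset>`

	locs, err := sitemapLocs(example)
	if err != nil {
		panic(err)
	}
	for _, loc := range locs {
		fmt.Println(loc)
	}
}
```

Each URL returned this way would point at a page carrying a JSON-LD document; fetching those pages and writing them to S3 is left out to keep the sketch self-contained.
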
For more technical details see the docs folder.
See the examples directory for example CLI usage.

To run Nabu with Docker:

    docker run internetofwater/nabu:latest

To build from source:

    git clone https://github.com/internetofwater/nabu
    cd nabu
    go build ./cmd/nabu
    ./nabu --help

This repo is a completely rewritten fork of the gleanerio Nabu repo.