Skip to content

internetofwater/nabu

Repository files navigation

Nabu

codecov goreportcard status

Nabu is the central data engineering tool for geoconnex. It is a CLI for

  • crawling remote JSON-LD documents from a remote sitemap and storing them into an S3 bucket
  • preparing data for ingestion into a graph database by:
    • validating RDF data against SHACL shapes
    • converting JSON-LD documents from the S3 bucket into N-Quad files
    • enriching N-Quads with additional hydrologic metadata such as mainstem identifiers

For more technical details see the docs folder.

See the examples directory for example CLI usage.

Installation

Docker

docker run internetofwater/nabu:latest

Native Binary

git clone https://github.com/internetofwater/nabu
cd nabu
go build ./cmd/nabu
./nabu --help

Fork Details

This repo is a completely rewritten fork of the gleanerio Nabu repo

About

Crawl and prepare RDF data for loading into a triplestore

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors 6