Skip to content

Weimar Jazz Database Ingestion#460

Open
SCN-MNG wants to merge 12 commits intomainfrom
wjazzd_ingestion
Open

Weimar Jazz Database Ingestion#460
SCN-MNG wants to merge 12 commits intomainfrom
wjazzd_ingestion

Conversation

@SCN-MNG
Copy link
Copy Markdown
Contributor

@SCN-MNG SCN-MNG commented Jan 16, 2026

This PR introduces the configuration files and documentation necessary to ingest Weimar Jazz Database (wJazzd) data into our pipeline. No new Python scripts were added in this PR; ingestion relies on existing shared scripts and configuration files.

OpenRefine Configuration Files (wjazzd/openrefine/)

  • wjazzd/openrefine/history/composition_info_history.json
  • wjazzd/openrefine/history/record_info_history.json
  • wjazzd/openrefine/history/solo_info_history.json
  • wjazzd/openrefine/history/track_info_history.json
    These files allow users to automatically reapply the same reconciliation steps used during OpenRefine processing.

RDF Conversion Config

  • shared/rdf_config/wjazzd.toml
    Specifies RDF property mapping and RDF graph schema. Allow wjazzd to be converted into RDF using the shared conversion pipeline (shared/rdfconv/convert.py).

Documentation

  • wjazzd/README.md
    Provides instructions for wJazzd ingestion and overview of the database schema.
  • wjazzd/doc/reconciliation_procedures.md
    Detailed step-by-step reconciliation procedures for compositions, records, solos, and tracks.

Other

  • shared/rdf_conversion/using_rdfconv_script.md
    Fix a small mistake in shared documentation.
  • wjazzd/.gitignore

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds configuration files and documentation for ingesting the Weimar Jazz Database (wJazzd) into the LinkedMusic pipeline. The changes enable automated data reconciliation using OpenRefine and RDF conversion using existing shared infrastructure.

Changes:

  • Added OpenRefine history JSON files for automated reconciliation of composition, record, solo, and track data
  • Added RDF conversion configuration (wjazzd.toml) specifying property mappings and RDF schema
  • Added comprehensive documentation for ingestion workflow and reconciliation procedures
  • Fixed command syntax error in shared RDF conversion documentation

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 7 comments.

Show a summary per file
File Description
wjazzd/openrefine/history/track_info_history.json OpenRefine reconciliation steps for track information including lineup, instruments, and recording dates
wjazzd/openrefine/history/solo_info_history.json OpenRefine reconciliation steps for solo performances including performers, instruments, and musical properties
wjazzd/openrefine/history/record_info_history.json OpenRefine reconciliation steps for record/album information including artists and labels
wjazzd/openrefine/history/composition_info_history.json OpenRefine reconciliation steps for composition metadata including composers and templates
wjazzd/doc/reconciliation_procedures.md Step-by-step guide for data reconciliation procedures
wjazzd/README.md Overview of wJazzd database and ingestion workflow
shared/rdf_config/wjazzd.toml RDF property mappings and conversion configuration
shared/rdf_conversion/using_rdfconv_script.md Fixed missing --input flag in command examples
wjazzd/.gitignore Excludes data directory from version control

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

SCN-MNG and others added 6 commits January 16, 2026 10:53
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants