Metadata Schema

Development of a metadata schema for experimental data, specifically electrochemical and electrocatalytic data.

Install

Install pixi and get a copy of the metadata-schema:

git clone https://github.com/echemdb/metadata-schema.git
cd metadata-schema

Usage

Convert metadata to Excel/CSV

The mdstools package provides tools to convert nested YAML metadata into flat Excel/CSV formats with optional schema-based enrichment (descriptions and examples from JSON schemas).

Convert a YAML file to enriched Excel and CSV:

pixi run convert tests/example_metadata.yaml

This creates three files in generated/:

example_metadata.csv - Flat CSV with all metadata
example_metadata.xlsx - Single-sheet Excel file
example_metadata_sheets.xlsx - Multi-sheet Excel (one sheet per top-level key)

All exported files include Description and Example columns populated from the JSON schemas, making it easier for users to understand and fill out the metadata templates.

Options

pixi run convert <yaml_file> [--schema-dir DIR] [--output-dir DIR] [--no-enrichment]

--schema-dir - Directory with JSON schemas (default: schemas)
--output-dir - Output directory (default: generated)
--no-enrichment - Disable enrichment (no Description/Example columns)

Convert Excel/CSV back to YAML (CLI)

pixi run unflatten generated/example_metadata.xlsx --schema-file schemas/schema_pieces/minimum_echemdb.json

Python API

The mdstools package can also be used programmatically:

from mdstools.metadata.metadata import Metadata
from mdstools.metadata.enriched_metadata import EnrichedFlattenedMetadata

# Load YAML metadata
metadata = Metadata.from_yaml('metadata.yaml')

# Flatten to tabular format
flattened = metadata.flatten()

# Add schema enrichment (descriptions and examples)
enriched = EnrichedFlattenedMetadata(flattened.rows, schema_dir='schemas')

# Get enriched DataFrame
df = enriched.to_pandas()

# Export to various formats
enriched.to_csv('output.csv')
enriched.to_excel('output.xlsx')
enriched.to_excel('output_multi.xlsx', separate_sheets=True)  # One sheet per top-level key
enriched.to_markdown('output.md')

You can also load a flat Excel/CSV file, reconstruct the nested dict, and optionally write YAML. This workflow expects columns named Number, Key, and Value and is intended for unflattening back to dict/YAML. An enriched Excel can also be loaded.

from mdstools.metadata.flattened_metadata import FlattenedMetadata

flattened = FlattenedMetadata.from_excel("generated/example_metadata.xlsx")
metadata = flattened.unflatten()

data = metadata.data  # Nested dict
metadata.to_yaml("generated/example_metadata.yaml")

Developer

Run tests

pixi run test              # Run all tests
pixi run doctest           # Run doctests only
pixi run test-comprehensive # Run integration tests only

or all

pixi run -e dev test-all

Resolve schemas

Generate resolved (single-file) JSON schemas from the modular schema pieces:

pixi run resolve-schemas

This resolves all $ref references and writes the combined schemas to schemas/.

After intentional changes to schema pieces, update the expected baseline files:

pixi run update-expected-schemas

Validate schema files

To validate the example files against the JSON schemas:

pixi run validate

Name		Name	Last commit message	Last commit date
Latest commit History 280 Commits
.github/workflows		.github/workflows
doc/news		doc/news
examples		examples
mdstools		mdstools
schemas		schemas
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
.pylintrc		.pylintrc
ChangeLog		ChangeLog
LICENSE		LICENSE
claude.md		claude.md
pixi.lock		pixi.lock
pyproject.toml		pyproject.toml
readme.md		readme.md
rever.xsh		rever.xsh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Metadata Schema

Install

Usage

Convert metadata to Excel/CSV

Options

Convert Excel/CSV back to YAML (CLI)

Python API

Developer

Run tests

Resolve schemas

Validate schema files

About

Uh oh!

Releases 8

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

echemdb/metadata-schema

Folders and files

Latest commit

History

Repository files navigation

Metadata Schema

Install

Usage

Convert metadata to Excel/CSV

Options

Convert Excel/CSV back to YAML (CLI)

Python API

Developer

Run tests

Resolve schemas

Validate schema files

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages