Skip to content

Add Common Voice dataset loader #4

@dnvt

Description

@dnvt

Common Voice is a large multilingual speech corpus with sentence-level transcripts. Adding a loader would enable training on languages beyond English.

Scope:

  • Parse Common Voice TSV metadata (validated.tsv)
  • Load MP3 audio files via symphonia
  • Map to TrainingSample format
  • Handle the flat clips/ directory structure

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions