Skip to content
Change the repository type filter

All

    Repositories list

    • [ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates
      Python
      MIT License
      54510Updated Apr 13, 2026Apr 13, 2026
    • Python
      Apache License 2.0
      0200Updated Apr 1, 2026Apr 1, 2026
    • Python
      0000Updated Apr 1, 2026Apr 1, 2026
    • TaDiCodec

      Public
      This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diff…
      Python
      37730Updated Jan 25, 2026Jan 25, 2026
    • SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)
      Python
      47130Updated Dec 23, 2025Dec 23, 2025
    • AnyAccomp

      Public
      AnyAccomp: Generalizable accompaniment generation for vocals and solo instruments, powered by a quantized melodic bottleneck.
      Python
      MIT License
      23630Updated Dec 22, 2025Dec 22, 2025
    • SA-Eval

      Public
      0800Updated Mar 20, 2025Mar 20, 2025
    • debatts

      Public
      HTML
      1500Updated Dec 23, 2024Dec 23, 2024
    • SD-Eval

      Public
      [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
      Python
      Apache License 2.0
      25610Updated Jun 25, 2024Jun 25, 2024
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.