speech-generation

Star

Here are 14 public repositories matching this topic...

Wendison / VQMIVC

Star

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

speech voice-conversion one-shot disentanglement-learning speech-generation

Updated Apr 27, 2022
Jupyter Notebook

cortictechnology / cep

Sponsor

Star

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

raspberry-pi opencv iot natural-language-processing computer-vision deep-learning smarthome artificial-intelligence speech-recognition visual-programming edge-computing lego-mindstorms oak-d speech-generation

Updated May 17, 2022
JavaScript

ga642381 / SpeechGen

Star

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

deep-learning prompt speech-processing speech-generation large-language-models speech-llm

Updated Jun 9, 2023

ictnlp / NAST-S2x

Star

A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.

non-autoregressive simultaneous-translation speech-generation speech-to-speech-translation non-autoregressive-transformers

Updated Oct 22, 2024
Python

youngsheen / GPST

Star

[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer

moshi language-model autoregressive fairseq speech-generation

Updated Nov 1, 2024
Python

01Zhangbw / Speech-and-audio-papers-Top-Conference

Star

It includes papers on speech&audio field. Now update: ICLR2023-2025, ICML2023-2024, NeurIPS2023-2024, ACMMM2024, AAAI2024, ACL2024, EMNLP2024, NAACL2025, AAAI2025, IJCAI2024

audio text-to-speech speech tts speech-synthesis speech-recognition automatic-speech-recognition speech-processing audio-processing asr speech-enhancement speech-generation

Updated Apr 22, 2025

caizexin / GenVC

Star

Self-supervised Generative LM-based Voice Conversion

voice-conversion voice-anonymization voice-cloning speech-generation

Updated Apr 24, 2025
Python

Vidyut / vidyut-tts

Star

Streamlit frontend for Coqui-tts

text-to-speech tts speech-generation tts-frontend

Updated Apr 16, 2023
Python

nidhiyashwanth / SesameAILabs-csm

Star

A conversational speech model (CSM) that generates natural-sounding speech with context awareness and consistent audio quality. Supports multi-speaker conversations and maintains contextual understanding across turns, ensuring consistent audio output throughout the conversation.

moshi sesame context-aware csm conversational-ai speech-generation sesameailabs