Fine-tuning CosyVoice3 on Bengali podcast audio with emotion and persona instruction control. Covers the full pipeline: data collection, Gemini enrichment, ASR verification, quality filtering, and Gradio inference UI.
nlp text-to-speech deep-learning pytorch tts speech-synthesis bengali gradio low-resource-languages bengali-natural-language-processing gradio-interface emotion-tts cosyvoice3
-
Updated
Apr 20, 2026