Transform any text into natural speech using your own voice or any reference audio
Bhavesh AI Voice Cloner is an open-source Text-to-Speech (TTS) and voice cloning system. Given a few seconds of reference audio, it clones a voice and generates natural-sounding speech with adjustable emotional intensity and clear pronunciation.
Key features:
- Zero-shot Voice Cloning: Clone a voice from a few seconds of reference audio
- Emotion Control: Adjust emotional intensity and expression levels
- Lightning Fast: Optimized for real-time generation
- 0.5B Llama Backbone: Built on a 0.5B-parameter Llama transformer backbone
- High Quality Audio: Clear, natural-sounding speech output
- Ethical AI: Built-in watermarking for responsible AI usage
- Language Support: Currently English, with more languages planned
- Easy to Use: Simple API and a Streamlit interface

Use cases:
- Content Creation: Videos, podcasts, audiobooks
- Gaming: Character voices and narration
- Applications: Voice assistants and chatbots
- Entertainment: Memes, voice effects, creative projects
- Education: Language learning and accessibility tools

# Clone the repository
git clone https://github.com/bhau23/voice-cloner.git
cd voice-cloner
# Run the setup script
python launch.py setup
# Launch the Streamlit app
python launch.py streamlit

# Clone the repository
git clone https://github.com/bhau23/voice-cloner.git
cd voice-cloner
# Install dependencies
pip install -r requirements.txt
# Run the Streamlit app
streamlit run streamlit_app.py

# Clone the repository and launch with Docker
git clone https://github.com/bhau23/voice-cloner.git
cd voice-cloner
python launch.py docker

# Install the package with pip
pip install bhavesh-ai-voice-cloner

import torchaudio as ta
from bhavesh_ai_voice_cloner.tts import BhaveshTTS
# Initialize the model
model = BhaveshTTS.from_pretrained(device="cuda")
# Generate speech with default voice
text = "Hello! This is Bhavesh AI speaking. I can clone any voice and make it sound natural!"
wav = model.generate(text)
ta.save("output.wav", wav, model.sr)
# Clone a specific voice
REFERENCE_AUDIO = "path/to/your/reference.wav"
wav = model.generate(text, audio_prompt_path=REFERENCE_AUDIO)
ta.save("cloned_voice.wav", wav, model.sr)

For a user-friendly interface, run the Streamlit app:
# Quick launch
python launch.py streamlit
# Or traditional method
streamlit run streamlit_app.py

Generate speech directly from the command line:
python launch.py cli "Your text here"

Run the comprehensive demo:
python launch.py demo

python launch.py setup
python verify_project.py
python github_setup.py

- exaggeration (0.25-2.0): Controls emotional intensity (default: 0.5)
- cfg_weight (0.0-1.0): Controls speech pacing and clarity (default: 0.5)
- temperature (0.05-5.0): Controls randomness in generation (default: 0.8)
- repetition_penalty (1.0-2.0): Discourages repetitive speech patterns (default: 1.2)
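
The parameter ranges and defaults listed above can be checked before a generation call. The following is a minimal sketch, not part of the package: `PARAM_SPECS` and `build_generation_kwargs` are hypothetical names introduced here for illustration, and it assumes that `model.generate` accepts these four settings as keyword arguments, which this README does not show explicitly.

```python
# Documented (min, max, default) for each setting, copied from the list above.
PARAM_SPECS = {
    "exaggeration": (0.25, 2.0, 0.5),
    "cfg_weight": (0.0, 1.0, 0.5),
    "temperature": (0.05, 5.0, 0.8),
    "repetition_penalty": (1.0, 2.0, 1.2),
}


def build_generation_kwargs(**overrides):
    """Merge overrides with the documented defaults and validate the ranges.

    Hypothetical helper for illustration; it is not provided by the package.
    """
    unknown = set(overrides) - set(PARAM_SPECS)
    if unknown:
        raise ValueError(f"Unknown parameter(s): {sorted(unknown)}")

    kwargs = {}
    for name, (low, high, default) in PARAM_SPECS.items():
        value = float(overrides.get(name, default))
        if not low <= value <= high:
            raise ValueError(
                f"{name}={value} is outside the documented range {low}-{high}"
            )
        kwargs[name] = value
    return kwargs


if __name__ == "__main__":
    # Override two settings; the other two keep their documented defaults.
    settings = build_generation_kwargs(exaggeration=0.7, temperature=0.6)
    print(settings)
    # Assumed usage, if model.generate accepts these keyword arguments:
    # wav = model.generate(text, audio_prompt_path=REFERENCE_AUDIO, **settings)
```

Validating up front means an out-of-range value fails with a clear message before any model is loaded, instead of surfacing later as odd-sounding audio.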
English is currently the only supported language; support for more languages is planned.
We welcome contributions! Please see our Contributing Guidelines for details.
This project builds upon excellent open-source work:
Every audio file generated by Bhavesh AI carries a built-in watermark for responsible AI usage. The watermark helps identify AI-generated content and discourages misuse.
import perth
import librosa
# Load the generated audio
audio, sr = librosa.load("generated_audio.wav", sr=None)
# Initialize watermarker
watermarker = perth.PerthImplicitWatermarker()
# Check for watermark
watermark = watermarker.get_watermark(audio, sample_rate=sr)
print(f"AI Generated: {watermark > 0.5}")

- Bug Reports: GitHub Issues
- Discussions: GitHub Discussions
- Contact: bhavesh23official@gmail.com
This project is licensed under the MIT License - see the LICENSE file for details.
If you use this project in your research, please cite:
@misc{bhaveshaivoicecloner2025,
author = {{Bhavesh AI}},
title = {{Bhavesh AI Voice Cloner}},
year = {2025},
howpublished = {\url{https://github.com/bhau23/voice-cloner}},
note = {GitHub repository}
}

This tool is intended for legitimate and ethical uses only. Users are responsible for ensuring they have proper consent before cloning someone's voice. Please use this technology responsibly and in compliance with applicable laws.
Made with ❤️ by Bhavesh AI
GitHub • Try Online • Contact