A full-stack AI voice synthesis application with RAG (Retrieval-Augmented Generation) capabilities, featuring celebrity voice cloning using TTS and RVC technology.
- Frontend: https://Zane-Dev14.github.io/ClosedAIv2/
- Backend API: https://closedaiv2.onrender.com (after deployment)
- Frontend: React + TypeScript + Vite + Tailwind CSS + Framer Motion
- Backend: FastAPI + Python + OpenAI + ChromaDB
- AI Models: Edge TTS + RVC (Retrieval-based Voice Conversion)
- RAG System: OpenAI + ChromaDB for speech pattern enhancement
- Voice Cloning: Transform text into celebrity voices (Obama, Trump, Modi, SRK, etc.)
- RAG Enhancement: AI-powered text enhancement using speech patterns
- Real-time Synthesis: Fast voice generation with progress tracking
- Modern UI: Beautiful, responsive interface with 3D animations
- Production Ready: Optimized for deployment on GitHub Pages and Render
The frontend is automatically deployed to GitHub Pages:
cd frontend
npm run deploy
To deploy the backend on Render:
- Fork this repository
- Go to Render Dashboard
- Create a new Web Service
- Connect your GitHub repository
- Set environment variables:
  - OPENAI_API_KEY: your OpenAI API key
- Deploy!
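How the backend reads its configuration is not shown in this README. As a minimal sketch, assuming the variables are read from the process environment at startup, a fail-fast check might look like the following. The `load_settings` helper, the fallback port, and the `development` default are hypothetical; only the variable names `OPENAI_API_KEY`, `PORT`, and `ENVIRONMENT` come from this README.

```python
import os


def load_settings(env=None) -> dict:
    """Collect backend settings from environment variables.

    Hypothetical helper: the real main.py may load its settings differently.
    """
    env = os.environ if env is None else env

    # Fail early with a clear message instead of failing on the first OpenAI call.
    api_key = env.get("OPENAI_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("OPENAI_API_KEY is not set")

    return {
        "openai_api_key": api_key,
        # 10000 matches the PORT value in this README's .env example;
        # using it as a fallback is an assumption.
        "port": int(env.get("PORT", "10000")),
        # The "development" default is an assumption.
        "environment": env.get("ENVIRONMENT", "development"),
    }
```

Passing a plain dict instead of `os.environ` makes the check easy to exercise locally before deploying.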
- Python 3.11+
- Node.js 18+
- OpenAI API Key
pip install -r requirements.txt
cp .env.example .env
# Edit .env with your OpenAI API key
python main.py
cd frontend
npm install
npm run dev
ClosedAIv2/
├── frontend/ # React frontend
├── minimal_tts_rvc/ # TTS and RVC models
├── models/ # Voice models (gitignored)
├── output/ # Generated audio (gitignored)
├── speech_documents/ # RAG training data
├── speech_patterns_db/ # ChromaDB vector store
├── main.py # FastAPI backend
├── rag_system.py # RAG implementation
└── requirements.txt # Python dependencies
Create a .env file in the root directory:
OPENAI_API_KEY=your_openai_api_key_here
PORT=10000
ENVIRONMENT=production
- GET /: API documentation
- GET /models: List available voice models
- GET /validate: Validate system setup
- POST /synthesize: Generate voice synthesis
- GET /audio/{filename}: Serve generated audio files
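The endpoints above can be called from any HTTP client. The sketch below uses only the Python standard library and assumes the API exchanges JSON. The helper names are hypothetical, and the request body for `POST /synthesize` is not documented here, so any payload fields (such as `text` or `model`) are placeholders to check against `main.py`.

```python
import json
import urllib.parse
import urllib.request

# Backend URL from this README; replace with http://localhost:10000 for local runs.
BASE_URL = "https://closedaiv2.onrender.com"


def endpoint(path: str, base_url: str = BASE_URL) -> str:
    """Join the base URL and a path without doubling slashes."""
    return base_url.rstrip("/") + "/" + path.lstrip("/")


def audio_url(filename: str, base_url: str = BASE_URL) -> str:
    """URL for GET /audio/{filename}, with the filename percent-encoded."""
    return endpoint("audio/" + urllib.parse.quote(filename, safe=""), base_url)


def get_json(path: str, base_url: str = BASE_URL, timeout: float = 30.0):
    """GET a path and decode the body, assuming the response is JSON."""
    with urllib.request.urlopen(endpoint(path, base_url), timeout=timeout) as resp:
        return json.load(resp)


def build_synthesize_request(payload: dict, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a POST /synthesize request with a JSON body.

    The payload schema is an assumption; this README does not specify it.
    """
    return urllib.request.Request(
        endpoint("synthesize", base_url),
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

For example, `get_json("/models")` would fetch the model list, and a request built with `build_synthesize_request({"text": "Hello"})` can be sent with `urllib.request.urlopen`. Keeping request construction separate from sending makes the payload easy to inspect before it reaches the server.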
- Fork the repository
- Create a feature branch
- Make your changes
- Test thoroughly
- Submit a pull request
MIT License - see LICENSE file for details.
- Edge TTS for text-to-speech
- RVC for voice cloning
- OpenAI for RAG capabilities
- ChromaDB for vector storage