Skip to content

Zane-Dev14/ClosedAIv2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ClosedAIv2 - AI Voice Synthesis Studio

A full-stack AI voice synthesis application with RAG (Retrieval-Augmented Generation) capabilities, featuring celebrity voice cloning using TTS and RVC technology.

🚀 Live Demo

🏗️ Architecture

  • Frontend: React + TypeScript + Vite + Tailwind CSS + Framer Motion
  • Backend: FastAPI + Python + OpenAI + ChromaDB
  • AI Models: Edge TTS + RVC (Realistic Voice Cloning)
  • RAG System: OpenAI + ChromaDB for speech pattern enhancement

🎯 Features

  • Voice Cloning: Transform text into celebrity voices (Obama, Trump, Modi, SRK, etc.)
  • RAG Enhancement: AI-powered text enhancement using speech patterns
  • Real-time Synthesis: Fast voice generation with progress tracking
  • Modern UI: Beautiful, responsive interface with 3D animations
  • Production Ready: Optimized for deployment on GitHub Pages and Render

🚀 Deployment

Frontend (GitHub Pages)

The frontend is automatically deployed to GitHub Pages:

cd frontend
npm run deploy

Backend (Render)

  1. Fork this repository
  2. Go to Render Dashboard
  3. Create a new Web Service
  4. Connect your GitHub repository
  5. Set environment variables:
    • OPENAI_API_KEY: Your OpenAI API key
  6. Deploy!

🛠️ Local Development

Prerequisites

  • Python 3.11+
  • Node.js 18+
  • OpenAI API Key

Backend Setup

pip install -r requirements.txt
cp .env.example .env
# Edit .env with your OpenAI API key
python main.py

Frontend Setup

cd frontend
npm install
npm run dev

📁 Project Structure

ClosedAIv2/
├── frontend/          # React frontend
├── minimal_tts_rvc/   # TTS and RVC models
├── models/            # Voice models (gitignored)
├── output/            # Generated audio (gitignored)
├── speech_documents/  # RAG training data
├── speech_patterns_db/ # ChromaDB vector store
├── main.py           # FastAPI backend
├── rag_system.py     # RAG implementation
└── requirements.txt  # Python dependencies

🔧 Environment Variables

Create a .env file in the root directory:

OPENAI_API_KEY=your_openai_api_key_here
PORT=10000
ENVIRONMENT=production

📝 API Endpoints

  • GET / - API documentation
  • GET /models - List available voice models
  • GET /validate - Validate system setup
  • POST /synthesize - Generate voice synthesis
  • GET /audio/{filename} - Serve generated audio files

🤝 Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Test thoroughly
  5. Submit a pull request

📄 License

MIT License - see LICENSE file for details.

🙏 Acknowledgments

  • Edge TTS for text-to-speech
  • RVC for voice cloning
  • OpenAI for RAG capabilities
  • ChromaDB for vector storage

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors