Advanced Voice AI Platform with Proprietary Agent Technology
VoiceKeep is a voice AI platform that combines proprietary voice agent technology with ElevenLabs' voice synthesis capabilities. Create, clone, and interact with AI voices through natural conversation.
- Intelligent Voice Agent: Custom-built AI agent that understands context and responds naturally
- Real-time Voice Processing: Voice analysis and response generation
- Contextual Conversations: Maintains conversation flow and remembers context
- Multi-language Support: Voice processing in multiple languages
- Voice Cloning: Create voice replicas using ElevenLabs' models
- Speech-to-Text: Transcription with multilingual support
- Text-to-Speech: Voice generation with cloned voices
- Voice Library Management: Store and manage multiple cloned voices
- Record & Clone: Capture your voice and create a digital clone
- Generate Speech: Convert any text to speech using your cloned voice
- Voice Agent Conversation: Have intelligent conversations with your AI agent
- Export & Share: Download audio files and share your creations
- Node.js 18+
- npm or pnpm
- ElevenLabs API key
- Supabase account
# Clone the repository
git clone <repository-url>
cd voicekeep
# Install dependencies
npm install
# Set up environment variables
cp .env.example .env.local# .env.local
VITE_SUPABASE_URL=your_supabase_url
VITE_SUPABASE_ANON_KEY=your_supabase_anon_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key# Start development server
npm run dev
# Build for production
npm run build- Modern UI: Built with React, TypeScript, and Tailwind CSS
- Component Library: shadcn/ui components for consistent design
- State Management: React hooks for efficient state handling
- Audio Processing: Real-time audio recording and playback
- Voice Agent Engine: Our proprietary AI processing logic
- ElevenLabs Integration: Seamless API integration for voice services
- Session Management: Secure voice agent session handling
- Audio Processing: Real-time audio analysis and response generation
┌─────────────────────────────────────────────────────────────┐
│ VoiceKeep Architecture │
├─────────────────────────────────────────────────────────────┤
│ │
│ ┌─────────────┐ ┌──────────────┐ ┌─────────────┐ │
│ │ React │◄──►│ Supabase │◄──►│ ElevenLabs │ │
│ │ Frontend │ │ Functions │ │ API │ │
│ └─────────────┘ └──────────────┘ └─────────────┘ │
│ │ │ │ │
│ ▼ ▼ ▼ │
│ ┌─────────────┐ ┌──────────────┐ ┌─────────────┐ │
│ │ Voice │◄──►│ Our AI │◄──►│ Voice │ │
│ │ Recording │ │ Agent │ │ Synthesis │ │
│ └─────────────┘ └──────────────┘ └─────────────┘ │
└─────────────────────────────────────────────────────────────┘
- High-quality audio recording with real-time feedback
- Automatic voice cloning using ElevenLabs API
- Multi-language voice input support
- Voice quality optimization
- Convert any text to speech using cloned voices
- Multiple voice models and settings
- Audio file export and sharing
- Real-time voice preview
- Voice Cloning: Create voice replicas from audio samples
- Speech-to-Text: Convert speech to text with high accuracy
- Text-to-Speech: Generate natural-sounding speech
- Voice Library: Manage and organize cloned voices
- AI Agent Processing: Custom AI logic for intelligent responses
- Voice Analysis: Advanced voice pattern recognition
- Context Management: Maintains conversation state
- Response Generation: Natural language response creation
- Clean, intuitive interface with dark/light themes
- Responsive design for all devices
- Real-time audio visualization
- Smooth animations and transitions
- One-Click Recording: Simple voice capture process
- Instant Cloning: Fast voice cloning with progress indicators
- Natural Conversations: Seamless voice agent interactions
- Audio Controls: Play, pause, and download audio files
- Frontend: React 18, TypeScript, Vite
- Styling: Tailwind CSS, shadcn/ui
- Backend: Supabase Functions (Deno)
- Voice AI: ElevenLabs API
- Database: Supabase PostgreSQL
src/
├── components/
│ ├── VoiceAgentInterface.tsx # Voice agent UI
│ ├── RecordingControls.tsx # Audio recording
│ └── ui/ # UI components
├── services/
│ ├── voiceAgent.ts # Our AI agent logic
│ └── elevenlabs.ts # ElevenLabs integration
├── pages/
│ ├── VoiceAgentPage.tsx # Voice agent page
│ ├── ClonePage.tsx # Voice cloning
│ └── RecordPage.tsx # Voice recording
└── hooks/
└── useAudioRecorder.ts # Audio recording hook
npm run build
npm run previewVITE_SUPABASE_URL: Supabase project URLVITE_SUPABASE_ANON_KEY: Supabase anonymous keyELEVENLABS_API_KEY: ElevenLabs API key
Deploy the voice agent functions to Supabase:
voice-agent-init: Initialize voice agent sessionsvoice-agent-process: Process voice inputvoice-agent-respond: Generate voice responsesvoice-agent-end: Cleanup sessions
- Secure API Keys: All API keys stored securely in environment variables
- Session Management: Secure voice agent session handling
- Data Privacy: Voice data processed securely and not stored permanently
- Authentication: Secure user authentication and authorization
- Real-time Processing: Low-latency voice processing
- Optimized Audio: Efficient audio compression and streaming
- Caching: Smart caching for improved performance
- Responsive UI: Smooth animations and interactions
- Fork the repository
- Create a feature branch
- Make your changes
- Test thoroughly
- Submit a pull request
This project is licensed under the MIT License - see the LICENSE file for details.
For support and questions:
- Create an issue on GitHub
- Check the documentation
- Contact the development team
VoiceKeep - Where proprietary AI meets cutting-edge voice technology.