Real-time AI voice translator that converts your speech into another language using a synthetic voice. Speak English and your teammates hear fluent German, Japanese, or any of 15+ supported languages — all in real-time.
- Real-time voice translation — Speak English, output translated speech instantly
- 322 AI voices — Pick from Microsoft Edge TTS voices across 74 languages
- 3 mic modes — Push-to-Talk, Toggle, or Open Mic (auto-detect speech)
- Live voice switching — Change voice, language, or mic mode on the fly
- Modern GUI — Dark-themed app with voice preview, key rebinding, and activity log
- Auto Docker startup — Whisper container starts automatically when you launch the app
- Subtitle overlay — Optional real-time subtitles for incoming foreign speech
Your voice → Whisper AI (transcribe) → Translate → Edge TTS (synthesize) → App mic input
- Record your English speech
- Whisper AI transcribes it to text
- Google Translate (or DeepL) translates to target language
- Edge TTS generates speech in your chosen AI voice
- Output plays to your app via virtual audio cable
- Windows 10/11
- Python 3.10+
- Docker Desktop (for Whisper AI)
- NVIDIA GPU (recommended for Whisper)
- VoiceMeeter Banana — Download
- VB-CABLE Virtual Audio Cable — Download
pip install -r requirements.txt- Set VoiceMeeter Input as your default Windows playback device
- In VoiceMeeter Banana, set A1 hardware out to your speakers/headphones
- In your target app (Discord, game, etc.):
- Output → VoiceMeeter Aux Input
- Input → CABLE Output
cp .env.sample .envRun python src/modules/get_audio_device_ids.py to find your device IDs, then update .env.
cd src
python app.pyThe app will auto-start Docker and load Whisper. Click Start Translator, hold your push-to-talk key, and speak.
German, Japanese, French, Spanish, Italian, Portuguese, Russian, Chinese, Korean, Hindi, Arabic, Dutch, Polish, Swedish, Turkish — and any language supported by Edge TTS.
Your Microphone → VoxMorph app → Whisper → Translate → Edge TTS
↓
[Parallel Output]
├→ VoiceMeeter (you hear it)
└→ VB-CABLE (app hears it)
- Whisper AI (faster-whisper) — Speech recognition via Docker
- Edge TTS — Microsoft neural text-to-speech (free, 322 voices)
- Google Translate / DeepL — Translation
- CustomTkinter — Modern GUI
- PyAudio / SoundDevice — Audio I/O
- VoiceMeeter + VB-CABLE — Audio routing
Built by ArsenalRX