VoxMorph

Real-time AI voice translator that converts your speech into another language using a synthetic voice. Speak English and your teammates hear fluent German, Japanese, or any of 15+ supported languages — all in real-time.

Features

Real-time voice translation — Speak English, output translated speech instantly
322 AI voices — Pick from Microsoft Edge TTS voices across 74 languages
3 mic modes — Push-to-Talk, Toggle, or Open Mic (auto-detect speech)
Live voice switching — Change voice, language, or mic mode on the fly
Modern GUI — Dark-themed app with voice preview, key rebinding, and activity log
Auto Docker startup — Whisper container starts automatically when you launch the app
Subtitle overlay — Optional real-time subtitles for incoming foreign speech

How It Works

Your voice → Whisper AI (transcribe) → Translate → Edge TTS (synthesize) → App mic input

Record your English speech
Whisper AI transcribes it to text
Google Translate (or DeepL) translates to target language
Edge TTS generates speech in your chosen AI voice
Output plays to your app via virtual audio cable

Requirements

Windows 10/11
Python 3.10+
Docker Desktop (for Whisper AI)
NVIDIA GPU (recommended for Whisper)
VoiceMeeter Banana — Download
VB-CABLE Virtual Audio Cable — Download

Quick Start

1. Install dependencies

pip install -r requirements.txt

2. Configure audio routing

Set VoiceMeeter Input as your default Windows playback device
In VoiceMeeter Banana, set A1 hardware out to your speakers/headphones
In your target app (Discord, game, etc.):
- Output → VoiceMeeter Aux Input
- Input → CABLE Output

3. Set up your .env

cp .env.sample .env

Run python src/modules/get_audio_device_ids.py to find your device IDs, then update .env.

4. Launch

cd src
python app.py

The app will auto-start Docker and load Whisper. Click Start Translator, hold your push-to-talk key, and speak.

Supported Languages

German, Japanese, French, Spanish, Italian, Portuguese, Russian, Chinese, Korean, Hindi, Arabic, Dutch, Polish, Swedish, Turkish — and any language supported by Edge TTS.

Audio Routing Diagram

Your Microphone → VoxMorph app → Whisper → Translate → Edge TTS
                                                          ↓
                                                   [Parallel Output]
                                                   ├→ VoiceMeeter (you hear it)
                                                   └→ VB-CABLE (app hears it)

Tech Stack

Whisper AI (faster-whisper) — Speech recognition via Docker
Edge TTS — Microsoft neural text-to-speech (free, 322 voices)
Google Translate / DeepL — Translation
CustomTkinter — Modern GUI
PyAudio / SoundDevice — Audio I/O
VoiceMeeter + VB-CABLE — Audio routing

Author

Built by ArsenalRX

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
docs		docs
logs		logs
src		src
.env.sample		.env.sample
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
ROADMAP.md		ROADMAP.md
docker-compose-de.yml		docker-compose-de.yml
docker-compose.yml		docker-compose.yml
glossary.txt		glossary.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoxMorph

Features

How It Works

Requirements

Quick Start

1. Install dependencies

2. Configure audio routing

3. Set up your .env

4. Launch

Supported Languages

Audio Routing Diagram

Tech Stack

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VoxMorph

Features

How It Works

Requirements

Quick Start

1. Install dependencies

2. Configure audio routing

3. Set up your .env

4. Launch

Supported Languages

Audio Routing Diagram

Tech Stack

Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages