VoxSherpa TTS

Studio-quality offline neural text-to-speech for Android.
Hindi · English · British · Japanese · Chinese · and more — No cloud. No limits. No compromise.

🏆 Featured In

VoxSherpa TTS is listed in the official README of k2-fsa/sherpa-onnx — the core inference library powering this app.

Why VoxSherpa?

Most TTS apps make you choose between quality and privacy. Cloud-based tools like ElevenLabs sound incredible — but they require internet, send your text to remote servers, and charge per character.

VoxSherpa breaks that tradeoff.

It runs two professional-grade neural engines entirely on your device:

Engine	Quality	Speed	Best For
🧠 Kokoro-82M	Studio-grade · rivals ElevenLabs	Slower on budget hardware	Audiobooks, voiceovers, professional content
⚡ Piper / VITS	Natural · clear	Fast on any device	Daily use, quick synthesis

Screenshots

Generate	Models	Library	Settings

Features

🎙️ Dual Neural Engine

Kokoro-82M — 82 million parameter neural model. Multilingual support including Hindi, English, British English, French, Spanish, Chinese, Japanese and 50+ more languages. Same architecture used by top-tier commercial TTS services.
Piper / VITS — Fast, lightweight, natural. Generates speech in seconds on any Android device.

🔒 100% Offline & Private

All processing happens on your device
No internet required after model download
No account, no telemetry, no data collection
Your text never leaves your phone

📦 Model Management

Download models directly from the app
Import your own .onnx models from local storage
Multiple models installed simultaneously
Smart storage tracking

🎧 Audio Controls

Real-time waveform visualization
Adjustable speed and pitch
Play, pause, and replay generated audio
Export as WAV with correct sample rate per model

📚 Speech Library

Save all generated audio locally
Favorites system for quick access
View generation history with timestamps
Voice model attribution per recording

⚙️ Smart Settings

Smart Punctuation — natural pauses after sentence breaks
Emotion Tags — [whisper], [angry], [happy] support
Per-model voice selection (Kokoro supports 100+ speakers)
Theme-aware UI

Technical Architecture

User Text
    │
    ├─── Kokoro Engine (KokoroEngine.java)
    │         └── Sherpa-ONNX JNI → ONNX Runtime → CPU/NNAPI
    │                   └── kokoro-multi-lang-v1_0 (82M params, FP32)
    │
    └─── Piper / VITS Engine (VoiceEngine.java)
              └── Sherpa-ONNX JNI → ONNX Runtime → CPU
                        └── VITS model (language-specific)

Built with:

Sherpa-ONNX — on-device neural inference
Kokoro-82M — multilingual neural TTS model
Piper — fast local TTS
Android AudioTrack API — low-latency PCM playback

Performance

Generation speed depends entirely on your device's processor:

Device Tier	Kokoro	Piper
🟢 Flagship (Snapdragon 8 Gen 3)	~20–40 sec/min audio	~5 sec/min audio
🟡 Mid-range (8-core)	~60–90 sec/min audio	~10 sec/min audio
🔴 Budget (6-core)	~2–3 min/min audio	~20 sec/min audio

Kokoro prioritizes quality over speed by design. It uses the same 82M parameter architecture that powers premium commercial TTS — running it entirely offline on a mobile CPU is genuinely pushing the hardware limits.

Installation

🧪 Help Me Reach Google Play — Join the Beta!

I've submitted VoxSherpa TTS V2.1 to Google Play, but according to Play Store rules, I need at least 12 testers for 14 days before I can publish to production.

If you find this project useful and want early access to V2.1 — I'd really appreciate your help. All you need to do is install the app and keep it for 14 days. You don't have to do anything else.

What's new in V2.1:

🔊 System-wide TTS engine — use VoxSherpa in any app (Chrome, WhatsApp, etc.)
📄 PDF to Audio
📑 TXT to Audio

How to join:

Fill out the form below with your Gmail
I'll add you manually to the closed test
You'll receive a Play Store opt-in link

Source code for V2.0 and V2.1 will be pushed to GitHub after beta testing is complete.

F-Droid

Coming Soon — F-Droid version uses GitHub-hosted model list instead of Firebase — fully FOSS compliant, GPL v3.0 licensed.

Manual APK

Download the latest APK from Releases.

Model Import (Technical Users)

VoxSherpa supports importing custom .onnx models without any server:

Place your .onnx model + tokens.txt on device storage
Open Models tab → tap + → Import Local Model
Select your files

Compatible with any Sherpa-ONNX compatible TTS model.

Contributing

VoxSherpa is open source. Contributions welcome:

🐛 Bug reports via Issues
💡 Feature requests via Discussions
🔧 Pull requests for fixes and improvements

License

Copyright (C) 2025 CodeBySonu95

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.

https://www.gnu.org/licenses/gpl-3.0.html

Acknowledgements

k2-fsa/sherpa-onnx — the inference engine that makes this possible
hexgrad/Kokoro-82M — the neural model behind studio-quality synthesis
rhasspy/piper — fast local TTS engine

Built with obsession. Runs without internet.

VoxSherpa — Because your voice deserves to stay yours.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
.github/workflows		.github/workflows
app		app
assets		assets
fastlane/metadata/android/en-US		fastlane/metadata/android/en-US
gradle/wrapper		gradle/wrapper
LICENSE		LICENSE
README.md		README.md
build.gradle		build.gradle
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
index.html		index.html
settings.gradle		settings.gradle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoxSherpa TTS

Studio-quality offline neural text-to-speech for Android.
Hindi · English · British · Japanese · Chinese · and more — No cloud. No limits. No compromise.

🏆 Featured In

Why VoxSherpa?

Screenshots

Features

🎙️ Dual Neural Engine

🔒 100% Offline & Private

📦 Model Management

🎧 Audio Controls

📚 Speech Library

⚙️ Smart Settings

Technical Architecture

Performance

Installation

🧪 Help Me Reach Google Play — Join the Beta!

F-Droid

Manual APK

Model Import (Technical Users)

Contributing

License

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Languages

Folders and files

Latest commit

History

Repository files navigation

VoxSherpa TTS

Studio-quality offline neural text-to-speech for Android.Hindi · English · British · Japanese · Chinese · and more — No cloud. No limits. No compromise.

🏆 Featured In

Why VoxSherpa?

Screenshots

Features

🎙️ Dual Neural Engine

🔒 100% Offline & Private

📦 Model Management

🎧 Audio Controls

📚 Speech Library

⚙️ Smart Settings

Technical Architecture

Performance

Installation

🧪 Help Me Reach Google Play — Join the Beta!

F-Droid

Manual APK

Model Import (Technical Users)

Contributing

License

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Languages

Studio-quality offline neural text-to-speech for Android.
Hindi · English · British · Japanese · Chinese · and more — No cloud. No limits. No compromise.

Packages