Skip to content

Conversation

@Robinbinu
Copy link
Contributor

Summary

  • Add PocketTTSEngine - Kyutai Labs' lightweight 100M parameter TTS model
  • CPU-optimized inference (~6x real-time performance)
  • Voice cloning via WAV files
  • ~200ms latency to first audio chunk
  • 8 built-in voices (alba, marius, javert, jean, fantine, cosette, eponine, azelma)

Installation

pip install pocket-tts

Usage

from RealtimeTTS import TextToAudioStream, PocketTTSEngine

engine = PocketTTSEngine(voice="alba")
stream = TextToAudioStream(engine)
stream.feed("Hello world!")
stream.play_async()

Test plan

  • Test with built-in voices
  • Test voice cloning with WAV file
  • Verify CPU performance

Kyutai Labs' Pocket TTS - lightweight 100M parameter model with:
- CPU-optimized inference (~6x real-time performance)
- Voice cloning via WAV files
- ~200ms latency to first audio chunk
- 8 built-in voices

Install with: pip install pocket-tts
@Robinbinu
Copy link
Contributor Author

Hi @KoljaB,
I have added support for PocketTTS, please review and merge when you are available.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant