Skip to content

Fluentez/SpeechTTModels

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

SpeechTTModels - Audio to Text Conversion

SpeechTTModels is a speech-to-text conversion tool designed to transcribe audio files into text. It uses state-of-the-art models for automatic speech recognition (ASR) to convert various audio formats (such as MP3, WAV, etc.) into readable text.

Features

  • Audio Transcription: Converts audio files into accurate text.
  • Multiple Audio Formats Supported: Supports various audio file formats like MP3, WAV, FLAC, and more.
  • Fast and Reliable: Uses advanced ASR models for high-quality transcriptions.
  • Customizable Output: Allows saving the transcribed text in different formats (e.g., plain text, JSON).

Installation

  1. Clone the repository:
 git clone https://github.com/Fluentez/SpeechTTModels.git
 cd SpeechTTModels
  1. Install library
 pip install vosk

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages