SpeechTTModels - Audio to Text Conversion

SpeechTTModels is a speech-to-text conversion tool designed to transcribe audio files into text. It uses state-of-the-art models for automatic speech recognition (ASR) to convert various audio formats (such as MP3, WAV, etc.) into readable text.

Features

Audio Transcription: Converts audio files into accurate text.
Multiple Audio Formats Supported: Supports various audio file formats like MP3, WAV, FLAC, and more.
Fast and Reliable: Uses advanced ASR models for high-quality transcriptions.
Customizable Output: Allows saving the transcribed text in different formats (e.g., plain text, JSON).

Installation

Clone the repository:

 git clone https://github.com/Fluentez/SpeechTTModels.git
 cd SpeechTTModels

Install library

 pip install vosk

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
vosk-model-small-en-us-0.15		vosk-model-small-en-us-0.15
LICENSE		LICENSE
README.md		README.md
speechRecognition.py		speechRecognition.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpeechTTModels - Audio to Text Conversion

Features

Installation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SpeechTTModels - Audio to Text Conversion

Features

Installation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages