Skip to content

Mattyfreshy/RTT

Repository files navigation

Real Time Transcription

Goal of this project is to create a real time transcription tool that can be used to transcribe audio from microphone and output to text.

Features

  • Transcribe audio from microphone and output to text in terminal.
  • ASR (Automatic Speech Recognition) using Google Speech API, Whisper, or AssemblyAI.

Requirements

  • Python 3.6 or higher
  • All requirements in requirements.txt

Note

If could not build wheels for pyaudio, On Mac

brew install portaudio

On Linux

sudo apt-get install portaudio19-dev

Then try again

Files

  • RTT.py

    • Main file that runs the program.
  • RTT_spectrogram.py

  • system_record.py

    • Records audio from system and outputs to text using loopback.
  • util.py

    • Utility functions for the program.
  • file_converter.py

    • Converts audio files to any supporting format. Requires ffmpeg.
  • ASR

    • Folder containing ASR modules. Currently supports Google Speech API, Whisper, and AssemblyAI.

Setup

  1. If you don’t have Python installed, install it from here.

  2. Clone this repository.

  3. Navigate into the project directory:

    cd RTT
  4. Create a new virtual environment:

    python -m venv venv
    . venv/bin/activate
  5. Install the requirements:

    pip install -r requirements.txt
  6. Make a copy of the example environment variables file:

    cp .env.example .env
  7. Add your API key to the newly created .env file.

  8. Run the app using python or python3 depending on your system:

    python RTT.py

About

Real Time Transcriptions

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages