Language Lab for correcting spoken Japanese

The purpose of this program is to correct a sample of spoken Japanese. This is done by creating a recording and then passing the transcribed text to the ChatGPT API. The transcription is managed using OpenAI's Whisper API. The app requires an environmental variable OpenAI_Key with your key for OpenAI's API (Open AI)

The prompt for this program is focused on correcting the speach and providing a corrected example in Hiragana and common Kanji, and identifying grammatical errors. Japanese is selected on the whisper API and specified in the prompt. Another language could be corrected if the prompt and Whisper settings are changed. ChatGPT does make errors in both the transcription and translation. Grammar correction seems to be more reliable.

The recordings are stored as .wav file in the ./recordings directory. There is a utility program remove_recordings.py which will delete all the recordings in the directory if there is no need to save them.

There is also a Docker image for the application:

docker container run -e OpenAI_Key -p 8501:8501 cgrams/languagelab:v1

It is a large image and takes some time to download and space on your drive. It also requires your OpenAI_Key environmental variable with your personal key.

There are some known problems:

The rendering of the Streamlit Audio Recorder components initially sends a recording with size 0 bytes to Whisper: Solution is to reset and repeat.
ChatGPT has returned responses based on more literal rendering of sounds. Solution is to trust but verify.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
st_audiorec		st_audiorec
.gitignore		.gitignore
README.md		README.md
main_lab.py		main_lab.py
manage_text.py		manage_text.py
prompt_formats.py		prompt_formats.py
recorder.py		recorder.py
remove_recordings.py		remove_recordings.py
st_custom_components.py		st_custom_components.py
transcribe.py		transcribe.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Language Lab for correcting spoken Japanese

Resources Used:

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Cameron-Grams/LanguageCorrection

Folders and files

Latest commit

History

Repository files navigation

Language Lab for correcting spoken Japanese

Resources Used:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages