Gemini Transcribe

A web application for transcribing audio and video files using Google's Gemini Flash model.

Live Application: https://gemini-transcribe.fly.dev/

Features

Specify the desired language of the transcript
Automatically detects and labels different speakers in the audio (Speaker 1, Speaker 2, etc.).
Instead of a timestamp for every word, the transcript is logically grouped into paragraphs with a single timestamp, making it much more readable.
Click on any timestamp to jump to that specific moment in the audio or video player.
Download the final transcript as a plain .txt file (with or without timestamps) or as a .srt subtitle file for use in video players.

Getting started

Prerequisites

Node.js (v22 or later)
A Google AI API Key

Clone & install

git clone https://github.com/mikeesto/gemini-transcribe.git
cd gemini-transcribe

Install dependencies (using npm, pnpm, or yarn)

npm install

Environment variables

Create a .env file in the root of the project and add your Google API.
```
GOOGLE_API_KEY="YOUR_API_KEY_HERE"
```
Run the development server
```
npm run dev
```
The application should now be running at http://localhost:5173.

Future work

Flash is a very interesting model to explore for audio transcription because...

It can attempt to detect not only words but also silence, sentiment, and sounds beyond human voices
It can translate the transcription

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
src		src
static		static
.dockerignore		.dockerignore
.gitignore		.gitignore
.npmrc		.npmrc
.prettierignore		.prettierignore
.prettierrc		.prettierrc
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
components.json		components.json
eslint.config.js		eslint.config.js
fly.toml		fly.toml
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
svelte.config.js		svelte.config.js
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Gemini Transcribe

Features

Getting started

Future work

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

mikeesto/gemini-transcribe

Folders and files

Latest commit

History

Repository files navigation

Gemini Transcribe

Features

Getting started

Future work

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages