Open Speech Studio

Open source speech recognition for everyone — a local AI alternative to Wispr Flow.

Part of the OpenAEC Foundation.

Features

100% Local — All speech recognition runs on your own computer. No cloud, no subscription, full privacy.
GPU & CPU — Works on both GPU (CUDA/Vulkan) and CPU. Choose the model that fits your hardware.
Multilingual — Supports 99+ languages with automatic language detection.
Dictionary — Add your own words, names and jargon for better recognition.
Universal speech — Works in any application via a global hotkey.
Open source — Apache 2.0 license. Free to use, modify and distribute.

Installation

Download the latest installer from the Releases page.

Usage

Start Open Speech Studio
Go to Models and download a speech model
Press Ctrl+Shift+Space to start speaking
Speak your text
Press again to stop — text is automatically inserted

Dictionary

Go to the Dictionary tab to:

Add your own words (names, abbreviations, technical terms)
Set up replacements (e.g. "OpenAEC" -> "OpenAEC Foundation")

Models

Via the Models tab you can download models for speech recognition:

Model	Size	Speed	Accuracy	Recommended for
tiny	75 MB	Very fast	Basic	CPU, quick notes
base	142 MB	Fast	Good	CPU, daily use
small	466 MB	Medium	Very good	GPU or fast CPU
medium	1.5 GB	Slow	Excellent	GPU
large-v3	3.1 GB	Very slow	Best	Powerful GPU
large-v3-turbo	1.6 GB	Medium	Near best	GPU

Tech Stack

Component	Technology
Framework	Tauri 2 (Rust)
Frontend	SolidJS + Vite
AI Model	whisper.cpp via whisper-rs
Audio	cpal (Cross-Platform Audio Library)
UI Style	Office-style ribbon (OpenAEC Foundation)

Development

Prerequisites

Node.js (LTS)
Rust
Platform-specific dependencies: see Tauri prerequisites

Getting started

git clone https://github.com/OpenAEC-Foundation/open-speech-studio.git
cd open-speech-studio
npm install
cargo tauri dev

Production build

cargo tauri build

# With GPU support
cargo tauri build --features cuda

Architecture

open-speech-studio/
├── src/                    # SolidJS frontend
│   ├── components/         # UI components (Ribbon, Settings, etc.)
│   ├── lib/                # API bindings
│   └── styles/             # CSS
├── src-tauri/              # Rust backend
│   └── src/
│       ├── lib.rs          # Tauri commands & app state
│       ├── transcriber.rs  # Whisper speech recognition
│       ├── audio.rs        # Microphone recording
│       ├── settings.rs     # Configuration & model auto-detection
│       └── dictionary.rs   # Dictionary
├── models/                 # Whisper models (downloaded via app)
└── index.html              # Entry point

License

Apache License 2.0 — see LICENSE

Contributing

Contributions are welcome! Open an issue or pull request on GitHub.

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
.github		.github
Socials		Socials
bin		bin
docs/superpowers		docs/superpowers
models		models
public/sounds		public/sounds
scripts		scripts
sidecar-dlls		sidecar-dlls
src-tauri		src-tauri
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app-icon.svg		app-icon.svg
build.bat		build.bat
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Open Speech Studio

Features

Installation

Usage

Dictionary

Models

Tech Stack

Development

Prerequisites

Getting started

Production build

Architecture

License

Contributing

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Open Speech Studio

Features

Installation

Usage

Dictionary

Models

Tech Stack

Development

Prerequisites

Getting started

Production build

Architecture

License

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages