VhisperNative

Pure Swift/SwiftUI implementation of Vhisper voice input app for macOS.

Features

Multiple ASR Engines
- Qwen Realtime (WebSocket streaming)
- DashScope Paraformer
- OpenAI Whisper
- FunASR (local deployment)
LLM Text Refinement
- DashScope (Qwen)
- OpenAI ChatGPT
- Ollama (local)
System Integration
- Global hotkey support
- Menu bar app
- Waveform visualization
- Direct text input (Espanso-style)

Requirements

macOS 14.0+
Xcode 15.0+
Swift 5.9+

Setup

1. Create Xcode Project

Open Xcode
File > New > Project
Select "macOS" > "App"
Configure:
- Product Name: VhisperNative
- Team: Your team
- Organization Identifier: com.yourcompany
- Interface: SwiftUI
- Language: Swift

2. Add Source Files

Delete the default ContentView.swift and VhisperNativeApp.swift
Drag the VhisperNative folder contents into your project
Make sure "Copy items if needed" is checked

3. Configure Project

Select the project in navigator
Select the target
Under "Signing & Capabilities":
- Add "App Sandbox" capability (disable if needed for accessibility)
- Enable "Audio Input"
- Enable "Outgoing Connections (Client)"
Update Info.plist:
- Add NSMicrophoneUsageDescription
- Set LSUIElement to YES (menu bar only app)

4. Build & Run

Select "My Mac" as destination
Build and run (Cmd+R)

Configuration

API Keys

Click the menu bar icon
Open Settings
Configure your ASR provider and API key
Optionally enable LLM text refinement

Hotkey

Default hotkey is Option key. You can change it in Settings > General.

Permissions

The app requires:

Microphone: For voice recording
Accessibility: For global hotkeys and text input

Grant these permissions in System Settings > Privacy & Security.

Architecture

VhisperNative/
├── App/                    # Application entry
├── Core/
│   ├── ASR/               # Speech recognition services
│   ├── LLM/               # Language model services
│   ├── Audio/             # Audio recording & FFT
│   ├── Pipeline/          # Voice processing pipeline
│   └── Config/            # Configuration management
├── System/
│   ├── Hotkey/            # Global hotkey management
│   ├── Output/            # Text output & clipboard
│   └── Permissions/       # Permission management
├── UI/
│   ├── Settings/          # Settings views
│   └── Waveform/          # Waveform visualization
└── Managers/              # State management

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github/workflows		.github/workflows
VhisperNative.xcodeproj		VhisperNative.xcodeproj
VhisperNative		VhisperNative
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build.sh		build.sh
package.json		package.json
project.yml		project.yml
reset-permissions.sh		reset-permissions.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VhisperNative

Features

Requirements

Setup

1. Create Xcode Project

2. Add Source Files

3. Configure Project

4. Build & Run

Configuration

API Keys

Hotkey

Permissions

Architecture

License

About

Uh oh!

Releases 5

Packages

Contributors 3

Uh oh!

Languages

License

vimo-ai/VhisperNative

Folders and files

Latest commit

History

Repository files navigation

VhisperNative

Features

Requirements

Setup

1. Create Xcode Project

2. Add Source Files

3. Configure Project

4. Build & Run

Configuration

API Keys

Hotkey

Permissions

Architecture

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Contributors 3

Uh oh!

Languages

Packages