feat: add REST API endpoint for audio transcription by nuspy · Pull Request #738 · cjpais/Handy

nuspy · 2026-02-08T15:04:40Z

Add a REST API server (axum) that accepts audio files via multipart POST and returns transcriptions. Supports WAV, MP3, FLAC, OGG Vorbis, AAC via symphonia, with ffmpeg fallback for OGG Opus (Telegram voice messages).

POST /transcribe: accepts multipart 'file' or 'audio' field
GET /health: health check endpoint
Enabled via HANDY_API_PORT env var (e.g. HANDY_API_PORT=8384)
Binds to 127.0.0.1 only (localhost)
Resamples audio to 16kHz mono for Whisper/Parakeet/Moonshine

https://claude.ai/code/session_01G6v7CQen3RdSBUdpFVQbzD

Before Submitting This PR

Please confirm you have done the following:

I have searched existing issues and pull requests (including closed ones) to ensure this isn't a duplicate
I have read CONTRIBUTING.md

If this is a feature or change that was previously closed/rejected:

I have explained in the description below why this should be reconsidered
I have gathered community feedback (link to discussion below)

Human Written Description

Related Issues/Discussions

Fixes #
Discussion:

Community Feedback

Testing

Screenshots/Videos (if applicable)

AI Assistance

No AI was used in this PR
AI was used (please describe below)

If AI was used:

Tools used:
How extensively:

Add a REST API server (axum) that accepts audio files via multipart POST and returns transcriptions. Supports WAV, MP3, FLAC, OGG Vorbis, AAC via symphonia, with ffmpeg fallback for OGG Opus (Telegram voice messages). - POST /transcribe: accepts multipart 'file' or 'audio' field - GET /health: health check endpoint - Enabled via HANDY_API_PORT env var (e.g. HANDY_API_PORT=8384) - Binds to 127.0.0.1 only (localhost) - Resamples audio to 16kHz mono for Whisper/Parakeet/Moonshine https://claude.ai/code/session_01G6v7CQen3RdSBUdpFVQbzD

cjpais · 2026-02-08T15:07:57Z

Did you also check #509 exists?

cjpais · 2026-02-11T02:56:57Z

no response from the author so I'm closing this #509 will be the one we'll track this under for now

cjpais closed this Feb 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

feat: add REST API endpoint for audio transcription#738

feat: add REST API endpoint for audio transcription#738
nuspy wants to merge 1 commit intocjpais:mainfrom
nuspy:claude/add-audio-transcription-api-ZLvJe

nuspy commented Feb 8, 2026

Uh oh!

cjpais commented Feb 8, 2026

Uh oh!

cjpais commented Feb 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Comments

Conversation

nuspy commented Feb 8, 2026

Before Submitting This PR

Human Written Description

Related Issues/Discussions

Community Feedback

Testing

Screenshots/Videos (if applicable)

AI Assistance

Uh oh!

cjpais commented Feb 8, 2026

Uh oh!

cjpais commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cjpais commented Feb 11, 2026 •

edited

Loading