
@KirillKukharev

This PR adds a transcribe_bytes() method to the GigaAMASR class that transcribes audio directly from in-memory bytes, without any file I/O.

Why is this important?

Performance and latency:
- No file I/O: no temporary files are created
- No overhead from spawning an ffmpeg subprocess
- PCM16 bytes are converted to tensors directly via torch.frombuffer()

Technical details

transcribe_bytes() uses load_audio_from_bytes() instead of load_audio(), converting the incoming bytes directly; a sketch of how that conversion could look is shown below.
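For illustration only, a minimal sketch of the byte-to-tensor path; the actual load_audio_from_bytes() added in this PR may differ in signature and details:

import torch

def load_audio_from_bytes(audio_bytes: bytes) -> torch.Tensor:
    # Interpret the raw PCM16 samples in place (native byte order); no temp file, no ffmpeg.
    pcm = torch.frombuffer(bytearray(audio_bytes), dtype=torch.int16)
    # Scale int16 values to float32 in [-1.0, 1.0], the range ASR front-ends typically expect.
    return pcm.to(torch.float32) / 32768.0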
Usage example:
Before this PR (transcription goes through a file on disk):

result = model.transcribe("audio.wav")

After this PR (no file is created):

audio_bytes = receive_audio_from_network()  # placeholder: yields raw PCM16 mono audio at 16 kHz
result = model.transcribe_bytes(audio_bytes)
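A self-contained way to try this, assuming a mono 16-bit 16 kHz WAV file and an already loaded GigaAMASR model (the file name below is just a placeholder):

import wave

with wave.open("audio_16k_mono.wav", "rb") as wav:
    # readframes() returns the raw PCM16 payload without the WAV header.
    audio_bytes = wav.readframes(wav.getnframes())

result = model.transcribe_bytes(audio_bytes)
print(result)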
