AssemblyAI Code Switching Demo

This is a minimal reproducible example demonstrating an issue with AssemblyAI's code switching feature for English/French bilingual audio.

Issue Description

When transcribing Canadian Parliament floor audio (which contains both English and French), the transcription appears to only return English text, with French portions being omitted or removed.

Setup

Add your AssemblyAI API key to the .env file:

ASSEMBLYAI_API_KEY='your-api-key-here'

Install dependencies with pipenv:

pipenv install

Activate the pipenv environment:

pipenv shell

Run the demo:

python assemblyai_code_switching_demo.py

Expected Behavior

The audio file contains mixed English and French speech from Canadian Parliament proceedings. With code switching enabled, we expect:

Both English and French utterances to be transcribed
language_code attribute on utterances indicating "en" or "fr"
Full transcript containing both languages

Actual Behavior

Only French text appears in the transcript, with English portions missing or removed.

Audio Sample

URL: https://twocapitals.ca/assets/floor-audio-test.m4a

This is floor audio from Canadian Parliament, which naturally contains both English and French as both are official languages.

Environment

AssemblyAI Python SDK: 0.45.4
Python: 3.11+
Audio format: M4A
Languages: English (en) + French (fr)

Notes

According to AssemblyAI documentation, code switching can be enabled in two ways:

Manual: Set language_codes to a list of two language codes (one must be 'en'), e.g., ["en", "fr"]
Automatic: Enable language_detection=True and set code_switching=True within language_detection_options

However, the English-French language pair is not listed as one of the "optimal" pairs (only English-Spanish and English-German are listed as optimal). For other language combinations, the documentation notes that optimal results typically require the non-English language to be dominant in the audio.

Reference: https://www.assemblyai.com/docs/pre-recorded-audio/code-switching

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
code_switching_demo.py		code_switching_demo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AssemblyAI Code Switching Demo

Issue Description

Setup

Expected Behavior

Actual Behavior

Audio Sample

Environment

Notes

About

Uh oh!

Releases

Packages

Languages

awwester/assemblyai-code-switching

Folders and files

Latest commit

History

Repository files navigation

AssemblyAI Code Switching Demo

Issue Description

Setup

Expected Behavior

Actual Behavior

Audio Sample

Environment

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages