feat(aws): add auto language detection and mid-stream language switch… by cldsime · Pull Request #5435 · livekit/agents

cldsime · 2026-04-13T17:23:55Z

Summary

Adds support for Amazon Transcribe's automatic language identification parameters to the livekit-plugins-aws STT plugin, enabling auto language detection and mid-stream language switching without requiring users to manually specify a language_code.

Changes

6 new parameters added to `STTOptions` and `STT.init()`:

identify_language — detect the dominant language for the stream
identify_multiple_languages — detect language switches mid-stream
language_options — comma-separated list of expected language codes (2–12 required)
preferred_language — bias detection toward a specific language
vocabulary_names — custom vocabularies per language
vocabulary_filter_names — vocabulary filters per language

All default to disabled (False / NOT_GIVEN), preserving full backward compatibility.

Conditional config building in `SpeechStream._run()`:

language_code and identify_language/identify_multiple_languages are mutually exclusive per the AWS API. The config builder now conditionally sends one or the other:

if self._opts.identify_language:
    live_config["identify_language"] = True
    ...
elif self._opts.identify_multiple_languages:
    live_config["identify_multiple_languages"] = True
    ...
else:
    live_config["language_code"] = self._opts.language

Bug fix: `filtered_config` boolean handling

The original filter {k: v for k, v in live_config.items() if v and is_given(v)} silently drops False booleans. Replaced with explicit type checking that correctly preserves booleans, numbers, and NOT_GIVEN values.

Usage

# Existing behavior — unchanged
stt = STT(language="en-US")
Single language auto-detection
stt = STT(identify_language=True, language_options="en-US,es-US,fr-FR")
Multi-language mid-stream switching
stt = STT(

identify_multiple_languages=True,

language_options="en-US,es-US,fr-FR,de-DE,ja-JP,ko-KR,zh-HK,pt-BR,hi-IN,vi-VN,pl-PL,ru-RU",

)

Test Results

Tested with identify_multiple_languages=True and 12 language codes. All 12 configured languages were successfully detected across test sessions with mid-stream switching:

Language	Code	Detected
English	en-US	✅
Spanish	es-US	✅
French	fr-FR	✅
German	de-DE	✅
Japanese	ja-JP	✅
Korean	ko-KR	✅
Cantonese	zh-HK	✅
Portuguese	pt-BR	✅
Hindi	hi-IN	✅
Vietnamese	vi-VN	✅
Polish	pl-PL	✅
Russian	ru-RU	✅

Sample output showing mid-stream language switching in a single session:

[FINAL] [en-US] Good afternoon, everyone. Welcome to day 4.
[FINAL] [zh-HK] 你聽緊嘅係SBS電台廣東話節目
[FINAL] [vi-VN] Xin kính chào quý vị và các bạn
[FINAL] [pt-BR] Nós estamos aqui de azul hoje porque azul é a cor mais celebrando
[FINAL] [pl-PL] Dzień dobry, przy mikrofonie Joanna Borkowska Surucić

Backward Compatibility

All new parameters default to disabled. Existing code continues to work without any changes.

CLAassistant · 2026-04-13T17:24:07Z

All committers have signed the CLA.

tinalenguyen

we also now support passing multiple detected languages in SpeechData via the source_languages field, could you pass the detected languages there as well?

…ing support

…ield

…etection mode

tinalenguyen

thanks for iterating!

This comment was marked as resolved.

Sign in to view

tinalenguyen reviewed Apr 23, 2026

View reviewed changes

Comment thread livekit-plugins/livekit-plugins-aws/livekit/plugins/aws/stt.py

This comment was marked as resolved.

Sign in to view

cldsime added 7 commits April 23, 2026 13:16

feat(aws): add auto language detection and mid-stream language switch…

0e64c94

…ing support

fix(aws): handle None language_code fallback in auto-detection mode

7dd9d68

fix(aws): add type annotation to filtered_config to fix mypy errors

4f13086

feat(aws): add mutual exclusion check and populate source_languages f…

e753ad8

…ield

style(aws): fix line length for ruff linter

328b6c5

fix(aws): remove source_languages until rebased onto latest main

cf78ef3

feat(aws): populate source_languages with detected language in auto-d…

2a2b9f5

…etection mode

cldsime force-pushed the feature/auto-language-detection branch from 3f16cbc to 2a2b9f5 Compare April 23, 2026 20:17

style(aws): apply ruff formatting

c7ba76d

cldsime closed this Apr 23, 2026

cldsime reopened this Apr 23, 2026

tinalenguyen approved these changes Apr 24, 2026

View reviewed changes

tinalenguyen merged commit cfb853f into livekit:main Apr 24, 2026
34 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(aws): add auto language detection and mid-stream language switch…#5435

feat(aws): add auto language detection and mid-stream language switch…#5435
tinalenguyen merged 8 commits intolivekit:mainfrom
cldsime:feature/auto-language-detection

cldsime commented Apr 13, 2026

Uh oh!

CLAassistant commented Apr 13, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

tinalenguyen left a comment

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

tinalenguyen left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

cldsime commented Apr 13, 2026

Summary

Changes

6 new parameters added to STTOptions and STT.__init__():

Conditional config building in SpeechStream._run():

Bug fix: filtered_config boolean handling

Usage

Single language auto-detection

Multi-language mid-stream switching

Test Results

Backward Compatibility

Uh oh!

CLAassistant commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

tinalenguyen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

tinalenguyen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

6 new parameters added to `STTOptions` and `STT.init()`:

Conditional config building in `SpeechStream._run()`:

Bug fix: `filtered_config` boolean handling

CLAassistant commented Apr 13, 2026 •

edited

Loading