Language Support

The Inworld STT model (inworld/inworld-stt-1) supports 30 languages for speech recognition.

Supported languages

Language	Code
Arabic	`ar`
Cantonese	`yue`
Chinese	`zh`
Czech	`cs`
Danish	`da`
Dutch	`nl`
English	`en`
Filipino	`fil`
Finnish	`fi`
French	`fr`
German	`de`
Greek	`el`
Hindi	`hi`
Hungarian	`hu`
Indonesian	`id`
Italian	`it`
Japanese	`ja`
Korean	`ko`
Macedonian	`mk`
Malay	`ms`
Persian	`fa`
Polish	`pl`
Portuguese	`pt`
Romanian	`ro`
Russian	`ru`
Spanish	`es`
Swedish	`sv`
Thai	`th`
Turkish	`tr`
Vietnamese	`vi`

Specifying a language

The language field is a language hint — it tells the model which language to prefer, but it is not guaranteed to be respected. The model automatically detects the spoken language from the audio, and you can switch languages in the middle of a conversation without changing the hint. The field accepts ISO 639-1 language codes (e.g., en, ja) matching the codes listed in the table above.

BCP-47 codes (e.g., en-US, ja-JP) are also accepted and will be automatically converted to the base ISO 639 language code — for example, en-US becomes en. Regional variants do not affect recognition behavior.

If you know the primary language of the audio in advance, providing a hint will generally produce more accurate transcription results — especially for short utterances where auto-detection may not have enough context.

For the Inworld first-party model (inworld/inworld-stt-1), setting language also constrains the output script for English, Chinese, Cantonese, Japanese, Korean, Russian, and Hindi. For example, en keeps the transcript in Latin script (a name spoken in another language is romanized rather than written in its native script), while ja allows Japanese script. This applies only when you set the hint: to let auto-detection follow the spoken language as it changes mid-stream, leave language empty.

Third-party provider languages

The Inworld STT API also supports models from third-party providers, each with their own language coverage. See the provider documentation for details:

Provider	Models	Language documentation
Groq	`groq/whisper-large-v3`	Whisper — supported languages
AssemblyAI	`assemblyai/universal-streaming-multilingual`, `assemblyai/u3-rt-pro`, `assemblyai/whisper-rt`	AssemblyAI — supported languages
Soniox	`soniox/stt-rt-v4`, `soniox/stt-rt-v5`	Soniox — supported languages
Deepgram	`deepgram/flux-general-en`, `deepgram/flux-general-multi`	Deepgram Flux — language prompting

Get Started

Resources

Supported languages

Specifying a language

Third-party provider languages

Next steps

Developer Quickstart

API Reference

​Supported languages

​Specifying a language

​Third-party provider languages

​Next steps

Developer Quickstart

API Reference

Supported languages

Specifying a language

Third-party provider languages

Next steps