Identifying the dominant languages in your media - Amazon Transcribe

Identifying the dominant languages in your media

Amazon Transcribe is able to automatically identify the languages spoken in your media without you having to specify a language code.

Batch language identification can identify the dominant language spoken in your media file or, if your media contains multiple languages, it can identify all languages spoken. To improve language identification accuracy, you can optionally provide a list of two or more languages you think may be present in your media.

Streaming language identification can identify one language per channel (a maximum of two channels are supported). Streaming requests must have a minimum of two additional language options included in your request. Providing language options allows for faster language identification, which prevents the loss of the first few seconds of your stream.

Important

Batch and streaming transcriptions support different languages. Refer to the supported languages table for details.

To learn about monitoring and events with language identification, refer to Language identification events.