Amazon Transcribe
Developer Guide

The AWS Documentation website is getting a new look!
Try it now and let us know what you think. Switch to the new look >>

You can return to the original look by selecting English in the language selector above.

How Amazon Transcribe Works

Amazon Transcribe analyzes audio files that contain speech and uses advanced machine learning techniques to transcribe the voice data into text. You can then use the transcription as you would any text document.

To transcribe an audio file, Amazon Transcribe uses three operations:

  • StartTranscriptionJob – Starts an asynchronous job to transcribe the speech in an audio file to text.

  • ListTranscriptionJobs – Returns a list of transcription jobs that have been started. You can specify the status of the jobs that you want the operation to return. For example, you can get a list of all pending jobs, or a list of completed jobs.

  • GetTranscriptionJob – Returns the result of a transcription job. The response contains a link to a JSON file containing the results.

To transcribe streaming audio to text, Amazon Transcribe provides one operation:

  • StartStreamTranscription – Starts a bi-directional HTTP/2 stream where audio is streamed to Amazon Transcribe and the transcription results are streamed to your application.

You can also start a WebSocket protocol stream to send audio the Amazon Transcribe. For more information, see Using Amazon Transcribe Streaming with WebSockets.

You can use Amazon Transcribe to create and manage custom vocabularies for your solution. A custom vocabulary gives Amazon Transcribe more information about how to process speech in an audio clip.

  • CreateVocabulary – Creates a custom vocabulary that you can use in your transcription jobs.

  • DeleteVocabulary – Deletes a custom vocabulary from your account.

  • GetVocabulary – Gets information about a custom vocabulary and a URL that you can use to download the contents of a vocabulary.

  • ListVocabularies – Gets a list of custom vocabularies in your account.

  • UpdateVocabulary – Updates an existing vocabulary.

You can transcribe speech in any of the following languages:

  • Modern Standard Arabic (ar-SA)

  • Chinese Mandarin-Mainland (zh-CN)

  • Australian English (en-AU)

  • British English (en-GB)

  • Indian English (en-IN)

  • US English (en-US)

  • French (fr-FR)

  • Canadian French (fr-CA)

  • German (de-DE)

  • Indian Hindi (hi-IN)

  • Italian (it-IT)

  • Korean (ko-KR)

  • Brazilian Portuguese (pt-BR)

  • Russian (ru-RU)

  • Spanish (es-ES)

  • US Spanish (es-US)

You can use streaming transcription for the following languages:

  • British English (en-GB)

  • US English (en-US)

  • French (fr-FR)

  • Canadian French (fr-CA)

  • US Spanish (es-US)