Generating alternative transcriptions

When you use Amazon Transcribe Medical, you get the transcription that has the highest confidence level. However, you can configure Amazon Transcribe Medical to return additional transcriptions with lower confidence levels.

Use alternative transcriptions to see different interpretations of the transcribed audio. For example, in an application that enables a person to review the transcription, you can present the alternative transcriptions for the person to choose from.

You can generate alternative transcriptions with the AWS Management Console or the StartMedicalTranscriptionJob API.

To use the AWS Management Console to generate alternative transcriptions, you enable alternative results when you configure your job.

Sign in to the AWS Management Console.
In the navigation pane, under Amazon Transcribe Medical, choose Transcription jobs.
Choose Create job.
On the Specify job details page, provide information about your transcription job.
Choose Next.
Enable Alternative results.
For Maximum alternatives, enter an integer value between 2 and 10, for the maximum number of alternative transcriptions you want in the output.
Choose Create.

To separate text per speaker in an audio file using a batch transcription job (API)

For the StartMedicalTranscriptionJob API, specify the following.
1. For MedicalTranscriptionJobName, specify a name that is unique in your AWS account.
2. For LanguageCode, specify the language code that corresponds to the language spoken in your audio file and the language of your vocabulary filter.
3. In the MediaFileUri parameter of the Media object, specify the location of the audio file you want to transcribe.
4. For Specialty, specify the medical specialty of the clinician speaking in the audio file.
5. For Type, specify whether you're transcribing a medical conversation or a dictation.
6. For OutputBucketName, specify the Amazon S3 bucket to store the transcription results.
7. For the Settings object, specify the following.
  1. ShowAlternatives – true.
  2. MaxAlternatives - An integer between 2 and 10 to indicate the number of alternative transcriptions you want in the transcription output.

The following request uses the AWS SDK for Python (Boto3) to start a transcription job that generates up to two alternative transcriptions.



from __future__ import print_function
import time
import boto3
transcribe = boto3.client('transcribe', 'us-west-2')
job_name = "my-first-transcription-job"
job_uri = s3://DOC-EXAMPLE-BUCKET/my-input-files/my-audio-file.flac
transcribe.start_medical_transcription_job(
    MedicalTranscriptionJobName = job_name,
    Media = {
        'MediaFileUri': job_uri
    },
    OutputBucketName = 'DOC-EXAMPLE-BUCKET',
    OutputKey = 'my-output-files/', 
    LanguageCode = 'en-US',
    Specialty = 'PRIMARYCARE',
    Type = 'CONVERSATION', 
    Settings = {
        'ShowAlternatives': True,
        'MaxAlternatives': 2
    }
)

while True:
   status = transcribe.get_medical_transcription_job(MedicalTranscriptionJobName = job_name)
   if status['MedicalTranscriptionJob']['TranscriptionJobStatus'] in ['COMPLETED', 'FAILED']:
       break
   print("Not ready yet...")
   time.sleep(5)
print(status)

To transcribe an audio file of a conversation between a primary care clinician and a patient in an audio file (AWS CLI)

Run the following code.


                    
aws transcribe start-transcription-job \
--cli-input-json file://filepath/example-start-command.json

The following code shows the contents of example-start-command.json.



{
      "MedicalTranscriptionJobName": "my-first-transcription-job",
      "LanguageCode": "en-US",
      "Specialty": "PRIMARYCARE",
      "Type": "CONVERSATION",
      "OutputBucketName":"DOC-EXAMPLE-BUCKET",
      "Media": {
          "MediaFileUri": "s3://DOC-EXAMPLE-BUCKET/my-input-files/my-audio-file.flac"
        },
      "Settings":{
          "ShowAlternatives": true,
          "MaxAlternatives": 2
        }
}

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Identifying PHI in a real-time stream

VPC endpoints (AWS PrivateLink)