Amazon SageMaker
Developer Guide

Using the Amazon Mechanical Turk Workforce

The Amazon Mechanical Turk workforce provides the most workers for your Amazon Augmented AI task review and Amazon SageMaker Ground Truth labeling job.

You can use the console to choose the Amazon Mechanical Turk workforce for your Amazon SageMaker Ground Truth labeling job or Amazon Augmented AI human review workflow, or you can provide the Amazon Resource Name (ARN) for the Amazon Mechanical Turk workforce when you use the CreateLabelingJob operation.

Any Amazon Mechanical Turk workforce billing is handled as part of your Ground Truth or Amazon Augmented AI billing. You do not need to create a separate Mechanical Turk account to use the Amazon Mechanical Turk workforce.

The ARN for the Amazon Mechanical Turk workforce is:

  • arn:aws:sagemaker:region:394669845002:workteam/public-crowd/default

The Amazon Mechanical Turk workforce is a world-wide resource. Workers are available 24 hours a day, 7 days a week. You typically get the fastest turn-around for your human review tasks and labeling jobs when you use the Amazon Mechanical Turk workforce.

Adjust the number of workers that annotate each data object based on the complexity of the job and the quality that you need. Amazon SageMaker Ground Truth uses annotation consolidation to improve the quality of the labels. More workers can make a difference in the quality of the labels for more complex labeling jobs, but might not make a difference for simpler jobs. For more information, see Annotation Consolidation. Annotation consolidation is not supported for Amazon Augmented AI human review workflows.

Important

Use the Amazon Mechanical Turk workforce to label only data that is public or has been stripped of any sensitive information. Do not share confidential information, personal information, or protected health information with this workforce. Do not use the Amazon Mechanical Turk workforce when you use Amazon Augmented AI in conjunction with other AWS HIPAA-eligible services, such as Amazon Textract and Amazon Rekognition.

To choose the Amazon Mechanical Turk workforce when you are creating a labeling job or human review workflow using the console, do the following during the Select workers and configure tool step:

To use the Amazon Mechanical Turk workforce

  1. Choose Public from Worker types.

  2. Choose The dataset does not contain adult content if your dataset doesn't contain potentially offensive content. This enables workers to opt out if they don't want to work with it.

  3. Acknowledge that your data will be viewed by the Amazon Mechanical Turk workforce and that all personally identifiable information (PII) has been removed.

  4. Choose Additional configuration to set optional parameters.

  5. Optional. Enable automated data labeling to have Ground Truth automatically label some of your dataset. For more information, see Using Automated Data Labeling. Automated data labeling is not available for Amazon Augmented AI.

  6. Optional. Set the number of workers that should see each object in your dataset. Using more workers can increase the quality of your labels but also increases the cost.

Your labeling job or human review task will now be sent to the Amazon Mechanical Turk workforce. You can use the console to continue configuring your labeling job.