Use Amazon Augmented AI with Amazon Textract
Amazon Textract enables you to add document text detection and analysis to your
applications. Amazon Augmented AI (Amazon A2I) integrates directly with the Amazon Textract
AnalyzeDocument API operation. You can use AnalyzeDocument to analyze a document for
relationships between detected items. When you add an Amazon A2I human review loop to an
AnalyzeDocument request, Amazon A2I monitors the Amazon Textract results and sends a
document to one or more human workers for review when the conditions specified in your
flow definition are met. For example, if you want a human to review a specific key such as
Full name: and its associated input values, you can create an activation condition that
starts a human review any time the Full name: key is detected or when the inference
confidence for that key falls within a range that you specify.
The following image depicts the Amazon A2I built-in workflow with Amazon Textract. On the left, the image shows the resources that are required to create an Amazon Textract human review workflow: an Amazon S3 bucket, activation conditions, a worker task template, and a work team. These resources are used to create a human review workflow, or flow definition. An arrow points right to the next step in the workflow: using Amazon Textract to configure a human loop with the human review workflow. A second arrow points right from this step to the step in which the activation conditions specified in the human review workflow are met. This initiates the creation of a human loop. On the right of the image, the human loop is depicted in three steps: 1) the worker UI and tools are generated and the task is made available to workers, 2) workers review input data, and 3) results are saved in Amazon S3.
When you create a human review workflow (flow definition), you specify activation conditions that determine when Amazon Textract sends a task to a human worker for review.
You can set the following activation conditions when using the Amazon Textract task type:
- Initiate a human review for specific form keys based on the form key confidence score.
- Initiate a human review when specific form keys are missing.
- Initiate a human review for all form keys identified by Amazon Textract with confidence scores in a specified range.
- Randomly send a sample of forms to humans for review.
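The four conditions listed above can be sketched as a single activation conditions document. This is a minimal sketch, not a definitive configuration: the function name and its arguments are hypothetical, and the condition type and parameter names (ImportantFormKeyConfidenceCheck, MissingImportantFormKey, Sampling, and so on) are written from the Amazon A2I activation conditions schema as best understood here, so confirm them against the schema topics referenced later in this section before use.

```python
import json


def build_activation_conditions(form_key, aliases, threshold, sample_percent):
    """Return a JSON string that combines the four Textract activation conditions.

    All argument names are illustrative. A human review starts when ANY of the
    conditions under "Or" evaluates to true.
    """
    conditions = {
        "Conditions": [
            {
                "Or": [
                    # 1) A specific form key was detected with low confidence.
                    {
                        "ConditionType": "ImportantFormKeyConfidenceCheck",
                        "ConditionParameters": {
                            "ImportantFormKey": form_key,
                            "ImportantFormKeyAliases": aliases,
                            "KeyValueBlockConfidenceLessThan": threshold,
                            "WordBlockConfidenceLessThan": threshold,
                        },
                    },
                    # 2) A specific form key is missing from the document.
                    {
                        "ConditionType": "MissingImportantFormKey",
                        "ConditionParameters": {
                            "ImportantFormKey": form_key,
                            "ImportantFormKeyAliases": aliases,
                        },
                    },
                    # 3) Any form key ("*") was detected with low confidence.
                    {
                        "ConditionType": "ImportantFormKeyConfidenceCheck",
                        "ConditionParameters": {
                            "ImportantFormKey": "*",
                            "KeyValueBlockConfidenceLessThan": threshold,
                            "WordBlockConfidenceLessThan": threshold,
                        },
                    },
                    # 4) A random sample of forms is sent for review.
                    {
                        "ConditionType": "Sampling",
                        "ConditionParameters": {
                            "RandomSamplingPercentage": sample_percent
                        },
                    },
                ]
            }
        ]
    }
    return json.dumps(conditions)


conditions_json = build_activation_conditions("Full name:", ["Name"], 80, 5)
```

In practice you would likely keep only the conditions you need; combining all four under one "Or" is done here only to show each shape side by side.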
When your activation condition depends on form key confidence scores, you can use two types of prediction confidence to initiate human loops:
- Identification confidence – The confidence score for key-value pairs detected within a form.
- Qualification confidence – The confidence score for the text contained within the key and the value in a form.
In the image in the following section, Full Name: Jane Doe is the key-value pair, Full Name is the key, and Jane Doe is the value.
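As a rough illustration of how the two confidence types might be expressed in a condition, the sketch below builds one confidence check with an optional range for each type. It assumes that identification confidence corresponds to the KeyValueBlockConfidence parameters and qualification confidence to the WordBlockConfidence parameters of the activation conditions schema; treat that mapping, and the helper name confidence_check, as assumptions to verify against the schema documentation.

```python
def confidence_check(form_key, identification=None, qualification=None):
    """Build one ImportantFormKeyConfidenceCheck condition as a dict.

    identification / qualification are optional (low, high) tuples. Each one
    adds a GreaterThan/LessThan pair, so the condition matches scores that
    fall strictly between low and high.
    """
    params = {"ImportantFormKey": form_key}
    if identification is not None:
        low, high = identification
        # Assumed mapping: identification confidence -> KeyValueBlockConfidence*
        params["KeyValueBlockConfidenceGreaterThan"] = low
        params["KeyValueBlockConfidenceLessThan"] = high
    if qualification is not None:
        low, high = qualification
        # Assumed mapping: qualification confidence -> WordBlockConfidence*
        params["WordBlockConfidenceGreaterThan"] = low
        params["WordBlockConfidenceLessThan"] = high
    return {
        "ConditionType": "ImportantFormKeyConfidenceCheck",
        "ConditionParameters": params,
    }


# Review "Full Name" when the key-value pair itself is uncertain (40 to 90),
# regardless of how confident the text recognition was.
full_name_check = confidence_check("Full Name", identification=(40, 90))
```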
You can set these activation conditions using the Amazon SageMaker console when you create a human
review workflow, or by creating a JSON for human loop activation conditions and specifying
this as input in the HumanLoopActivationConditions parameter of the CreateFlowDefinition
API operation. To learn how to specify activation conditions in JSON format, see JSON Schema
for Human Loop Activation Conditions in Amazon Augmented AI and Use Human Loop Activation
Conditions JSON Schema with Amazon Textract.
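The following sketch shows one way the activation conditions JSON might be passed to CreateFlowDefinition through the AWS SDK for Python (Boto3). It only assembles the request; the function name, task title, task description, and all ARNs and paths in the usage lines are hypothetical placeholders, and the call itself is left as a comment because it requires AWS credentials and existing resources.

```python
import json


def flow_definition_request(name, conditions_json, workteam_arn,
                            task_ui_arn, role_arn, s3_output_path):
    """Assemble keyword arguments for the SageMaker CreateFlowDefinition call."""
    return {
        "FlowDefinitionName": name,
        # Tells Amazon A2I that Textract AnalyzeDocument (forms) is the source.
        "HumanLoopRequestSource": {
            "AwsManagedHumanLoopRequestSource": "AWS/Textract/AnalyzeDocument/Forms/V1"
        },
        # The activation conditions are passed as a JSON string, not a dict.
        "HumanLoopActivationConfig": {
            "HumanLoopActivationConditionsConfig": {
                "HumanLoopActivationConditions": conditions_json
            }
        },
        "HumanLoopConfig": {
            "WorkteamArn": workteam_arn,
            "HumanTaskUiArn": task_ui_arn,
            "TaskTitle": "Review form fields",          # illustrative text
            "TaskDescription": "Verify the key-value pairs extracted from the form",
            "TaskCount": 1,                             # workers per task
        },
        "OutputConfig": {"S3OutputPath": s3_output_path},
        "RoleArn": role_arn,
    }


# Hypothetical condition: send 5 percent of forms for review.
sampling = json.dumps({
    "Conditions": [
        {"ConditionType": "Sampling",
         "ConditionParameters": {"RandomSamplingPercentage": 5}}
    ]
})

request = flow_definition_request(
    name="example-textract-review",
    conditions_json=sampling,
    workteam_arn="arn:aws:sagemaker:us-east-1:111122223333:workteam/private-crowd/example",
    task_ui_arn="arn:aws:sagemaker:us-east-1:111122223333:human-task-ui/example",
    role_arn="arn:aws:iam::111122223333:role/ExampleA2IRole",
    s3_output_path="s3://amzn-s3-demo-bucket/a2i-output/",
)

# With credentials configured, the request could be sent like this:
#   import boto3
#   boto3.client("sagemaker").create_flow_definition(**request)
```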
Note
When using Augmented AI with Amazon Textract, create Augmented AI resources in the same AWS Region you
use to call AnalyzeDocument.
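One way to guard against a Region mismatch is to compare the Region embedded in the flow definition ARN with the Region of the client that calls AnalyzeDocument. The helper names below are hypothetical, and the sketch assumes the standard ARN layout arn:partition:service:region:account-id:resource.

```python
def arn_region(arn):
    """Extract the Region field from a standard-format ARN."""
    parts = arn.split(":")
    if len(parts) < 6 or parts[0] != "arn":
        raise ValueError(f"Not a valid ARN: {arn!r}")
    return parts[3]


def same_region(flow_definition_arn, textract_region):
    """Return True when the flow definition lives in the Textract client's Region."""
    return arn_region(flow_definition_arn) == textract_region


# Placeholder ARN for illustration only.
flow_arn = "arn:aws:sagemaker:us-east-1:111122223333:flow-definition/example-textract-review"
if not same_region(flow_arn, "us-east-1"):
    raise RuntimeError("Flow definition and Textract client are in different Regions")
```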
Get Started: Integrate a Human Review into an Amazon Textract Analyze Document Job
To integrate a human review into an Amazon Textract text detection and analysis job, you need to create a flow definition, and then use the Amazon Textract API to integrate that flow definition into your workflow. To learn how to create a flow definition using the SageMaker console or Augmented AI API, see the following topics:
After you've created your flow definition, see Using Augmented AI with Amazon Textract to learn how to integrate your flow definition into your Amazon Textract task.
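As a rough sketch of that integration step, the code below assembles an AnalyzeDocument request that references a flow definition through the HumanLoopConfig parameter, and inspects the response to see whether a human loop was started. The function names, bucket, object key, loop name, and ARN are hypothetical, and the check on HumanLoopActivationOutput assumes that a HumanLoopArn is present in the response only when a loop was created.

```python
def analyze_document_request(bucket, document_key, flow_definition_arn,
                             human_loop_name):
    """Assemble keyword arguments for the Textract AnalyzeDocument call."""
    return {
        "Document": {"S3Object": {"Bucket": bucket, "Name": document_key}},
        # FORMS enables key-value pair detection, which the activation
        # conditions for this task type evaluate.
        "FeatureTypes": ["FORMS"],
        "HumanLoopConfig": {
            "HumanLoopName": human_loop_name,
            "FlowDefinitionArn": flow_definition_arn,
        },
    }


def human_loop_started(response):
    """Return True if the response reports that a human loop was created."""
    output = response.get("HumanLoopActivationOutput", {})
    return "HumanLoopArn" in output


request = analyze_document_request(
    bucket="amzn-s3-demo-bucket",
    document_key="forms/example-form.png",
    flow_definition_arn="arn:aws:sagemaker:us-east-1:111122223333:flow-definition/example-textract-review",
    human_loop_name="example-loop-001",
)

# With credentials configured, the request could be sent like this:
#   import boto3
#   response = boto3.client("textract").analyze_document(**request)
#   if human_loop_started(response):
#       print(response["HumanLoopActivationOutput"]["HumanLoopActivationReasons"])
```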
End-to-End Example Using Amazon Textract and Amazon A2I
For an end-to-end example that demonstrates how to use Amazon Textract with Amazon A2I using the console, see Tutorial: Get Started in the Amazon A2I Console.
To learn how to use the Amazon A2I API to create and start a human review, you can use
Amazon Augmented AI (Amazon A2I) integration with Amazon Textract's Analyze Document [Example].
A2I Textract Worker Console Preview
When they're assigned a review task in an Amazon Textract workflow, workers might see a user interface similar to the following:
You can customize this interface in the SageMaker console when you create your human review definition, or by creating and using a custom template. To learn more, see Create and Manage Worker Task Templates.