Detecting Text - Amazon Textract

Detecting Text

Amazon Textract provides synchronous and asynchronous operations that return only the text detected in a document. For both sets of operations, the following information is returned in multiple Block objects:

  • The lines and words of detected text

  • The relationships between the lines and words of detected text

  • The page that the detected text appears on

  • The location of the lines and words of text on the document page

For more information, see Lines and Words of Text.

To detect text synchronously, use the DetectDocumentText API operation, and pass a document file as input. The entire set of results is returned by the operation. For more information and an example, see Processing Documents with Synchronous Operations.

Note

The Amazon Rekognition API operation DetectText is different from DetectDocumentText. You use DetectText to detect text in live scenes, such as posters or road signs.

To detect text asynchronously, use StartDocumentTextDetection to start processing an input document file. To get the results, call GetDocumentTextDetection. The results are returned in one or more responses from GetDocumentTextDetection. For more information and an example, see Processing Documents with Asynchronous Operations.