Input Documents
A suitable input for an Amazon Textract operation is a single or multipage document. Some examples are a legal document, a form, an ID, or a letter. A form is a document with questions or prompts for a user to provide answers. Some examples are a patient registration form, a tax form, or an insurance claim form.
A document can be in JPEG, PNG, PDF, or TIFF format. With PDF and TIFF format files,
you can process multipage documents. For information about how Amazon Textract represents
documents as Block
objects, see Text Detection and Document Analysis
Response Objects.
The following is an acceptable input document example.

For information about document limits, see Quotas in Amazon Textract.
For Amazon Textract synchronous operations, you can use input documents that are stored in an Amazon S3 bucket, or you can pass base64-encoded image bytes. For more information, see Calling Amazon Textract Synchronous Operations. For asynchronous operations, you need to supply input documents in an Amazon S3 bucket. For more information, see Calling Amazon Textract Asynchronous Operations.