Hard Limits in Amazon Textract - Amazon Textract

Hard Limits in Amazon Textract

The following is a list of hard limits in Amazon Textract, which cannot be changed. For information about limitations in location and limits you can change see Amazon Textract Endpoints and Quotas. For information about limits you can change, see AWS Service Limits. To change a limit, see Create Case.

Amazon Textract

Limit Description

Accepted File formats

Synchronous operations support JPEG and PNG (JPEG 2000 not supported). Asynchronous operations support JPEG, PNG and PDF (JPEG 2000 not supported).

File Size Limits

JPEG and PNG files have a 10MB size limit. PDF files have a 500MB limit.

PDF Specific Limits

The maximum number of pages is 3,000, the maximum height and width is 40 inches and 2880 points. PDFs cannot be password protected. PDFs cannot contain JPEG 2000 formatted images.

Document Rotation and Image Size

Amazon Textract supports all in-plane document rotations, for example 45 degree in-plane rotation.

Amazon Textract supports images with a resolution under 10000 pixels on all sides.

Text Alignment

Text can be text aligned horizontally within the document. Amazon Textract does not support vertical text alignment within the document.

Languages

Amazon Textract supports English, French, German, Italian, Portuguese and Spanish text detection. Amazon Textract will not return the language detected in its output.

Character Size

The minimum height for text to be detected is 15 pixels. At 150 DPI, this would be the same as 8 point font.

Character Type

Amazon Textract supports both handwritten and printed character recognition.

Characters

Amazon Textract detects the following characters:

  • a-z

  • A-Z

  • 0-9

  • ä Ä ö Ö ü Ü ç Ç é É â Â ê Ê î Î ô Ô û Û à À è È ù Ù ë Ë ï Ï ü Ü á Á é É í Í ó Ó ú Ú ü Ü ñ Ñ ì Ì ò Ò ã Ã õ Õ

  • ! " # $ % ' & ( ) * + , - . / : ; = ? @ [ \ ] ^ _ ` { | } ~ > < ° € £ ¥ ₹ ß ẞ ¿ ¡ € £ ¥ ₹ ø Ø œ Œ © ® ™ § ¹ ² ³ '