本文属于机器翻译版本。若本译文内容与英语原文存在差异,则一律以英文原文为准。
使用AWS开发工具包
以下代码示例显示如何获取 Amazon Textract 文档分析作业的数据。
- Python
-
- SDK for Python (Boto3)
-
class TextractWrapper: """Encapsulates Textract functions.""" def __init__(self, textract_client, s3_resource, sqs_resource): """ :param textract_client: A Boto3 Textract client. :param s3_resource: A Boto3 Amazon S3 resource. :param sqs_resource: A Boto3 Amazon SQS resource. """ self.textract_client = textract_client self.s3_resource = s3_resource self.sqs_resource = sqs_resource def get_analysis_job(self, job_id): """ Gets data for a previously started detection job that includes additional elements. :param job_id: The ID of the job to retrieve. :return: The job data, including a list of blocks that describe elements detected in the image. """ try: response = self.textract_client.get_document_analysis( JobId=job_id) job_status = response['JobStatus'] logger.info("Job %s status is %s.", job_id, job_status) except ClientError: logger.exception("Couldn't get data for job %s.", job_id) raise else: return response-
在 GitHub
中查找说明和更多代码。 -
有关 API 详细信息,请参阅GetDocumentAnalysis在AWSSDK for Python (Boto3) 的 API 参考.
-
有关的完整列表AWSSDK 开发人员指南和代码示例,请参阅将 Amazon Textract 与AWS开发工具包. 本主题还包括有关入门的信息以及有关以前 SDK 版本的详细信息。
检测文档中的文本
开始对文档进行异步分析