Asynchronous analysis for event detection - Amazon Comprehend

Asynchronous analysis for event detection

To detect events in a document set, use the StartEventsDetectionJob operation to start an asynchronous job.

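Before you start

Before you start, make sure that you have:

  • Input and output buckets: Identify the Amazon S3 buckets that you want to use for input and output. The buckets must be in the same Region as the API that you are calling.

  • IAM service role: You must have an IAM service role with permission to access your input and output buckets. For more information, see Role-based permissions required for asynchronous operations.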

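Detect events using the AWS CLI

The following example demonstrates using the StartEventsDetectionJob operation with the AWS CLI.

The example is formatted for Unix, Linux, and macOS. For Windows, replace the backslash (\) Unix continuation character at the end of each line with a caret (^).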

aws comprehend start-events-detection-job \
    --region region \
    --job-name job name \
    --cli-input-json file://path to JSON input file

For the cli-input-json parameter, supply the path to a JSON file that contains the request data, as shown in the following example.

{
    "InputDataConfig": {
        "S3Uri": "s3://input bucket/input path",
        "InputFormat": "ONE_DOC_PER_LINE"
    },
    "OutputDataConfig": {
        "S3Uri": "s3://output bucket/output path"
    },
    "DataAccessRoleArn": "arn:aws:iam::account ID:role/data access role",
    "LanguageCode": "en",
    "TargetEventTypes": [
        "BANKRUPTCY",
        "EMPLOYMENT",
        "CORPORATE_ACQUISITION",
        "INVESTMENT_GENERAL",
        "CORPORATE_MERGER",
        "IPO",
        "RIGHTS_ISSUE",
        "SECONDARY_OFFERING",
        "SHELF_OFFERING",
        "TENDER_OFFERING",
        "STOCK_SPLIT"
    ]
}
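Hand-written request files are easy to get subtly wrong (a missing comma, a misspelled event type). The following is a minimal Python sketch that generates the request JSON instead. The function name build_events_request, its parameters, and the example bucket and role values are hypothetical; the field names and event types are copied from the example above, and the validation it performs is an illustrative assumption, not the service's own validation.

```python
import json

# Event types copied from the request example above.
EVENT_TYPES = [
    "BANKRUPTCY", "EMPLOYMENT", "CORPORATE_ACQUISITION", "INVESTMENT_GENERAL",
    "CORPORATE_MERGER", "IPO", "RIGHTS_ISSUE", "SECONDARY_OFFERING",
    "SHELF_OFFERING", "TENDER_OFFERING", "STOCK_SPLIT",
]


def build_events_request(input_s3_uri, output_s3_uri, role_arn,
                         event_types=None, language_code="en"):
    """Return a dict shaped like the JSON input file shown above.

    Hypothetical helper: it only performs local sanity checks.
    """
    types = list(event_types) if event_types is not None else list(EVENT_TYPES)
    unknown = sorted(set(types) - set(EVENT_TYPES))
    if unknown:
        raise ValueError(f"unknown event types: {unknown}")
    for uri in (input_s3_uri, output_s3_uri):
        if not uri.startswith("s3://"):
            raise ValueError(f"not an S3 URI: {uri}")
    return {
        "InputDataConfig": {
            "S3Uri": input_s3_uri,
            "InputFormat": "ONE_DOC_PER_LINE",
        },
        "OutputDataConfig": {"S3Uri": output_s3_uri},
        "DataAccessRoleArn": role_arn,
        "LanguageCode": language_code,
        "TargetEventTypes": types,
    }


# Placeholder values for illustration only.
request = build_events_request(
    "s3://example-input-bucket/docs/",
    "s3://example-output-bucket/results/",
    "arn:aws:iam::123456789012:role/example-data-access-role",
    event_types=["CORPORATE_ACQUISITION", "CORPORATE_MERGER"],
)
# json.dumps always emits syntactically valid JSON.
print(json.dumps(request, indent=4))
```

Redirecting the printed output to a file gives a document that can be passed to the cli-input-json parameter with the file:// prefix.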

If the request to start the events detection job was successful, you receive the following response:

{
    "JobStatus": "SUBMITTED",
    "JobId": "job ID"
}

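List events detection jobs using the AWS CLI

Use the ListEventsDetectionJobs operation to see a list of the events detection jobs that you have submitted. The list includes information about the input and output locations that you used and the status of each detection job. The example is formatted for Unix, Linux, and macOS. For Windows, replace the backslash (\) Unix continuation character at the end of each line with a caret (^).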

aws comprehend list-events-detection-jobs --region region

You receive a JSON response similar to the following:

{
    "EventsDetectionJobPropertiesList": [
        {
            "DataAccessRoleArn": "arn:aws:iam::account ID:role/data access role",
            "EndTime": timestamp,
            "InputDataConfig": {
                "InputFormat": "ONE_DOC_PER_LINE",
                "S3Uri": "s3://input bucket/input path"
            },
            "JobId": "job ID",
            "JobName": "job name",
            "JobStatus": "COMPLETED",
            "LanguageCode": "en",
            "Message": "message",
            "OutputDataConfig": {
                "S3Uri": "s3://output bucket/output path"
            },
            "SubmitTime": timestamp,
            "TargetEventTypes": [
                "BANKRUPTCY",
                "EMPLOYMENT",
                "CORPORATE_ACQUISITION",
                "INVESTMENT_GENERAL",
                "CORPORATE_MERGER",
                "IPO",
                "RIGHTS_ISSUE",
                "SECONDARY_OFFERING",
                "SHELF_OFFERING",
                "TENDER_OFFERING",
                "STOCK_SPLIT"
            ]
        }
    ],
    "NextToken": "next token"
}
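The NextToken field in the response indicates that results can span several pages. The following Python sketch shows one way to walk all pages. It assumes a client object that exposes list_events_detection_jobs with MaxResults and NextToken keyword arguments, as the boto3 Comprehend client does; the helper name iter_events_detection_jobs and the PagedStubClient class are hypothetical, and the stub exists only so the sketch runs without AWS credentials.

```python
def iter_events_detection_jobs(client, page_size=50):
    """Yield every job properties dict, following NextToken across pages."""
    token = None
    while True:
        kwargs = {"MaxResults": page_size}
        if token:
            kwargs["NextToken"] = token
        page = client.list_events_detection_jobs(**kwargs)
        yield from page.get("EventsDetectionJobPropertiesList", [])
        token = page.get("NextToken")
        if not token:
            break  # no further pages


class PagedStubClient:
    """Hypothetical stand-in for a Comprehend client; serves fixed pages."""

    def __init__(self, pages):
        self._pages = pages

    def list_events_detection_jobs(self, MaxResults=100, NextToken=None):
        index = int(NextToken) if NextToken else 0
        response = {"EventsDetectionJobPropertiesList": self._pages[index]}
        if index + 1 < len(self._pages):
            response["NextToken"] = str(index + 1)
        return response


stub = PagedStubClient([
    [{"JobId": "a", "JobStatus": "COMPLETED"}, {"JobId": "b", "JobStatus": "FAILED"}],
    [{"JobId": "c", "JobStatus": "IN_PROGRESS"}],
])
jobs = list(iter_events_detection_jobs(stub))
for job in jobs:
    print(job["JobId"], job["JobStatus"])
```

With real credentials, the stub would presumably be replaced by boto3.client("comprehend", region_name=...) and the loop would stay unchanged.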

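Describe an events detection job using the AWS CLI

Use the DescribeEventsDetectionJob operation to get the status of an existing job. The example is formatted for Unix, Linux, and macOS. For Windows, replace the backslash (\) Unix continuation character at the end of each line with a caret (^).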

aws comprehend describe-events-detection-job \
    --region region \
    --job-id job ID

You receive the following JSON response:

{
    "EventsDetectionJobProperties": {
        "DataAccessRoleArn": "arn:aws:iam::account ID:role/data access role",
        "EndTime": timestamp,
        "InputDataConfig": {
            "InputFormat": "ONE_DOC_PER_LINE",
            "S3Uri": "s3://input bucket/input path"
        },
        "JobId": "job ID",
        "JobName": "job name",
        "JobStatus": "job status",
        "LanguageCode": "en",
        "Message": "message",
        "OutputDataConfig": {
            "S3Uri": "s3://output bucket/output path"
        },
        "SubmitTime": timestamp,
        "TargetEventTypes": [
            "BANKRUPTCY",
            "EMPLOYMENT",
            "CORPORATE_ACQUISITION",
            "INVESTMENT_GENERAL",
            "CORPORATE_MERGER",
            "IPO",
            "RIGHTS_ISSUE",
            "SECONDARY_OFFERING",
            "SHELF_OFFERING",
            "TENDER_OFFERING",
            "STOCK_SPLIT"
        ]
    }
}
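Because the job runs asynchronously, a caller typically repeats the describe call until the job finishes. The following Python sketch shows one possible polling loop. It assumes a client that exposes describe_events_detection_job(JobId=...), as the boto3 Comprehend client does, and it assumes that COMPLETED, FAILED, and STOPPED are the terminal statuses. The names wait_for_events_job and ScriptedClient, the polling interval, and the poll limit are hypothetical choices for illustration.

```python
import time

# Assumption: these statuses mean the job will not change state again.
TERMINAL_STATUSES = {"COMPLETED", "FAILED", "STOPPED"}


def wait_for_events_job(client, job_id, poll_seconds=30.0, max_polls=120,
                        sleep=time.sleep):
    """Poll the job until it reaches a terminal status, then return its properties."""
    for attempt in range(max_polls):
        response = client.describe_events_detection_job(JobId=job_id)
        props = response["EventsDetectionJobProperties"]
        if props["JobStatus"] in TERMINAL_STATUSES:
            return props
        if attempt < max_polls - 1:
            sleep(poll_seconds)
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")


class ScriptedClient:
    """Hypothetical stand-in for a Comprehend client; replays fixed statuses."""

    def __init__(self, statuses):
        self._statuses = list(statuses)
        self.calls = 0

    def describe_events_detection_job(self, JobId):
        status = self._statuses[min(self.calls, len(self._statuses) - 1)]
        self.calls += 1
        return {"EventsDetectionJobProperties": {"JobId": JobId, "JobStatus": status}}


demo = ScriptedClient(["SUBMITTED", "IN_PROGRESS", "COMPLETED"])
# Skip real sleeping in the demo by injecting a no-op sleep function.
final = wait_for_events_job(demo, "job ID", sleep=lambda seconds: None)
print(final["JobStatus"])
```

Passing the sleep function as a parameter keeps the loop testable; in real use the default time.sleep applies and the stub is replaced by an actual client.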

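Get events detection results

The following is an example output file from an analysis job that detected events in documents. The format of the input is one document per line.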

{"Entities": [{"Mentions": [{"BeginOffset": 12, "EndOffset": 27, "GroupScore": 1.0, "Score": 0.916355, "Text": "over a year ago", "Type": "DATE"}]}, {"Mentions": [{"BeginOffset": 33, "EndOffset": 39, "GroupScore": 1.0, "Score": 0.996603, "Text": "Amazon", "Type": "ORGANIZATION"}]}, {"Mentions": [{"BeginOffset": 66, "EndOffset": 77, "GroupScore": 1.0, "Score": 0.999283, "Text": "Whole Foods", "Type": "ORGANIZATION"}]}], "Events": [{"Arguments": [{"EntityIndex": 2, "Role": "INVESTEE", "Score": 0.999283}, {"EntityIndex": 0, "Role": "DATE", "Score": 0.916355}, {"EntityIndex": 1, "Role": "INVESTOR", "Score": 0.996603}], "Triggers": [{"BeginOffset": 373, "EndOffset": 380, "GroupScore": 0.999984, "Score": 0.999955, "Text": "acquire", "Type": "CORPORATE_ACQUISITION"}], "Type": "CORPORATE_ACQUISITION"}, {"Arguments": [{"EntityIndex": 2, "Role": "PARTICIPANT", "Score": 0.999283}], "Triggers": [{"BeginOffset": 115, "EndOffset": 123, "GroupScore": 1.0, "Score": 0.999967, "Text": "combined", "Type": "CORPORATE_MERGER"}], "Type": "CORPORATE_MERGER"}], "File": "doc.txt", "Line": 0}
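In the sample output, each event argument refers to an entity through EntityIndex, a position in the Entities array. The following Python sketch resolves those indexes into readable text. The function name summarize_events is hypothetical, the sample record is a trimmed copy of the output above (offsets and scores omitted), and using the first mention of each entity as its label is a simplifying assumption.

```python
import json


def summarize_events(line):
    """Parse one output line and pair each event argument with its entity text."""
    doc = json.loads(line)
    entities = doc.get("Entities", [])
    summaries = []
    for event in doc.get("Events", []):
        arguments = []
        for arg in event.get("Arguments", []):
            # EntityIndex points into the Entities array of the same record.
            mentions = entities[arg["EntityIndex"]]["Mentions"]
            arguments.append((arg["Role"], mentions[0]["Text"]))
        summaries.append({
            "type": event["Type"],
            "triggers": [t["Text"] for t in event.get("Triggers", [])],
            "arguments": arguments,
        })
    return summaries


# Trimmed copy of the sample record shown above.
SAMPLE_LINE = json.dumps({
    "Entities": [
        {"Mentions": [{"Text": "over a year ago", "Type": "DATE"}]},
        {"Mentions": [{"Text": "Amazon", "Type": "ORGANIZATION"}]},
        {"Mentions": [{"Text": "Whole Foods", "Type": "ORGANIZATION"}]},
    ],
    "Events": [
        {"Arguments": [{"EntityIndex": 2, "Role": "INVESTEE"},
                       {"EntityIndex": 0, "Role": "DATE"},
                       {"EntityIndex": 1, "Role": "INVESTOR"}],
         "Triggers": [{"Text": "acquire", "Type": "CORPORATE_ACQUISITION"}],
         "Type": "CORPORATE_ACQUISITION"},
        {"Arguments": [{"EntityIndex": 2, "Role": "PARTICIPANT"}],
         "Triggers": [{"Text": "combined", "Type": "CORPORATE_MERGER"}],
         "Type": "CORPORATE_MERGER"},
    ],
    "File": "doc.txt",
    "Line": 0,
})

summaries = summarize_events(SAMPLE_LINE)
for summary in summaries:
    print(summary["type"], summary["triggers"], summary["arguments"])
```

For a full output file, the same function could be applied to each line in turn, since each line holds the result for one input document.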

For more information about the structure of the events output file and the supported event types, see Events.