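Monitoring OpenSearch Service events with Amazon EventBridge
Amazon OpenSearch Service integrates with Amazon EventBridge to notify you of certain events that affect your domains. Events from AWS services are delivered to EventBridge in near real time. The same events are also sent to Amazon CloudWatch Events, the predecessor of Amazon EventBridge. You can write simple rules to indicate which events interest you and which automated actions to take when an event matches a rule. Actions that can be triggered automatically include the following:
- Invoking an AWS Lambda function
- Invoking an Amazon EC2 Run Command
- Relaying the event to Amazon Kinesis Data Streams
- Activating an AWS Step Functions state machine
- Notifying an Amazon SNS topic or an Amazon SQS queue
For more information, see Getting started with Amazon EventBridge in the Amazon EventBridge User Guide.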
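As a rough illustration of what such a rule matches on, the sketch below builds an event pattern for the software update notifications shown later in this topic and checks events against it locally. The `source` and `detail-type` values come from the sample events; the `matches` helper is a hypothetical, simplified stand-in for EventBridge matching that handles only exact values on scalar fields and nested objects, not the full pattern syntax.

```python
import json

# Event pattern that selects OpenSearch Service software update notifications.
# The values are copied from the sample events in this topic.
SOFTWARE_UPDATE_PATTERN = {
    "source": ["aws.es"],
    "detail-type": ["Amazon OpenSearch Service Software Update Notification"],
}


def matches(pattern, event):
    """Hypothetical local matcher: exact-value matching on scalar fields only.

    Each pattern key maps to a list of accepted values or to a nested dict
    that is matched against the nested object in the event.
    """
    for key, expected in pattern.items():
        if key not in event:
            return False
        actual = event[key]
        if isinstance(expected, dict):
            if not isinstance(actual, dict) or not matches(expected, actual):
                return False
        elif actual not in expected:
            return False
    return True


def pattern_json(pattern):
    """Serialize the pattern as the JSON text a rule definition would hold."""
    return json.dumps(pattern, indent=2)
```

Adding a nested key such as `{"detail": {"status": ["Failed"]}}` to the pattern narrows the match to a single status under the same simplified rules.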
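Service software update events
OpenSearch Service sends events to EventBridge when one of the following service software update events occurs.
Service software update available
OpenSearch Service sends this event when a service software update is available.
Example
The following is an example event of this type: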
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Available", "severity": "Informational", "description": "Service software update R20220928 available. Service Software Deployment Mechanism: Blue/Green. For more information on deployment configuration, please see: https://docs.aws.amazon.com/opensearch-service/latest/developerguide/managedomains-configuration-changes.html" } }
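Service software update scheduled
OpenSearch Service sends this event when a service software update has been scheduled. For optional updates, you receive the notification on the scheduled date and can reschedule at any time. For required updates, you receive the notification three days before the scheduled date and can reschedule within the mandatory window.
Example
The following is an example event of this type: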
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Scheduled", "severity": "High", "description": "A new service software update [R20200330-p1] has been scheduled at [21st May 2023 12:40 GMT]. Please see documentation for more information on scheduling software updates: https://docs.aws.amazon.com/opensearch-service/latest/developerguide/service-software.html." } }
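Service software update rescheduled
OpenSearch Service sends this event when an optional service software update is rescheduled. For more information, see Optional versus required updates.
Example
The following is an example event of this type: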
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Rescheduled", "severity": "High", "description": "The service software update [R20200330-p1], which was originally scheduled for [21st May 2023 12:40 GMT], has been rescheduled to [23rd May 2023 12:40 GMT]. Please see documentation for more information on scheduling software updates: https://docs.aws.amazon.com/opensearch-service/latest/developerguide/service-software.html." } }
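In the sample events, the release name and the dates appear inside square brackets in the `description` text. The sketch below pulls those bracketed values out. It assumes the bracket convention seen in these samples holds; the description is free text rather than a documented contract, so treat the result as a convenience only. The function name is hypothetical.

```python
import re


def bracketed_fields(description):
    """Return the text inside each [...] pair, in order of appearance."""
    return re.findall(r"\[([^\]]*)\]", description)
```

For the rescheduled sample above, this returns the release name followed by the original and the new schedule times as strings.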
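Service software update started
OpenSearch Service sends this event when a service software update has started.
Example
The following is an example event of this type: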
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Started", "severity": "Informational", "description": "Service software update [R20200330-p1] started." } }
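Service software update completed
OpenSearch Service sends this event when a service software update has completed.
Example
The following is an example event of this type: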
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Completed", "severity": "Informational", "description": "Service software update [R20200330-p1] completed." } }
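Service software update cancelled
OpenSearch Service sends this event when a service software update has been cancelled.
Example
The following is an example event of this type: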
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Cancelled", "severity": "Informational", "description": "The scheduled service software update [R20200330-p1] has been cancelled as a newer update is available. Please schedule the latest update." } }
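Scheduled service software update cancelled
OpenSearch Service sends this event when a service software update that was previously scheduled for the domain has been cancelled.
Example
The following is an example event of this type: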
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Cancelled", "severity": "Informational", "description": "The scheduled service software update [R20200330-p1] has been cancelled." } }
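Service software update unexecuted
OpenSearch Service sends this event when it can't start a service software update.
Example
The following is an example event of this type: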
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Unexecuted", "severity": "Informational", "description": "The scheduled service software update [R20200330-p1] cannot be started. Reason: [reason]" } }
Service software update failed
OpenSearch Service sends this event when a service software update fails.
Example
The following is an example event of this type:
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Failed", "severity": "High", "description": "Installation of service software update [R20200330-p1] failed. [reason]." } }
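Service software update required
OpenSearch Service sends this event when a service software update is required. For more information, see Optional versus required updates.
Example
The following is an example event of this type: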
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Software Update Notification", "source": "aws.es", "account": "123456789012", "time": "2016-11-01T13:12:22Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Service Software Update", "status": "Required", "severity": "High", "description": "Service software update [R20200330-p1] available. Update will be automatically installed after [21st May 2023] if no action is taken. Service Software Deployment Mechanism: Blue/Green. For more information on deployment configuration, please see: https://docs.aws.amazon.com/opensearch-service/latest/developerguide/managedomains-configuration-changes.html" } }
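The software update events above differ mainly in `detail.status`, so a rule target such as a Lambda function can branch on that field. The following is a minimal sketch of such a handler. The set of statuses treated as needing attention is an assumed triage policy, not something this topic prescribes, and the shape of the returned dictionary is hypothetical.

```python
# Assumed triage policy: statuses that someone should look at.
ATTENTION_STATUSES = {"Required", "Failed", "Unexecuted"}


def lambda_handler(event, context):
    """Summarize a service software update event delivered by EventBridge."""
    detail = event.get("detail", {})
    status = detail.get("status", "Unknown")
    return {
        "domain_arns": event.get("resources", []),
        "status": status,
        "needs_attention": status in ATTENTION_STATUSES,
        "message": detail.get("description", ""),
    }
```

The handler reads only fields that appear in the sample events (`resources`, `detail.status`, `detail.description`) and falls back to defaults when a field is missing.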
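Auto-Tune events
OpenSearch Service sends events to EventBridge when one of the following Auto-Tune events occurs.
Auto-Tune pending
OpenSearch Service sends this event when Auto-Tune has identified tuning recommendations that improve cluster performance and availability. You see this event only for domains where Auto-Tune is disabled.
Example
The following is an example event of this type: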
{ "version": "0", "id": "3acb26c8-397c-4c89-a80a-ce672a864c55", "detail-type": "Amazon OpenSearch Service Auto-Tune Notification", "source": "aws.es", "account": "123456789012", "time": "2020-10-30T22:06:31Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Auto-Tune Event", "severity": "Informational", "status": "Pending", "description": "Auto-Tune recommends the following new settings for your domain: { JVM Heap size : 60%}. Enable Auto-Tune to improve cluster stability and performance.", "scheduleTime": "{iso8601-timestamp}" } }
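Auto-Tune started
OpenSearch Service sends this event when Auto-Tune begins to apply new settings to your domain.
Example
The following is an example event of this type: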
{ "version": "0", "id": "3acb26c8-397c-4c89-a80a-ce672a864c55", "detail-type": "Amazon OpenSearch Service Auto-Tune Notification", "source": "aws.es", "account": "123456789012", "time": "2020-10-30T22:06:31Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Auto-Tune Event", "severity": "Informational", "status": "Started", "scheduleTime": "{iso8601-timestamp}", "startTime": "{iso8601-timestamp}", "description" : "Auto-Tune is applying the following settings to your domain: { JVM Heap size : 60%}." } }
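Auto-Tune requires a scheduled blue/green deployment
OpenSearch Service sends this event when Auto-Tune has identified tuning recommendations that require a scheduled blue/green deployment.
Example
The following is an example event of this type: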
{ "version": "0", "id": "3acb26c8-397c-4c89-a80a-ce672a864c55", "detail-type": "Amazon OpenSearch Service Auto-Tune Notification", "source": "aws.es", "account": "123456789012", "time": "2020-10-30T22:06:31Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Auto-Tune Event", "severity": "Low", "status": "Pending", "startTime": "{iso8601-timestamp}", "description": "Auto-Tune has identified the following settings for your domain that require a blue/green deployment: { JVM Heap size : 60%}. You can schedule the deployment for your preferred time." } }
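Auto-Tune cancelled
OpenSearch Service sends this event when the Auto-Tune schedule is cancelled because there are no pending tuning recommendations.
Example
The following is an example event of this type: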
{ "version": "0", "id": "3acb26c8-397c-4c89-a80a-ce672a864c55", "detail-type": "Amazon OpenSearch Service Auto-Tune Notification", "source": "aws.es", "account": "123456789012", "time": "2020-10-30T22:06:31Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Auto-Tune Event", "severity": "Low", "status": "Cancelled", "scheduleTime": "{iso8601-timestamp}", "description": "Auto-Tune has cancelled the upcoming blue/green deployment." } }
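Auto-Tune completed
OpenSearch Service sends this event when Auto-Tune has completed the blue/green deployment and the cluster is running with the new JVM settings.
Example
The following is an example event of this type: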
{ "version": "0", "id": "3acb26c8-397c-4c89-a80a-ce672a864c55", "detail-type": "Amazon OpenSearch Service Auto-Tune Notification", "source": "aws.es", "account": "123456789012", "time": "2020-10-30T22:06:31Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Auto-Tune Event", "severity": "Informational", "status": "Completed", "completionTime": "{iso8601-timestamp}", "description": "Auto-Tune has completed the blue/green deployment and successfully applied the following settings: { JVM Heap size : 60%}." } }
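Auto-Tune disabled and changes reverted
OpenSearch Service sends this event when Auto-Tune has been disabled and the applied changes have been rolled back.
Example
The following is an example event of this type: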
{ "version": "0", "id": "3acb26c8-397c-4c89-a80a-ce672a864c55", "detail-type": "Amazon OpenSearch Service Auto-Tune Notification", "source": "aws.es", "account": "123456789012", "time": "2020-10-30T22:06:31Z", "region": "us-east-1", "resources": [ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail": { "event": "Auto-Tune Event", "severity": "Informational", "status": "Completed", "description": "Auto-Tune is now disabled. All settings have been reverted. Auto-Tune will continue to evaluate cluster performance and provide recommendations.", "completionTime": "{iso8601-timestamp}" } }
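Auto-Tune disabled and changes retained
OpenSearch Service sends this event when Auto-Tune has been disabled and the applied changes have been retained.
Example
The following is an example event of this type: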
{ "version": "0", "id": "3acb26c8-397c-4c89-a80a-ce672a864c55", "detail-type": "Amazon OpenSearch Service Auto-Tune Notification", "source": "aws.es", "account": "123456789012", "time": "2020-10-30T22:06:31Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Auto-Tune Event", "severity": "Informational", "status": "Completed", "description": "Auto-Tune is now disabled. The most-recent settings by Auto-Tune have been retained. Auto-Tune will continue to evaluate cluster performance and provide recommendations.", "completionTime": "{iso8601-timestamp}" } }
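The Auto-Tune samples carry different timestamp fields depending on the status (`scheduleTime`, `startTime`, `completionTime`). A consumer can collect whichever are present instead of assuming all of them exist. This is a minimal sketch; the function name and the summary shape are hypothetical.

```python
# Timestamp fields that appear in the Auto-Tune sample events.
TIME_FIELDS = ("scheduleTime", "startTime", "completionTime")


def summarize_auto_tune(event):
    """Collect status, severity, and any timestamps present in the event detail."""
    detail = event.get("detail", {})
    times = {name: detail[name] for name in TIME_FIELDS if name in detail}
    return {
        "status": detail.get("status"),
        "severity": detail.get("severity"),
        "times": times,
    }
```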
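Cluster health events
OpenSearch Service sends certain events to EventBridge when your cluster's health is compromised.
Red cluster recovery started
OpenSearch Service sends this event after your cluster status has been continuously red for more than an hour. It attempts to automatically restore one or more red indexes from a snapshot to fix the cluster status.
Example
The following is an example event of this type: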
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Cluster Status Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "event":"Automatic Snapshot Restore for Red Indices", "status":"Started", "severity":"High", "description":"Your cluster status is red. We have started automatic snapshot restore for the red indices. No action is needed from your side. Red indices [red-index-0, red-index-1]" } }
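Red cluster recovery partially completed
OpenSearch Service sends this event when, while attempting to fix a red cluster status, it was able to restore only a subset of the red indexes from a snapshot.
Example
The following is an example event of this type: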
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Cluster Status Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "event":"Automatic Snapshot Restore for Red Indices", "status":"Partially Restored", "severity":"High", "description":"Your cluster status is red. We were able to restore the following Red indices from snapshot: [red-index-0]. Indices not restored: [red-index-1]. Please refer https://docs.aws.amazon.com/opensearch-service/latest/developerguide/handling-errors.html#handling-errors-red-cluster-status for troubleshooting steps." } }
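Red cluster recovery failed
OpenSearch Service sends this event when it fails to restore any indexes while attempting to fix a red cluster status.
Example
The following is an example event of this type: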
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Cluster Status Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "event":"Automatic Snapshot Restore for Red Indices", "status":"Failed", "severity":"High", "description":"Your cluster status is red. We were unable to restore the Red indices automatically. Indices not restored: [red-index-0, red-index-1]. Please refer https://docs.aws.amazon.com/opensearch-service/latest/developerguide/handling-errors.html#handling-errors-red-cluster-status for troubleshooting steps." } }
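Shards to be deleted
OpenSearch Service sends this event when it has attempted to automatically fix a red cluster status after the cluster was continuously red for 14 days, but one or more indexes remain red. After 7 more days (21 total days of being continuously red), OpenSearch Service deletes the unassigned shards on all red indexes.
Example
The following is an example event of this type: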
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Cluster Status Notification", "source":"aws.es", "account":"123456789012", "time":"2022-04-09T10:36:48Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "severity":"Medium", "description":"Your cluster status is red. Please fix the red indices as soon as possible. If not fixed by 2022-04-12 01:51:47+00:00, we will delete all unassigned shards, the unit of storage and compute, for these red indices to recover your domain and make it green. Please refer to https://docs.aws.amazon.com/opensearch-service/latest/developerguide/handling-errors.html#handling-errors-red-cluster-status for troubleshooting steps. test_data, test_data1", "event":"Automatic Snapshot Restore for Red Indices", "status":"Shard(s) to be deleted" } }
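The 14-day and 21-day marks described above can be turned into concrete dates once you know when the cluster first went red. The sketch below does that arithmetic. The `red_since` input is an assumption: the events in this topic don't carry that timestamp, so you would have to track it yourself, and the deadline stated in the event description remains the authoritative value.

```python
from datetime import datetime, timedelta, timezone

WARNING_AFTER = timedelta(days=14)   # "shards to be deleted" notice
DELETION_AFTER = timedelta(days=21)  # unassigned shards are deleted


def red_cluster_timeline(red_since):
    """Return the projected warning and deletion times for a red cluster."""
    return {
        "warning_at": red_since + WARNING_AFTER,
        "deletion_at": red_since + DELETION_AFTER,
    }
```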
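Shards deleted
OpenSearch Service sends this event after your cluster status has been continuously red for 21 days. It deletes the unassigned shards (storage and compute) on all red indexes. For details, see Automatic remediation of red clusters.
Example
The following is an example event of this type: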
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Cluster Status Notification", "source":"aws.es", "account":"123456789012", "time":"2022-04-09T10:54:48Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "severity":"High", "description":"We have deleted unassinged shards, the unit of storage and compute, in red indices: index-1, index-2 because these indices were red for more than 21 days and could not be restored with the automated restore process. Please refer to https://docs.aws.amazon.com/opensearch-service/latest/developerguide/handling-errors.html#handling-errors-red-cluster-status for troubleshooting steps.", "event":"Automatic Snapshot Restore for Red Indices", "status":"Shard(s) deleted" } }
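High shard count warning
OpenSearch Service sends this event when the average shard count across your hot data nodes exceeds 90% of the recommended default limit of 1,000. Although later versions of Elasticsearch and OpenSearch support a configurable limit on the maximum number of shards per node, we recommend that you have no more than 1,000 shards per node. See Choosing the number of shards.
Example
The following is an example event of this type: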
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"High Shard Count", "status":"Warning", "severity":"Low", "description":"One or more data nodes have close to 1000 shards. To ensure optimum performance and stability of your cluster, please refer to the best practice guidelines - https://docs.aws.amazon.com/opensearch-service/latest/developerguide/sizing-domains.html#bp-sharding." } }
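Shard count limit exceeded
OpenSearch Service sends this event when the average shard count across your hot data nodes exceeds the recommended default limit of 1,000. Although later versions of Elasticsearch and OpenSearch support a configurable limit on the maximum number of shards per node, we recommend that you have no more than 1,000 shards per node. See Choosing the number of shards.
Example
The following is an example event of this type: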
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"High Shard Count", "status":"Warning", "severity":"Medium", "description":"One or more data nodes have more than 1000 shards. To ensure optimum performance and stability of your cluster, please refer to the best practice guidelines - https://docs.aws.amazon.com/opensearch-service/latest/developerguide/sizing-domains.html#bp-sharding." } }
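The two shard count events correspond to two thresholds on the same number: 90% of the recommended limit, and the limit itself. A minimal sketch of that classification follows, assuming you already have the average shard count per hot data node from your own monitoring. The function name and return labels are hypothetical.

```python
RECOMMENDED_LIMIT = 1000  # recommended maximum shards per node
WARNING_PERCENT = 90      # warning threshold, as a percentage of the limit


def classify_shard_count(avg_shards_per_node, limit=RECOMMENDED_LIMIT):
    """Classify an average shard count against the thresholds described above."""
    if avg_shards_per_node > limit:
        return "exceeded"
    # Integer comparison avoids floating-point rounding at the boundary.
    if avg_shards_per_node * 100 > limit * WARNING_PERCENT:
        return "warning"
    return "ok"
```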
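Low disk space
OpenSearch Service sends this event when one or more nodes in your cluster have less than 25% of available storage space, or less than 25 GB.
Example
The following is an example event of this type: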
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"Low Disk Space", "status":"Warning", "severity":"Medium", "description":"One or more data nodes in your cluster has less than 25% of storage space or less than 25GB. Your cluster will be blocked for writes at 20% or 20GB. Please refer to the documentation for more information - https://docs.aws.amazon.com/opensearch-service/latest/developerguide/handling-errors.html#troubleshooting-cluster-block" } }
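Low disk watermark breach
OpenSearch Service sends this event when all nodes in the cluster have less than 10% of available storage space, or less than 10 GB. When all nodes breach the low disk watermark, any new index results in a yellow cluster, and when all nodes fall below the high disk watermark, it results in a red cluster.
Example
The following is an example event of this type: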
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"Low Disk Watermark Breach", "status":"Warning", "severity":"Medium", "description":"Low Disk Watermark threshold is about to be breached. Once the threshold is breached, new index creation will be blocked on all nodes to prevent the cluster status from turning red. Please increase disk size to suit your storage needs. For more information, see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/handling-errors.html#troubleshooting-cluster-block" } }
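EBS burst balance below 70%
OpenSearch Service sends this event when the EBS burst balance on one or more data nodes falls below 70%. Depleting the EBS burst balance can cause widespread cluster unavailability and throttling of I/O requests, which leads to high latencies and timeouts for indexing and search requests. For steps to fix this issue, see Low EBS burst balance.
Example
The following is an example event of this type: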
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"EBS Burst Balance", "status":"Warning", "severity":"Medium", "description":"EBS burst balance on one or more data nodes is below 70%. Follow https://docs.aws.amazon.com/opensearch-service/latest/developerguide/handling-errors.html#handling-errors-low-ebs-burst to fix this issue." } }
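EBS burst balance below 20%
OpenSearch Service sends this event when the EBS burst balance on one or more data nodes falls below 20%. Depleting the EBS burst balance can cause widespread cluster unavailability and throttling of I/O requests, which leads to high latencies and timeouts for indexing and search requests. For steps to fix this issue, see Low EBS burst balance.
Example
The following is an example event of this type: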
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"EBS Burst Balance", "status":"Warning", "severity":"High", "description":"EBS burst balance on one or more data nodes is below 20%. Follow https://docs.aws.amazon.com/opensearch-service/latest/developerguide/handling-errors.html#handling-errors-low-ebs-burst to fix this issue." } }
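Disk throughput throttle
OpenSearch Service sends this event when read and write requests to your domain are being throttled because of the throughput limits of your EBS volumes or EC2 instance. If you receive this notification, consider scaling up your volumes or instances following AWS recommended best practices. If your volume type is gp2, increase the volume size. If your volume type is gp3, provision more throughput. You can also check whether your instance's baseline and maximum EBS throughput are greater than or equal to the provisioned volume throughput, and scale up accordingly.
Example
The following is an example event of this type: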
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"Disk Throughput Throttle", "status":"Warning", "severity":"Medium", "description":"Your domain is experiencing throttling due to instance or volume throughput limitations. Please consider scaling your domain to suit your throughput needs. In July 2023, we improved the accuracy of throughput throttle calculation by replacing ‘Max volume throughput’ with ‘Provisioned volume throughput’. Please refer to the documentation for more information." } }
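Large shard size
OpenSearch Service sends this event when one or more shards in your cluster exceed 50 GiB or 65 GiB. To ensure optimum cluster performance and stability, reduce your shard sizes.
For more information, see Sharding best practices.
Example
The following is an example event of this type: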
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"Large Shard Size", "status":"Warning", "severity":"Medium", "description":"One or more shards are larger than 65GiB. To ensure optimum cluster performance and stability, reduce shard sizes. For more information, see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/monitoring-events.html#monitoring-events-large-shard-size." } }
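High JVM usage
OpenSearch Service sends this event when the JVMMemoryPressure metric for your domain exceeds 80%. If it exceeds 92% for 30 minutes, all write operations to your cluster are blocked. To ensure optimum cluster stability, reduce traffic to the cluster or scale your domain to provide enough memory for your workload.
Example
The following is an example event of this type: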
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"High JVM Usage", "status":"Warning", "severity":"High", "description":"JVM memory pressure has exceeded 80%. If it exceeds 92% for 30 minutes, all write operations to your cluster will be blocked. To ensure optimum cluster stability, reduce traffic to the cluster or use larger instance types. For more information, see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/monitoring-events.html#monitoring-events-high-jvm." } }
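The two JVM thresholds above (80% for the warning, 92% sustained for 30 minutes for the write block) can be checked against a series of JVMMemoryPressure readings. The sketch below assumes one reading per minute, oldest first, so that 30 consecutive readings stand in for 30 minutes. That sampling interval, the function name, and the return labels are all assumptions made for illustration.

```python
def jvm_pressure_state(samples, warn=80.0, block=92.0, window=30):
    """Classify a series of JVM memory pressure readings (percent, oldest first).

    Assumes one reading per minute, so `window` readings cover 30 minutes.
    """
    if not samples:
        return "ok"
    recent = samples[-window:]
    # Sustained breach: every reading in a full window is above the block level.
    if len(recent) == window and all(s > block for s in recent):
        return "write_block_risk"
    if samples[-1] > warn:
        return "warning"
    return "ok"
```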
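Insufficient GC
OpenSearch Service sends this event when the maximum JVM is above 70% and the difference between the maximum and the minimum is less than 30%. This might indicate that the JVM can't reclaim enough memory during garbage collection cycles for your workload. It can lead to progressively slower responses and higher latencies; in some cases, nodes can even drop out because health checks time out. To ensure optimum cluster stability, reduce traffic to the cluster or scale your domain to provide enough memory for your workload.
Example
The following is an example event of this type: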
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"Insufficient GC", "status":"Warning", "severity":"Medium", "description":"Maximum JVM is above 70% and JVM range is less than 30%. This may indicate insufficient garbage collection for your workload. For more information, see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/monitoring-events.html#monitoring-events-insufficient-gc." } }
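The condition above combines two checks on the same series of JVM readings: the maximum is above 70%, and the spread between maximum and minimum is under 30 percentage points. A minimal sketch of that check follows. Which readings the service evaluates, and over what period, isn't stated in this topic, so the input list is an assumption.

```python
def insufficient_gc(jvm_samples):
    """Return True when max JVM is above 70% and the max-min range is under 30%."""
    if not jvm_samples:
        return False
    highest, lowest = max(jvm_samples), min(jvm_samples)
    return highest > 70 and (highest - lowest) < 30
```

A healthy heap typically shows a wide sawtooth (large range) as collections free memory; a high maximum with a narrow range is what this check flags.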
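Custom index routing warning
OpenSearch Service sends this event when your domain is in a processing state and contains indexes with custom index.routing.allocation settings, which can cause blue/green deployments to get stuck. Verify that the settings are applied properly.
Example
The following is an example event of this type: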
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"Custom Index Routing Warning", "status":"Warning", "severity":"Medium", "description":"Your domain is in processing state and contains indice(s) with custom index.routing.allocation settings which can cause blue-green deployments to get stuck. Verify settings are applied properly. For more information, see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/monitoring-events.html#monitoring-events-index-routing." } }
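Failed shard lock
OpenSearch Service sends this event when your domain is unhealthy because of unassigned shards with [ShardLockObtainFailedException]. For more information, see How do I resolve the in-memory shard lock exception in Amazon OpenSearch Service.
Example
The following is an example event of this type: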
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2017-12-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"Failed Shard Lock", "status":"Warning", "severity":"Medium", "description":"Your domain is unhealthy due to unassigned shards with [ShardLockObtainFailedException]. For more information, see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/monitoring-events.html#monitoring-events-failed-shard-lock." } }
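VPC endpoint events
OpenSearch Service sends certain events to EventBridge that relate to AWS PrivateLink interface endpoints.
VPC endpoint create failed
OpenSearch Service sends this event when it can't create a requested VPC endpoint. This error might occur because you've reached the limit on the number of VPC endpoints allowed in a Region. You also see this error if the specified subnet or security group doesn't exist.
Example
The following is an example event of this type: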
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service VPC Endpoint Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "event":"VPC Endpoint Create Validation", "status":"Failed", "severity":"High", "description":"Unable to create VPC endpoint aos-0d4c74c0342343 for domain arn:aws:es:eu-south-1:123456789012:domain/my-domain due to the following validation failures: You've reached the limit on the number of VPC endpoints that you can create in the AWS Region." } }
VPC endpoint update failed
OpenSearch Service sends this event when it can't update a requested VPC endpoint.
Example
The following is an example event of this type:
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service VPC Endpoint Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "event":"VPC Endpoint Update Validation", "status":"Failed", "severity":"High", "description":"Unable to update VPC endpoint aos-0d4c74c0342343 for domain arn:aws:es:eu-south-1:123456789012:domain/my-domain due to the following validation failures: <failure message>." } }
VPC endpoint delete failed
OpenSearch Service sends this event when it can't delete a requested VPC endpoint.
Example
The following is an example event of this type:
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service VPC Endpoint Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "event":"VPC Endpoint Delete Validation", "status":"Failed", "severity":"High", "description":"Unable to delete VPC endpoint aos-0d4c74c0342343 for domain arn:aws:es:eu-south-1:123456789012:domain/my-domain due to the following validation failures: Specified subnet doesn't exist." } }
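Node retirement events
OpenSearch Service sends events to EventBridge when one of the following node retirement events occurs.
Node retirement scheduled
OpenSearch Service sends this event when a node retirement has been scheduled.
Example
The following is an example event of this type: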
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Notification", "source": "aws.es", "account": "123456789012", "time": "2023-04-07T10:07:33Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Node Retirement Notification", "status": "Scheduled", "severity": "Medium", "description": "An automated action to retire and replace a node has been scheduled on your domain. The node will be replaced in the next off-peak window. For more information, see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/monitoring-events.html." } }
Node retirement completed
OpenSearch Service sends this event when a node retirement has completed.
Example
The following is an example event of this type:
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Notification", "source": "aws.es", "account": "123456789012", "time": "2023-04-07T10:07:33Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Node Retirement Notification", "status": "Completed", "severity": "Medium", "description": "The node has been retired and replaced with a new node." } }
Node retirement failed
OpenSearch Service sends this event when a node retirement fails.
Example
The following is an example event of this type:
{ "version": "0", "id": "01234567-0123-0123-0123-012345678901", "detail-type": "Amazon OpenSearch Service Notification", "source": "aws.es", "account": "123456789012", "time": "2023-04-07T10:07:33Z", "region": "us-east-1", "resources": ["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail": { "event": "Node Retirement Notification", "status": "Failed", "severity": "Medium", "description": "Node retirement failed. No actions are required from your end. We will automatically retry replacing the node." } }
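Every sample event in this topic identifies the affected domain by ARN in the `resources` array. When a notification needs the bare domain name, it can be read from the ARN's resource part. The sketch below assumes the `domain/<name>` resource format seen in these samples and skips anything that doesn't fit it. The function name is hypothetical.

```python
def domain_names(event):
    """Extract domain names from ARNs shaped like arn:aws:es:region:account:domain/name."""
    names = []
    for arn in event.get("resources", []):
        # The resource part is everything after the fifth colon.
        resource = arn.split(":", 5)[-1]
        kind, _, name = resource.partition("/")
        if kind == "domain" and name:
            names.append(name)
    return names
```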
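Degraded node retirement events
OpenSearch Service sends these events when a node must be replaced because of degraded hardware on the node.
Degraded node retirement notification
OpenSearch Service sends this event when an automated action to retire and replace a degraded node has been scheduled for your domain.
Example
The following is an example event of this type: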
{ "version":"0", "id":"db233454-aad1-7676-3b15-10a84b052baa", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2024-01-11T08:16:06Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-node-replacement" ], "detail":{ "severity":"Medium", "description":"An automated action to retire and replace a node has been scheduled on your domain. For more information, please see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/monitoring-events.html.", "event":"Degraded Node Retirement Notification", "status":"Scheduled" } }
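Degraded node retirement completed
OpenSearch Service sends this event when the degraded node has been retired and replaced with a new node.
Example
The following is an example event of this type: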
{ "version":"0", "id":"7444215c-90f9-a52d-bcda-e85973a9a762", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2024-01-11T10:20:30Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-node-replacement" ], "detail":{ "severity":"Medium", "description":"The node has been retired and replaced with a new node.", "event":"Degraded Node Retirement Notification", "status":"Completed" } }
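Degraded node retirement failed
OpenSearch Service sends this event when the retirement of a degraded node fails.
Example
The following is an example event of this type: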
{ "version":"0", "id":"c328e9bb-93b9-c0b2-b17a-df527fdf96b6", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2024-01-11T08:31:38Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-node-replacement" ], "detail":{ "severity":"Medium", "description":"Node retirement failed. No actions are required from your end. We will automatically re-try replacing the node.", "event":"Degraded Node Retirement Notification", "status":"Failed" } }
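Domain error events
OpenSearch Service sends events to EventBridge when one of the following domain errors occurs.
Domain update validation failure
OpenSearch Service sends this event when it encounters one or more validation failures while attempting to update a domain or perform a configuration change. For steps to resolve these failures, see Troubleshooting validation errors.
Example
The following is an example event of this type: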
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Domain Update Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":[ "arn:aws:es:us-east-1:123456789012:domain/test-domain" ], "detail":{ "event":"Domain Update Validation", "status":"Failed", "severity":"High", "description":"Unable to perform updates to your domain due to the following validation failures: <failures> Please see the documentation for more information https://docs.aws.amazon.com/opensearch-service/latest/developerguide/managedomains-configuration-changes.html#validation" } }
KMS key inaccessible
OpenSearch Service sends this event when it can't access your AWS KMS key.
Example
The following is an example event of this type:
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Domain Error Notification", "source":"aws.es", "account":"123456789012", "time":"2016-11-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"KMS Key Inaccessible", "status":"Error", "severity":"High", "description":"The KMS key associated with this domain is inaccessible. You are at risk of losing access to your domain. For more information, please refer to https://docs.aws.amazon.com/opensearch-service/latest/developerguide/encryption-at-rest.html#disabled-key." } }
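Domain isolation
OpenSearch Service sends this event when your domain becomes isolated and can't receive, read, or write requests because it is unreachable by network.
Example
The following is an example event of this type: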
{ "version":"0", "id":"01234567-0123-0123-0123-012345678901", "detail-type":"Amazon OpenSearch Service Notification", "source":"aws.es", "account":"123456789012", "time":"2023-11-01T13:12:22Z", "region":"us-east-1", "resources":["arn:aws:es:us-east-1:123456789012:domain/test-domain"], "detail":{ "event":"Domain Isolation Notification", "status":"Error", "severity":"High", "description":"Your OpenSearch Service domain has been isolated. An isolated domain is unreachable by network and cannot receive, read, or write requests. For more information and assistance, please contact AWS Support at https://docs.aws.amazon.com/opensearch-service/latest/developerguide/encryption-at-rest.html#disabled-key." } }
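The sample events in this topic use four `severity` values: Informational, Low, Medium, and High. A consumer that wants to page only on the more serious events can rank them and filter by a minimum. The ordering below is an assumption inferred from the names, since this topic doesn't define a ranking, and the function name is hypothetical.

```python
# Assumed ordering of the severity values seen in the sample events.
SEVERITY_RANK = {"Informational": 0, "Low": 1, "Medium": 2, "High": 3}


def at_least(event, minimum="Medium"):
    """Return True when the event's severity is at or above `minimum`.

    Events with a missing or unrecognized severity are treated as below any minimum.
    """
    severity = event.get("detail", {}).get("severity")
    return SEVERITY_RANK.get(severity, -1) >= SEVERITY_RANK[minimum]
```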