Creating incidents automatically or manually in Incident Manager
Incident Manager, a capability of AWS Systems Manager, helps you manage and quickly respond to incidents. You can configure Amazon CloudWatch and Amazon EventBridge to automatically create incidents based on CloudWatch alarms and EventBridge events. You can also create incidents manually on the incident list page or by using the StartIncident API action from the AWS CLI or the AWS SDK. Incident Manager deduplicates incidents created from the same CloudWatch alarm or EventBridge event into the same incident.
For incidents automatically created by CloudWatch alarms or EventBridge events, Incident Manager attempts to create an incident in the same AWS Region as the event rule or alarm. In the event that Incident Manager is not available in the AWS Region, CloudWatch or EventBridge automatically create the incident in one of the available Regions specified in your replication set. For more information, see Managing incidents across AWS accounts and Regions in Incident Manager.
When the system creates an incident, Incident Manager automatically collects information about the AWS resources involved in the incident and adds this information to the Related items tab. If you specified a runbook in your response plan, when the system creates an incident, Incident Manager can send the information about the AWS resources involved in the incident to the runbook. The system can then target those resources when it initiates the runbook and attempts to remediate the issue.
When the system creates an incident, it also creates a parent operational workitem
(OpsItem) in OpsCenter, a component of Systems Manager, and links it to the incident as a related
item. You can use this OpsItem to track related work and future incident analyses. Calls to
OpsCenter incur costs. For more information about OpsCenter pricing, see Systems Manager pricing
Important
Note the following important details.
-
In the event that Incident Manager is not available, the system can only fail over and create incidents in other AWS Regions if you have specified at least two Regions in your replication set. For information about configuring a replication set, see Getting started with Incident Manager.
Incidents created by a cross-Region failover don't invoke runbooks specified in response plans.
Creating incidents automatically with CloudWatch alarms
CloudWatch uses your CloudWatch metrics to alert you about changes in your environment and to automatically perform the start incident action. CloudWatch works with Systems Manager and Incident Manager to create an incident from a response plan template when an alarm goes into alarm state. This requires the following prerequisites:
-
Incident Manager configured and replication set created. This step creates the Incident Manager service linked role in your account, providing the necessary permissions.
-
A configured Incident Manager response plan. To learn how to configure Incident Manager response plans, see Creating and configuring response plans in Incident Manager in the Incident preparation section of this guide.
-
Configured CloudWatch metrics monitoring your application. For monitoring best practices, see Monitoring in the Incident preparation section of this guide.
To create an alarm with a Start incident action
-
Create an alarm in CloudWatch. For more information, see Using Amazon CloudWatch alarms in the Amazon CloudWatch User Guide.
-
When choosing the action for the alarm to perform, select Add Systems Manager action.
-
Choose Create incident and select the Response plan for this incident.
-
Complete the remaining steps in your selected alarm type guide.
Tip
You can also add the create incident action to any existing alarm.
Creating incidents automatically with EventBridge events
EventBridge rules watch for event patterns. If the event matches the defined pattern, Incident Manager creates an incident using the chosen response plan.
Creating incidents using SaaS partners events
You can configure EventBridge to receive events from software as a service (SaaS) partner applications and services, allowing for third-party integration. After configuring EventBridge to receive events from third-party partners, you can create rules that match on partner events to create incidents. To see a list of third-party integrations, see Receiving events from a SaaS partner.
Configure EventBridge to receive events from a SaaS integration.
Open the Amazon EventBridge console at https://console.aws.amazon.com/events/
. -
In the navigation pane, choose Partner event sources.
-
Use the search bar to find the partner that you want and choose Set up for that partner.
-
Choose Copy to copy your account ID to the clipboard.
Note
To integrate with Salesforce use the steps described in the Amazon AppFlow user guide.
-
Go to the partner's website and follow the instructions to create a partner event source. Use your account ID for this. The event source that you create is available only on your account.
-
Go back to the EventBridge console and choose Partner event sources in the navigation pane.
-
Select the button next to the partner event source, and choose Associate with event bus.
Create a rule that triggers on events from a SaaS partner
Open the Amazon EventBridge console at https://console.aws.amazon.com/events/
. -
In the navigation pane, choose Rules.
-
Choose Create rule.
-
Enter a name and description for the rule.
A rule can't have the same name as another rule in the same Region and on the same event bus.
-
For Event bus, choose the event bus that corresponds to this partner.
-
For Rule type, choose Rule with an event pattern.
-
Choose Next.
-
For Event source, choose AWS events or EventBridge partner events.
-
For Event pattern, choose Event pattern form.
-
For Event source, choose EventBridge partners
-
For Partners, choose the name of the partner.
-
For Event type, choose All Events or choose the type of event to use for this rule. If you choose All Events, all events emitted by this partner event source will match the rule.
If you want to customize the event pattern, choose Edit, make your changes, and then choose Save.
-
Choose Next.
-
For Select a target, choose Incident Manager response plan, and then choose a Response plan.
Note
When selecting a response plan, all response plans that you own and have been shared with your account appear in the Response plan dropdown list.
-
EventBridge can create the IAM role needed for your rule to run:
-
To create an IAM role automatically, choose Create a new role for this specific resource.
-
To use an IAM role that you created before, choose Use existing role.
-
-
Choose Next.
-
(Optional) Enter one or more tags for the rule. For more information, see Amazon EventBridge tags in the Amazon EventBridge User Guide.
-
Choose Next.
-
Review your rule then choose Create rule.
Creating incidents using AWS service events
EventBridge also receives events from the AWS services listed in Events from Supported AWS Services. Similar to how you configure rules for SaaS partners, you can configure them for AWS services.
Create a rule that triggers on events from an AWS service
Open the Amazon EventBridge console at https://console.aws.amazon.com/events/
. -
In the navigation pane, choose Rules.
-
Choose Create rule.
-
Enter a name and description for the rule.
A rule can't have the same name as another rule in the same Region and on the same event bus.
-
For Event bus, choose default.
-
For Rule type, choose Rule with an event pattern.
-
Choose Next.
-
For Event source, choose AWS events or EventBridge partner events.
-
For Event pattern, choose Event pattern form.
-
For Event source, choose AWS services.
-
For Service name, choose the service that monitors for an incident.
-
For Event type, choose All Events or choose the type of event to use for this rule. If you choose All Events, all events emitted by this partner event source will match the rule.
If you want to customize the event pattern, choose Edit, make your changes, and then choose Save.
-
Choose Next.
-
For Select a target, choose Incident Manager response plan, and then choose a Response plan.
Note
When selecting a response plan, all response plans that you own and have been shared with your account appear in the Response plan dropdown list.
-
EventBridge can create the IAM role needed for your rule to run:
-
To create an IAM role automatically, choose Create a new role for this specific resource.
-
To use an IAM role that you created before, choose Use existing role.
-
-
Choose Next.
-
(Optional) Enter one or more tags for the rule. For more information, see Amazon EventBridge tags in the Amazon EventBridge User Guide.
-
Choose Next.
-
Review your rule then choose Create rule.
Creating incidents manually
Responders can manually track an incident using the Incident Manager console by using a predefined response plan. Use the following steps to create an incident.
-
Open the Incident Manager console
. -
Choose Start incident.
-
For Response plan, choose a response plan from the list.
-
(Optional) To override the title provided by the defined response plan, enter an Incident title.
-
(Optional) To override the impact provided by the defined response plan, enter the Impact of the incident.