Monitoring
After you implement your workload, you must monitor its performance so that you can remediate any issues before they impact your customers. Monitoring metrics should be used to raise alarms when thresholds are breached.
Amazon CloudWatch is a monitoring and observability service that provides you with data and actionable insights to monitor your workload, respond to system-wide performance changes, optimize resource utilization, and get a unified view of operational health. CloudWatch collects monitoring and operational data in the form of logs, metrics, and events from workloads that run on AWS and on-premises servers. AWS X-Ray helps developers analyze and debug production, distributed applications. With AWS X-Ray, you can glean insights into how your application is performing and discover root causes and identify performance bottlenecks. You can use these insights to react quickly and keep your workload running smoothly.
The following question focuses on these considerations for performance efficiency.
PERF 7: How do you monitor your resources to verify they are performing? |
---|
System performance can degrade over time. Monitor system performance to identify degradation and remediate internal or external factors, such as the operating system or application load. |
Validating that you do not see false positives is key to an effective monitoring solution. Automated initiation functions avoid human error and can reduce the time it takes to fix problems. Plan for game days, where simulations are conducted in the production environment, to test your alarm solution and verify that it correctly recognizes issues.