This whitepaper is for historical reference only. Some content might be outdated and some links might not be available.
Observability
Observability is to continuously discover actionable insights based on signals from the system under observation over time. The signals can be low-level such as CPU, memory, disk space, and higher-level such as business signals including API response times, error rates, and transactions per second. Having insight into metrics, logs, and traces, understanding what is happening not just at the cluster or host level, but also within the container runtime and application, helps organizations make better informed decisions, such as when to scale in/out nodes/tasks/pods, change Amazon EC2 instance types, and purchasing options (on-demand, reserved, and spot).
Treating logs as a continuous stream of events instead of as static files allows you to react to the continuous nature of log generation. You can capture, store, and analyze real-time log data to get meaningful insights into the application’s performance, network, and other characteristics. An application must not be required to manage its own log files.
You can specify the awslogs
log driver for containers in your Amazon ECS task
definition under the logConfiguration object to
ship the stdout
and stderr
I/O streams to a designated log group in
Amazon CloudWatch logs for viewing and archival. Additionally, FireLens for Amazon ECS enables you to use
task definition parameters with the awsfirelens
log driver to route logs to other
AWS services or third-party log aggregation tools for log storage and analytics. FireLens
works with Fluentd and Fluent Bit, fully compatible with Kubernetes. Using the
Fluent Bit daemonset, you can send container logs from your Amazon EKS clusters to CloudWatch logs.
Amazon CloudWatch is a monitoring service that you can use to collect various system, and application-wide metrics and logs, and set alarms. CloudWatch Container Insights helps you explore, aggregate, and summarize your container metrics, application logs and performance log events at the cluster, node, pod, task, and service level through automated dashboards in the CloudWatch console. Container Insights also provides diagnostic information, such as container restart failures, crashloop backoffs in an Amazon EKS cluster to help you isolate issues and resolve them quickly. Container Insights is available for Amazon Elastic Container Service (Amazon ECS, including Fargate), Amazon Elastic Kubernetes Service (Amazon EKS), and Kubernetes platforms on Amazon EC2.
AWS X-Ray
Amazon Managed Service for Prometheus
Amazon Managed Grafana
AWS Distro for OpenTelemetry