Designing for high availability and resiliency in Amazon EKS applications - AWS Prescriptive Guidance

Designing for high availability and resiliency in Amazon EKS applications

Haofei Feng and Rus Kalakutskiy, Amazon Web Services (AWS)

January 2025 (document history)

Ensuring high availability (HA) and resiliency in application design is crucial for achieving near-zero recovery point objective (RPO) and recovery time objective (RTO). As organizations increasingly migrate and modernize their applications to Kubernetes environments, the demand for robust and scalable solutions continues to increase. Amazon Elastic Kubernetes Service (Amazon EKS) helps you to efficiently manage containerized applications at scale.

This guide delves into a set of widely recognized recommendations and best practices for designing and managing Amazon EKS microservice applications. Based on extensive experience and real-world deployments, these insights offer valuable guidance for architects and developers. Implement these recommendations for high performance, reliability, and scalability of your Kubernetes-based applications to achieve robust operations.