Performance at scale - AWS Serverless Multi-Tier Architectures with Amazon API Gateway and AWS Lambda

Performance at scale

Code pulled in as a container image from Amazon Elastic Container Registry (Amazon ECR), or from a zip file uploaded to Amazon S3, runs in an isolated environment managed by AWS. You do not have to scale your Lambda functions—each time an event notification is received by your function, AWS Lambda locates available capacity within its compute fleet and runs your code with runtime, memory, disk, and timeout configurations that you define. With this pattern, AWS can start as many copies of your function as needed.

A Lambda-based logic tier is always right sized for your customer needs. The ability to quickly absorb surges in traffic through managed scaling and concurrent code initiation, combined with Lambda pay-per-use pricing, enables you to always meet customer requests while simultaneously not paying for idle compute capacity.