Lambda quotas - AWS Lambda

Lambda quotas

Important

New AWS accounts have reduced concurrency and memory quotas. AWS raises these quotas automatically based on your usage.

Compute and storage

Lambda sets quotas for the amount of compute and storage resources that you can use to run and store functions. Quotas for concurrent executions and storage apply per AWS Region. Elastic network interface (ENI) quotas apply per virtual private cloud (VPC), regardless of Region. The following quotas can be increased from their default values. For more information, see Requesting a quota increase in the Service Quotas User Guide.

Resource Default quota Can be increased up to

Concurrent executions

1,000

Tens of thousands

Storage for uploaded functions (.zip file archives) and layers. Each function version and layer version consumes storage.

For best practices on managing your code storage, see Monitoring Lambda code storage in Serverless Land.

75 GB

Terabytes

Storage for functions defined as container images. These images are stored in Amazon ECR.

See Amazon ECR service quotas.

Elastic network interfaces per virtual private cloud (VPC)

Note

This quota is shared with other services, such as Amazon Elastic File System (Amazon EFS). See Amazon VPC quotas.

250

Thousands

For details on concurrency and how Lambda scales your function concurrency in response to traffic, see Lambda function scaling.

Function configuration, deployment, and execution

The following quotas apply to function configuration, deployment, and execution. Except as noted, they can't be changed.

Note

The Lambda documentation, log messages, and console use the abbreviation MB (rather than MiB) to refer to 1,024 KB.

Resource Quota

Function memory allocation

128 MB to 10,240 MB, in 1-MB increments.

Note: Lambda allocates CPU power in proportion to the amount of memory configured. You can increase or decrease the memory and CPU power allocated to your function using the Memory (MB) setting. At 1,769 MB, a function has the equivalent of one vCPU.

Function timeout

900 seconds (15 minutes)

Function environment variables

4 KB, for all environment variables associated with the function, in aggregate

Function resource-based policy

20 KB

Function layers

five layers

Function concurrency scaling limit

For each function, 1,000 execution environments every 10 seconds

Invocation payload (request and response)

6 MB each for request and response (synchronous)

20 MB for each streamed response (Synchronous. The payload size for streamed responses can be increased from default values. Contact AWS Support to inquire further.)

256 KB (asynchronous)

1 MB for the total combined size of request line and header values

Bandwidth for streamed responses

Uncapped for the first 6 MB of your function's response

For responses larger than 6 MB, 2MBps for the remainder of the response

Deployment package (.zip file archive) size

50 MB (zipped, for direct upload)

250 MB (unzipped)

This quota applies to all the files you upload, including layers and custom runtimes.

3 MB (console editor)

Container image settings size

16 KB

Container image code package size

10 GB (maximum uncompressed image size, including all layers)

Test events (console editor)

10

/tmp directory storage

Between 512 MB and 10,240 MB, in 1-MB increments

File descriptors

1,024

Execution processes/threads

1,024

Lambda API requests

The following quotas are associated with Lambda API requests.

Resource Quota

Invocation requests per function per Region (synchronous)

Each instance of your execution environment can serve up to 10 requests per second. In other words, the total invocation limit is 10 times your concurrency limit. See Lambda function scaling.

Invocation requests per function per Region (asynchronous)

Each instance of your execution environment can serve an unlimited number of requests. In other words, the total invocation limit is based only on concurrency available to your function. See Lambda function scaling.

Invocation requests per function version or alias (requests per second)

10 x allocated provisioned concurrency

Note

This quota applies only to functions that use provisioned concurrency.

GetFunction API requests

100 requests per second. Cannot be increased.

GetPolicy API requests

15 requests per second. Cannot be increased.

Remainder of the control plane API requests (excludes invocation, GetFunction, and GetPolicy requests)

15 requests per second across all APIs (not 15 requests per second per API). Cannot be increased.

Other services

Quotas for other services, such as AWS Identity and Access Management (IAM), Amazon CloudFront (Lambda@Edge), and Amazon Virtual Private Cloud (Amazon VPC), can impact your Lambda functions. For more information, see AWS service quotas in the Amazon Web Services General Reference, and Using AWS Lambda with other services.