Scale to zero for Amazon OpenSearch Serverless
Note
Scale to zero is only available for NextGen collections that are part of a collection group. For more information, see Amazon OpenSearch Serverless collection groups.
Scale to zero automatically shuts down compute resources when all collections in a collection group are idle. This eliminates charges for unused capacity. When no collection in the group has incoming requests for 10 minutes, search and indexing workers scale down to zero OCU and billing stops. When traffic resumes to any collection in the group, workers are automatically provisioned and autoscaling resumes based on your request pattern.
This is ideal for development environments, batch processing workloads, and applications with predictable idle periods.
Scale to zero behavior
The following describes how scale to zero works for your collections:
-
NextGen collection groups default to a minimum OCU of 0 for both indexing and search unless otherwise specified.
-
After 10 minutes of no incoming requests across all collections in the group, compute resources scale to zero OCU. This idle period is not configurable.
-
Search and indexing scale to zero and wake independently. Each component remains at zero until it receives its own traffic.
-
When traffic resumes, OpenSearch Serverless provisions workers at the same tier as before scale-to-zero:
-
Search requests — two search workers
-
Indexing requests — one indexing worker
-
-
Expect 10–30 seconds of latency on the first request to each component while capacity is restored.
Enabling scale to zero
To enable scale to zero, create a collection group with a minimum OCU of 0 for both indexing and search, then create a collection within that group.
Enabling scale to zero
-
Create a collection group with zero minimum OCU:
aws opensearchserverless create-collection-group \ --namecollection-group-name\ --standby-replicas ENABLED \ --generation NEXTGEN \ --capacity-limits '{ "maxIndexingCapacityInOCU": 8, "maxSearchCapacityInOCU": 8, "minIndexingCapacityInOCU": 0, "minSearchCapacityInOCU": 0 }' -
Create a collection in the group:
aws opensearchserverless create-collection \ --namecollection-name\ --typecollection-type\ --collection-group-namecollection-group-name\ --standby-replicas ENABLED
Opting out of scale to zero
If you don't want your collection capacities to scale to zero, make sure they are part of a collection group with minimum capacity set to a non-zero value.