package software.amazon.awscdk.services.applicationautoscaling

AWS Auto Scaling Construct Library

Application AutoScaling is used to configure autoscaling for all services other than scaling EC2 instances. For example, you will use this to scale ECS tasks, DynamoDB capacity, Spot Fleet sizes, Comprehend document classification endpoints, Lambda function provisioned concurrency and more.

As a CDK user, you will probably not have to interact with this library directly; instead, it will be used by other construct libraries to offer AutoScaling features for their own constructs.

This document will describe the general autoscaling features and concepts; your particular service may offer only a subset of these.

AutoScaling basics

Resources can offer one or more attributes to autoscale, typically representing some capacity dimension of the underlying service. For example, a DynamoDB Table offers autoscaling of the read and write capacity of the table proper and its Global Secondary Indexes, an ECS Service offers autoscaling of its task count, an RDS Aurora cluster offers scaling of its replica count, and so on.

When you enable autoscaling for an attribute, you specify a minimum and a maximum value for the capacity. AutoScaling policies that respond to metrics will never go higher or lower than the indicated capacity (but scheduled scaling actions might, see below).

There are three ways to scale your capacity:

In response to a metric (also known as step scaling); for example, you might want to scale out if the CPU usage across your cluster starts to rise, and scale in when it drops again.
By trying to keep a certain metric around a given value (also known as target tracking scaling); you might want to automatically scale out an in to keep your CPU usage around 50%.
On a schedule; you might want to organize your scaling around traffic flows you expect, by scaling out in the morning and scaling in in the evening.

The general pattern of autoscaling will look like this:

 SomeScalableResource resource;
 
 
 ScalableAttribute capacity = resource.autoScaleCapacity(new Caps()
         .minCapacity(5)
         .maxCapacity(100)
         );

Step Scaling

This type of scaling scales in and out in deterministic steps that you configure, in response to metric values. For example, your scaling strategy to scale in response to CPU usage might look like this:

  Scaling        -1          (no change)          +1       +3
             │        │                       │        │        │
             ├────────┼───────────────────────┼────────┼────────┤
             │        │                       │        │        │
 CPU usage   0%      10%                     50%       70%     100%

(Note that this is not necessarily a recommended scaling strategy, but it's a possible one. You will have to determine what thresholds are right for you).

You would configure it like this:

 ScalableAttribute capacity;
 Metric cpuUtilization;
 
 
 capacity.scaleOnMetric("ScaleToCPU", BasicStepScalingPolicyProps.builder()
         .metric(cpuUtilization)
         .scalingSteps(List.of(ScalingInterval.builder().upper(10).change(-1).build(), ScalingInterval.builder().lower(50).change(+1).build(), ScalingInterval.builder().lower(70).change(+3).build()))
 
         // Change this to AdjustmentType.PercentChangeInCapacity to interpret the
         // 'change' numbers before as percentages instead of capacity counts.
         .adjustmentType(AdjustmentType.CHANGE_IN_CAPACITY)
         .build());

The AutoScaling construct library will create the required CloudWatch alarms and AutoScaling policies for you.

Scaling based on multiple datapoints

The Step Scaling configuration above will initiate a scaling event when a single datapoint of the scaling metric is breaching a scaling step breakpoint. In cases where you might want to initiate scaling actions on a larger number of datapoints (ie in order to smooth out randomness in the metric data), you can use the optional evaluationPeriods and datapointsToAlarm properties:

 ScalableAttribute capacity;
 Metric cpuUtilization;
 
 
 capacity.scaleOnMetric("ScaleToCPUWithMultipleDatapoints", BasicStepScalingPolicyProps.builder()
         .metric(cpuUtilization)
         .scalingSteps(List.of(ScalingInterval.builder().upper(10).change(-1).build(), ScalingInterval.builder().lower(50).change(+1).build(), ScalingInterval.builder().lower(70).change(+3).build()))
 
         // if the cpuUtilization metric has a period of 1 minute, then data points
         // in the last 10 minutes will be evaluated
         .evaluationPeriods(10)
 
         // Only trigger a scaling action when 6 datapoints out of the last 10 are
         // breaching. If this is left unspecified, then ALL datapoints in the
         // evaluation period must be breaching to trigger a scaling action
         .datapointsToAlarm(6)
         .build());

Target Tracking Scaling

This type of scaling scales in and out in order to keep a metric (typically representing utilization) around a value you prefer. This type of scaling is typically heavily service-dependent in what metric you can use, and so different services will have different methods here to set up target tracking scaling.

The following example configures the read capacity of a DynamoDB table to be around 60% utilization:

 import software.amazon.awscdk.services.dynamodb.*;
 
 Table table;
 
 
 IScalableTableAttribute readCapacity = table.autoScaleReadCapacity(EnableScalingProps.builder()
         .minCapacity(10)
         .maxCapacity(1000)
         .build());
 readCapacity.scaleOnUtilization(UtilizationScalingProps.builder()
         .targetUtilizationPercent(60)
         .build());

Scheduled Scaling

This type of scaling is used to change capacities based on time. It works by changing the minCapacity and maxCapacity of the attribute, and so can be used for two purposes:

Scale in and out on a schedule by setting the minCapacity high or the maxCapacity low.
Still allow the regular scaling actions to do their job, but restrict the range they can scale over (by setting both minCapacity and maxCapacity but changing their range over time).

The following schedule expressions can be used:

at(yyyy-mm-ddThh:mm:ss) -- scale at a particular moment in time
rate(value unit) -- scale every minute/hour/day
cron(mm hh dd mm dow) -- scale on arbitrary schedules

Of these, the cron expression is the most useful but also the most complicated. A schedule is expressed as a cron expression. The Schedule class has a cron method to help build cron expressions.

The following example scales the fleet out in the morning, and lets natural scaling take over at night:

 import software.amazon.awscdk.TimeZone;
 SomeScalableResource resource;
 
 
 ScalableAttribute capacity = resource.autoScaleCapacity(new Caps()
         .minCapacity(1)
         .maxCapacity(50)
         );
 
 capacity.scaleOnSchedule("PrescaleInTheMorning", ScalingSchedule.builder()
         .schedule(Schedule.cron(CronOptions.builder().hour("8").minute("0").build()))
         .minCapacity(20)
         .timeZone(TimeZone.AMERICA_DENVER)
         .build());
 
 capacity.scaleOnSchedule("AllowDownscalingAtNight", ScalingSchedule.builder()
         .schedule(Schedule.cron(CronOptions.builder().hour("20").minute("0").build()))
         .minCapacity(1)
         .timeZone(TimeZone.AMERICA_DENVER)
         .build());

Examples

Lambda Provisioned Concurrency Auto Scaling

 import software.amazon.awscdk.services.lambda.*;
 
 Code code;
 
 
 Function handler = Function.Builder.create(this, "MyFunction")
         .runtime(Runtime.PYTHON_3_12)
         .handler("index.handler")
         .code(code)
 
         .reservedConcurrentExecutions(2)
         .build();
 
 Version fnVer = handler.getCurrentVersion();
 
 ScalableTarget target = ScalableTarget.Builder.create(this, "ScalableTarget")
         .serviceNamespace(ServiceNamespace.LAMBDA)
         .maxCapacity(100)
         .minCapacity(10)
         .resourceId(String.format("function:%s:%s", handler.getFunctionName(), fnVer.getVersion()))
         .scalableDimension("lambda:function:ProvisionedConcurrency")
         .build();
 
 target.scaleToTrackMetric("PceTracking", BasicTargetTrackingScalingPolicyProps.builder()
         .targetValue(0.9)
         .predefinedMetric(PredefinedMetric.LAMBDA_PROVISIONED_CONCURRENCY_UTILIZATION)
         .build());

ElastiCache Redis shards scaling with target value

 ScalableTarget shardsScalableTarget = ScalableTarget.Builder.create(this, "ElastiCacheRedisShardsScalableTarget")
         .serviceNamespace(ServiceNamespace.ELASTICACHE)
         .scalableDimension("elasticache:replication-group:NodeGroups")
         .minCapacity(2)
         .maxCapacity(10)
         .resourceId("replication-group/main-cluster")
         .build();
 
 shardsScalableTarget.scaleToTrackMetric("ElastiCacheRedisShardsCPUUtilization", BasicTargetTrackingScalingPolicyProps.builder()
         .targetValue(20)
         .predefinedMetric(PredefinedMetric.ELASTICACHE_PRIMARY_ENGINE_CPU_UTILIZATION)
         .build());

SageMaker variant provisioned concurrency utilization with target value

 ScalableTarget target = ScalableTarget.Builder.create(this, "SageMakerVariantScalableTarget")
         .serviceNamespace(ServiceNamespace.SAGEMAKER)
         .scalableDimension("sagemaker:variant:DesiredProvisionedConcurrency")
         .minCapacity(2)
         .maxCapacity(10)
         .resourceId("endpoint/MyEndpoint/variant/MyVariant")
         .build();
 
 target.scaleToTrackMetric("SageMakerVariantProvisionedConcurrencyUtilization", BasicTargetTrackingScalingPolicyProps.builder()
         .targetValue(0.9)
         .predefinedMetric(PredefinedMetric.SAGEMAKER_VARIANT_PROVISIONED_CONCURRENCY_UTILIZATION)
         .build());

Class

Description

AdjustmentTier

An adjustment.

AdjustmentTier.Builder

A builder for AdjustmentTier

AdjustmentTier.Jsii$Proxy

An implementation for AdjustmentTier

AdjustmentType

How adjustment numbers are interpreted.

BaseScalableAttribute

Represent an attribute for which autoscaling can be configured.

BaseScalableAttributeProps

Properties for a ScalableTableAttribute.

BaseScalableAttributeProps.Builder

A builder for BaseScalableAttributeProps

BaseScalableAttributeProps.Jsii$Proxy

An implementation for BaseScalableAttributeProps

BaseTargetTrackingProps

Base interface for target tracking props.

BaseTargetTrackingProps.Builder

A builder for BaseTargetTrackingProps

BaseTargetTrackingProps.Jsii$Proxy

An implementation for BaseTargetTrackingProps

BasicStepScalingPolicyProps

Example:

BasicStepScalingPolicyProps.Builder

A builder for BasicStepScalingPolicyProps

BasicStepScalingPolicyProps.Jsii$Proxy

An implementation for BasicStepScalingPolicyProps

BasicTargetTrackingScalingPolicyProps

Properties for a Target Tracking policy that include the metric but exclude the target.

BasicTargetTrackingScalingPolicyProps.Builder

A builder for BasicTargetTrackingScalingPolicyProps

BasicTargetTrackingScalingPolicyProps.Jsii$Proxy

An implementation for BasicTargetTrackingScalingPolicyProps

CfnScalableTarget

The AWS::ApplicationAutoScaling::ScalableTarget resource specifies a resource that Application Auto Scaling can scale, such as an AWS::DynamoDB::Table or AWS::ECS::Service resource.

CfnScalableTarget.Builder

A fluent builder for CfnScalableTarget.

CfnScalableTarget.ScalableTargetActionProperty

ScalableTargetAction specifies the minimum and maximum capacity for the ScalableTargetAction property of the AWS::ApplicationAutoScaling::ScalableTarget ScheduledAction property type.

CfnScalableTarget.ScalableTargetActionProperty.Builder

A builder for CfnScalableTarget.ScalableTargetActionProperty

CfnScalableTarget.ScalableTargetActionProperty.Jsii$Proxy

An implementation for CfnScalableTarget.ScalableTargetActionProperty

CfnScalableTarget.ScheduledActionProperty

ScheduledAction is a property of the AWS::ApplicationAutoScaling::ScalableTarget resource that specifies a scheduled action for a scalable target.

CfnScalableTarget.ScheduledActionProperty.Builder

A builder for CfnScalableTarget.ScheduledActionProperty

CfnScalableTarget.ScheduledActionProperty.Jsii$Proxy

An implementation for CfnScalableTarget.ScheduledActionProperty

CfnScalableTarget.SuspendedStateProperty

SuspendedState is a property of the AWS::ApplicationAutoScaling::ScalableTarget resource that specifies whether the scaling activities for a scalable target are in a suspended state.

CfnScalableTarget.SuspendedStateProperty.Builder

A builder for CfnScalableTarget.SuspendedStateProperty

CfnScalableTarget.SuspendedStateProperty.Jsii$Proxy

An implementation for CfnScalableTarget.SuspendedStateProperty

CfnScalableTargetProps

Properties for defining a CfnScalableTarget.

CfnScalableTargetProps.Builder

A builder for CfnScalableTargetProps

CfnScalableTargetProps.Jsii$Proxy

An implementation for CfnScalableTargetProps

CfnScalingPolicy

The AWS::ApplicationAutoScaling::ScalingPolicy resource defines a scaling policy that Application Auto Scaling uses to adjust the capacity of a scalable target.

CfnScalingPolicy.Builder

A fluent builder for CfnScalingPolicy.

CfnScalingPolicy.CustomizedMetricSpecificationProperty

Contains customized metric specification information for a target tracking scaling policy for Application Auto Scaling.

CfnScalingPolicy.CustomizedMetricSpecificationProperty.Builder

A builder for CfnScalingPolicy.CustomizedMetricSpecificationProperty

CfnScalingPolicy.CustomizedMetricSpecificationProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.CustomizedMetricSpecificationProperty

CfnScalingPolicy.MetricDimensionProperty

MetricDimension specifies a name/value pair that is part of the identity of a CloudWatch metric for the Dimensions property of the AWS::ApplicationAutoScaling::ScalingPolicy CustomizedMetricSpecification property type.

CfnScalingPolicy.MetricDimensionProperty.Builder

A builder for CfnScalingPolicy.MetricDimensionProperty

CfnScalingPolicy.MetricDimensionProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.MetricDimensionProperty

CfnScalingPolicy.PredefinedMetricSpecificationProperty

Contains predefined metric specification information for a target tracking scaling policy for Application Auto Scaling.

CfnScalingPolicy.PredefinedMetricSpecificationProperty.Builder

A builder for CfnScalingPolicy.PredefinedMetricSpecificationProperty

CfnScalingPolicy.PredefinedMetricSpecificationProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.PredefinedMetricSpecificationProperty

CfnScalingPolicy.StepAdjustmentProperty

StepAdjustment specifies a step adjustment for the StepAdjustments property of the AWS::ApplicationAutoScaling::ScalingPolicy StepScalingPolicyConfiguration property type.

CfnScalingPolicy.StepAdjustmentProperty.Builder

A builder for CfnScalingPolicy.StepAdjustmentProperty

CfnScalingPolicy.StepAdjustmentProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.StepAdjustmentProperty

CfnScalingPolicy.StepScalingPolicyConfigurationProperty

StepScalingPolicyConfiguration is a property of the AWS::ApplicationAutoScaling::ScalingPolicy resource that specifies a step scaling policy configuration for Application Auto Scaling.

CfnScalingPolicy.StepScalingPolicyConfigurationProperty.Builder

A builder for CfnScalingPolicy.StepScalingPolicyConfigurationProperty

CfnScalingPolicy.StepScalingPolicyConfigurationProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.StepScalingPolicyConfigurationProperty

CfnScalingPolicy.TargetTrackingMetricDataQueryProperty

The metric data to return.

CfnScalingPolicy.TargetTrackingMetricDataQueryProperty.Builder

A builder for CfnScalingPolicy.TargetTrackingMetricDataQueryProperty

CfnScalingPolicy.TargetTrackingMetricDataQueryProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.TargetTrackingMetricDataQueryProperty

CfnScalingPolicy.TargetTrackingMetricDimensionProperty

TargetTrackingMetricDimension specifies a name/value pair that is part of the identity of a CloudWatch metric for the Dimensions property of the AWS::ApplicationAutoScaling::ScalingPolicy TargetTrackingMetric property type.

CfnScalingPolicy.TargetTrackingMetricDimensionProperty.Builder

A builder for CfnScalingPolicy.TargetTrackingMetricDimensionProperty

CfnScalingPolicy.TargetTrackingMetricDimensionProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.TargetTrackingMetricDimensionProperty

CfnScalingPolicy.TargetTrackingMetricProperty

Represents a specific metric for a target tracking scaling policy for Application Auto Scaling.

CfnScalingPolicy.TargetTrackingMetricProperty.Builder

A builder for CfnScalingPolicy.TargetTrackingMetricProperty

CfnScalingPolicy.TargetTrackingMetricProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.TargetTrackingMetricProperty

CfnScalingPolicy.TargetTrackingMetricStatProperty

This structure defines the CloudWatch metric to return, along with the statistic and unit.

CfnScalingPolicy.TargetTrackingMetricStatProperty.Builder

A builder for CfnScalingPolicy.TargetTrackingMetricStatProperty

CfnScalingPolicy.TargetTrackingMetricStatProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.TargetTrackingMetricStatProperty

CfnScalingPolicy.TargetTrackingScalingPolicyConfigurationProperty

TargetTrackingScalingPolicyConfiguration is a property of the AWS::ApplicationAutoScaling::ScalingPolicy resource that specifies a target tracking scaling policy configuration for Application Auto Scaling.

CfnScalingPolicy.TargetTrackingScalingPolicyConfigurationProperty.Builder

A builder for CfnScalingPolicy.TargetTrackingScalingPolicyConfigurationProperty

CfnScalingPolicy.TargetTrackingScalingPolicyConfigurationProperty.Jsii$Proxy

An implementation for CfnScalingPolicy.TargetTrackingScalingPolicyConfigurationProperty

CfnScalingPolicyProps

Properties for defining a CfnScalingPolicy.

CfnScalingPolicyProps.Builder

A builder for CfnScalingPolicyProps

CfnScalingPolicyProps.Jsii$Proxy

An implementation for CfnScalingPolicyProps

CronOptions

Options to configure a cron expression.

CronOptions.Builder

A builder for CronOptions

CronOptions.Jsii$Proxy

An implementation for CronOptions

EnableScalingProps

Properties for enabling Application Auto Scaling.

EnableScalingProps.Builder

A builder for EnableScalingProps

EnableScalingProps.Jsii$Proxy

An implementation for EnableScalingProps

IScalableTarget

IScalableTarget.Jsii$Default

Internal default implementation for IScalableTarget.

IScalableTarget.Jsii$Proxy

A proxy class which represents a concrete javascript instance of this type.

MetricAggregationType

How the scaling metric is going to be aggregated.

PredefinedMetric

One of the predefined autoscaling metrics.

ScalableTarget

Define a scalable target.

ScalableTarget.Builder

A fluent builder for ScalableTarget.

ScalableTargetProps

Properties for a scalable target.

ScalableTargetProps.Builder

A builder for ScalableTargetProps

ScalableTargetProps.Jsii$Proxy

An implementation for ScalableTargetProps

ScalingInterval

A range of metric values in which to apply a certain scaling operation.

ScalingInterval.Builder

A builder for ScalingInterval

ScalingInterval.Jsii$Proxy

An implementation for ScalingInterval

ScalingSchedule

A scheduled scaling action.

ScalingSchedule.Builder

A builder for ScalingSchedule

ScalingSchedule.Jsii$Proxy

An implementation for ScalingSchedule

Schedule

Schedule for scheduled scaling actions.

ServiceNamespace

The service that supports Application AutoScaling.

StepScalingAction

Define a step scaling action.

StepScalingAction.Builder

A fluent builder for StepScalingAction.

StepScalingActionProps

Properties for a scaling policy.

StepScalingActionProps.Builder

A builder for StepScalingActionProps

StepScalingActionProps.Jsii$Proxy

An implementation for StepScalingActionProps

StepScalingPolicy

Define a scaling strategy which scales depending on absolute values of some metric.

StepScalingPolicy.Builder

A fluent builder for StepScalingPolicy.

StepScalingPolicyProps

Example:

StepScalingPolicyProps.Builder

A builder for StepScalingPolicyProps

StepScalingPolicyProps.Jsii$Proxy

An implementation for StepScalingPolicyProps

TargetTrackingScalingPolicy

Example:

TargetTrackingScalingPolicy.Builder

A fluent builder for TargetTrackingScalingPolicy.

TargetTrackingScalingPolicyProps

Properties for a concrete TargetTrackingPolicy.

TargetTrackingScalingPolicyProps.Builder

A builder for TargetTrackingScalingPolicyProps

TargetTrackingScalingPolicyProps.Jsii$Proxy

An implementation for TargetTrackingScalingPolicyProps

Package software.amazon.awscdk.services.applicationautoscaling