Amazon SageMaker endpoints and quotas - AWS General Reference

Amazon SageMaker endpoints and quotas

The following are the service endpoints and service quotas for this service. To connect programmatically to an AWS service, you use an endpoint. In addition to the standard AWS endpoints, some AWS services offer FIPS endpoints in selected Regions. For more information, see AWS service endpoints. Service quotas, also referred to as limits, are the maximum number of service resources or operations for your AWS account. For more information, see AWS service quotas.

Service Endpoints

The following table provides a list of Region-specific endpoints that SageMaker supports for training and deploying models. This include creating and managing notebook instances, training jobs, model, endpoint configurations, and endpoints.

Region Name Region Endpoint Protocol
US East (Ohio) us-east-2

api.sagemaker.us-east-2.amazonaws.com

api-fips.sagemaker.us-east-2.amazonaws.com

HTTPS

HTTPS

US East (N. Virginia) us-east-1

api.sagemaker.us-east-1.amazonaws.com

api-fips.sagemaker.us-east-1.amazonaws.com

HTTPS

HTTPS

US West (N. California) us-west-1

api.sagemaker.us-west-1.amazonaws.com

api-fips.sagemaker.us-west-1.amazonaws.com

HTTPS

HTTPS

US West (Oregon) us-west-2

api.sagemaker.us-west-2.amazonaws.com

api-fips.sagemaker.us-west-2.amazonaws.com

HTTPS

HTTPS

Africa (Cape Town) af-south-1 api.sagemaker.af-south-1.amazonaws.com HTTPS
Asia Pacific (Hong Kong) ap-east-1 api.sagemaker.ap-east-1.amazonaws.com HTTPS
Asia Pacific (Mumbai) ap-south-1 api.sagemaker.ap-south-1.amazonaws.com HTTPS
Asia Pacific (Osaka) ap-northeast-3 api.sagemaker.ap-northeast-3.amazonaws.com HTTPS
Asia Pacific (Seoul) ap-northeast-2 api.sagemaker.ap-northeast-2.amazonaws.com HTTPS
Asia Pacific (Singapore) ap-southeast-1 api.sagemaker.ap-southeast-1.amazonaws.com HTTPS
Asia Pacific (Sydney) ap-southeast-2 api.sagemaker.ap-southeast-2.amazonaws.com HTTPS
Asia Pacific (Tokyo) ap-northeast-1 api.sagemaker.ap-northeast-1.amazonaws.com HTTPS
Canada (Central) ca-central-1 api.sagemaker.ca-central-1.amazonaws.com HTTPS
Europe (Frankfurt) eu-central-1 api.sagemaker.eu-central-1.amazonaws.com HTTPS
Europe (Ireland) eu-west-1 api.sagemaker.eu-west-1.amazonaws.com HTTPS
Europe (London) eu-west-2 api.sagemaker.eu-west-2.amazonaws.com HTTPS
Europe (Milan) eu-south-1 api.sagemaker.eu-south-1.amazonaws.com HTTPS
Europe (Paris) eu-west-3 api.sagemaker.eu-west-3.amazonaws.com HTTPS
Europe (Stockholm) eu-north-1 api.sagemaker.eu-north-1.amazonaws.com HTTPS
Middle East (Bahrain) me-south-1 api.sagemaker.me-south-1.amazonaws.com HTTPS
South America (São Paulo) sa-east-1 api.sagemaker.sa-east-1.amazonaws.com HTTPS
AWS GovCloud (US-West) us-gov-west-1

api.sagemaker.us-gov-west-1.amazonaws.com

api-fips.sagemaker.us-gov-west-1.amazonaws.com

HTTPS

HTTPS

The following table provides a list of Region-specific endpoints that Amazon SageMaker supports for making inference requests against models hosted in SageMaker.

Region Name Region Endpoint Protocol
US East (Ohio) us-east-2

runtime.sagemaker.us-east-2.amazonaws.com

runtime-fips.sagemaker.us-east-2.amazonaws.com

HTTPS

HTTPS

US East (N. Virginia) us-east-1

runtime.sagemaker.us-east-1.amazonaws.com

runtime-fips.sagemaker.us-east-1.amazonaws.com

HTTPS

HTTPS

US West (N. California) us-west-1

runtime.sagemaker.us-west-1.amazonaws.com

runtime-fips.sagemaker.us-west-1.amazonaws.com

HTTPS

HTTPS

US West (Oregon) us-west-2

runtime.sagemaker.us-west-2.amazonaws.com

runtime-fips.sagemaker.us-west-2.amazonaws.com

HTTPS

HTTPS

Africa (Cape Town) af-south-1 runtime.sagemaker.af-south-1.amazonaws.com HTTPS
Asia Pacific (Hong Kong) ap-east-1 runtime.sagemaker.ap-east-1.amazonaws.com HTTPS
Asia Pacific (Mumbai) ap-south-1 runtime.sagemaker.ap-south-1.amazonaws.com HTTPS
Asia Pacific (Osaka) ap-northeast-3 runtime.sagemaker.ap-northeast-3.amazonaws.com HTTPS
Asia Pacific (Seoul) ap-northeast-2 runtime.sagemaker.ap-northeast-2.amazonaws.com HTTPS
Asia Pacific (Singapore) ap-southeast-1 runtime.sagemaker.ap-southeast-1.amazonaws.com HTTPS
Asia Pacific (Sydney) ap-southeast-2 runtime.sagemaker.ap-southeast-2.amazonaws.com HTTPS
Asia Pacific (Tokyo) ap-northeast-1 runtime.sagemaker.ap-northeast-1.amazonaws.com HTTPS
Canada (Central) ca-central-1 runtime.sagemaker.ca-central-1.amazonaws.com HTTPS
Europe (Frankfurt) eu-central-1 runtime.sagemaker.eu-central-1.amazonaws.com HTTPS
Europe (Ireland) eu-west-1 runtime.sagemaker.eu-west-1.amazonaws.com HTTPS
Europe (London) eu-west-2 runtime.sagemaker.eu-west-2.amazonaws.com HTTPS
Europe (Milan) eu-south-1 runtime.sagemaker.eu-south-1.amazonaws.com HTTPS
Europe (Paris) eu-west-3 runtime.sagemaker.eu-west-3.amazonaws.com HTTPS
Europe (Stockholm) eu-north-1 runtime.sagemaker.eu-north-1.amazonaws.com HTTPS
Middle East (Bahrain) me-south-1 runtime.sagemaker.me-south-1.amazonaws.com HTTPS
South America (São Paulo) sa-east-1 runtime.sagemaker.sa-east-1.amazonaws.com HTTPS
AWS GovCloud (US-West) us-gov-west-1

runtime.sagemaker.us-gov-west-1.amazonaws.com

runtime.sagemaker.us-gov-west-1.amazonaws.com

HTTPS

HTTPS

Service Quotas

Depending on your activities and resource usage over time, your SageMaker quotas might be different from the default SageMaker quotas listed in the following tables. The default quotas in this page are based on new accounts. If you encounter error messages that you've exceeded your quota, use AWS Support to request a service limit increase for SageMaker resources you want to scale up. For instructions on how to request a service limit increase, see Supported Regions and Quotas in the Amazon SageMaker Developer Guide.

SageMaker Studio
Resource Default
Total Studio Domains per AWS account 1
KernelGateway-ml.c5.large 0
KernelGateway-ml.c5.xlarge 0
KernelGateway-ml.c5.2xlarge 0
KernelGateway-ml.c5.4xlarge 0
KernelGateway-ml.c5.9xlarge 0
KernelGateway-ml.c5.12xlarge 0
KernelGateway-ml.c5.18xlarge 0
KernelGateway-ml.c5.24xlarge 0
KernelGateway-ml.g4dn.xlarge 0
KernelGateway-ml.g4dn.2xlarge 0
KernelGateway-ml.g4dn.4xlarge 0
KernelGateway-ml.g4dn.8xlarge 0
KernelGateway-ml.g4dn.12xlarge 0
KernelGateway-ml.g4dn.16xlarge 0
KernelGateway-ml.m5.large 0
KernelGateway-ml.m5.xlarge 0
KernelGateway-ml.m5.2xlarge 0
KernelGateway-ml.m5.4xlarge 1
KernelGateway-ml.m5.8xlarge 0
KernelGateway-ml.m5.12xlarge 0
KernelGateway-ml.m5.16xlarge 0
KernelGateway-ml.m5.24xlarge 0
KernelGateway-ml.p3.2xlarge 0

KernelGateway-ml.p3.8xlarge

0
KernelGateway-ml.p3.16xlarge 0

KernelGateway-ml.t3.medium

2

KernelGateway-ml.t3.large

0

KernelGateway-ml.t3.xlarge

0

KernelGateway-ml.t3.2xlarge

0

Maximum number of UserProfiles per Domain

2

Maximum number of Running Apps per Domain

20

Maximum number of custom images per Domain

30

Maximum number of custom images per UserProfile

5
SageMaker Images
Resource Default
Number of SageMaker Images 250
Number of image versions per SageMaker image 1,000
SageMaker Notebooks
Resource Default
ml.t2.medium instances 2
ml.t2.large instances 0
ml.t2.xlarge instances 0
ml.t2.2xlarge instances 0
ml.t3.medium instances 2
ml.t3.large instances 0
ml.t3.xlarge instances 0
ml.t3.2xlarge instances 0
ml.m4.xlarge instances 0
ml.m4.2xlarge instances 0
ml.m4.4xlarge instances 0
ml.m4.10xlarge instances 0
ml.m4.16xlarge instances 0
ml.m5.xlarge instances 0
ml.m5.2xlarge instances 0
ml.m5.4xlarge instances 0
ml.m5.12xlarge instances 0
ml.m5.24xlarge instances 0
ml.c4.xlarge instances 0
ml.c4.2xlarge instances 0
ml.c4.4xlarge instances 0
ml.c4.8xlarge instances 0
ml.c5.xlarge instances 0
ml.c5.2xlarge instances 0
ml.c5.4xlarge instances 0
ml.c5.9xlarge instances 0
ml.c5.18xlarge instances 0
ml.c5d.xlarge instances 0
ml.c5d.2xlarge instances 0
ml.c5d.4xlarge instances 0
ml.c5d.9xlarge instances 0
ml.c5d.18xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
ml.g4dn.xlarge instances 2
ml.g4dn.2xlarge instances 2
ml.g4dn.4xlarge instances 2
ml.g4dn.8xlarge instances 2
ml.g4dn.12xlarge instances 2
ml.g4dn.16xlarge instances 2
ml.eia1.medium instances 0
ml.eia1.large instances 0
ml.eia1.xlarge instances 0
ml.eia2.medium instances 0
ml.eia2.large instances 0
ml.eia2.xlarge instances 0
Number of accelerators 0
Number of notebook instances 4
EBS volume size in GB for an instance 102400
SageMaker Ground Truth
Resource Default
Total labeling jobs 1
Total streaming labeling jobs 0
Max dataset objects per labeling job 10,000
Number of workteams 25
SageMaker Projects
Resource Default
Number of projects 500
SageMaker Pipelines
Resource Default
Number of pipelines 5,000
SageMaker Pipeline Executions
Resource Default
Maximum execution time 28 days
Concurrent pipeline executions per account 200
Concurrent pipeline executions per pipeline 200
Parameters
Resource Default
Parameters per pipeline 50
Parameter name length 64 characters
Parameters name regular expression pattern ([A-Za-z0-9\\-_])*
Parameter description length 4,096 characters
Parameter enum values 16 distinct values
SageMaker Condition Steps
Resource Default
Conditions per ConditionStep 200
Steps in If-List 20
Steps in Else-List 20
Conditions in Or-List 200
Property Files
Resource Default
PropertyFiles in a pipeline 10
JsonGet functions in a pipeline 200
Size of the property file 2 MB
SageMaker Metadata
Resource Default
Maximum number of metadata 20 key-value pairs
Metadata key size 128 characters
Metadata key regular expression pattern ([A-Za-z0-9\\-_])*
Maximum metadata value size 1024 characters
Metadata value regular expression pattern [\\p{M}\\p{L}\\p{S}\\p{S}\\p{N}\\p{P}\\s
SageMaker Feature Store
Resource Default
Number of feature groups 10
Concurrent feature group creation workflows 4
SageMaker Processing
Resource Default
ml.c4.xlarge 4
ml.c4.2xlarge 4
ml.c4.4xlarge 4
ml.c4.8xlarge 4
ml.c5.xlarge 4
ml.c5.2xlarge 4
ml.c5.4xlarge 1
ml.c5.9xlarge 1
ml.c5.18xlarge 1
ml.g4dn.xlarge 0
ml.g4dn.2xlarge 0
ml.g4dn.4xlarge 0
ml.g4dn.8xlarge 0
ml.g4dn.12xlarge 0
ml.g4dn.16xlarge 0
ml.m4.xlarge 4
ml.m4.2xlarge 4
ml.m4.4xlarge 2
ml.m4.10xlarge 1
ml.m4.16xlarge 1
ml.m5.large 4
ml.m5.xlarge 4
ml.m5.2xlarge 4
ml.m5.4xlarge 2
ml.m5.12xlarge 0
ml.m5.24xlarge 0
ml.p2.xlarge 0
ml.p2.8xlarge 0
ml.p2.16xlarge 0
ml.p3.2xlarge 0
ml.p3.8xlarge 0
ml.p3.16xlarge 0
ml.r5.large 4
ml.r5.xlarge 4
ml.r5.2xlarge 4
ml.r5.4xlarge 1
ml.r5.8xlarge 1
ml.r5.12xlarge 1
ml.r5.16xlarge 1
ml.r5.24xlarge 0
ml.t3.medium 4
ml.t3.large 4
ml.t3.xlarge 2
ml.t3.2xlarge 0
Longest run time for a processing job 5 days
Number of instances across processing jobs 4
Number of instances per processing job 20
Size of EBS volume for an instance 1 TB
Note

In case of SageMaker training, on-demand and spot instance quotas are tracked and modified separately. For example, with the default quotas, you can run up to 20 training jobs with ml.m4.xlarge on-demand instances and up to 20 training jobs with ml.m4.xlarge spot instances simultaneously.

SageMaker Training
Resource Default
ml.c4.xlarge instances 4
ml.c4.2xlarge instances 4
ml.c4.4xlarge instances 4
ml.c4.8xlarge instances 4
ml.c5.xlarge instances 4
ml.c5.2xlarge instances 4
ml.c5.4xlarge instances 1
ml.c5.9xlarge instances 1
ml.c5.18xlarge instances 0
ml.c5n.xlarge instances 0
ml.c5n.2xlarge instances 0
ml.c5n.4xlarge instances 0
ml.c5n.9xlarge instances 0
ml.c5n.18xlarge instances 0
ml.g4dn.xlarge instances 0
ml.g4dn.2xlarge instances 0
ml.g4dn.4xlarge instances 0
ml.g4dn.8xlarge instances 0
ml.g4dn.12xlarge instances 0
ml.g4dn.16xlarge instances 0
ml.m4.xlarge instances 4
ml.m4.2xlarge instances 4
ml.m4.4xlarge instances 2
ml.m4.10xlarge instances 0
ml.m4.16xlarge instances 0
ml.m5.large instances 4
ml.m5.xlarge instances 4
ml.m5.2xlarge instances 4
ml.m5.4xlarge instances 20
ml.m5.12xlarge instances 0
ml.m5.24xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
ml.p3dn.24xlarge instances 0
ml.p4d.24xlarge instances 0
The longest run time for a training job 5 days
Number of instances across training jobs 4
Number of instances per training job 20
Size of EBS volume for an instance 1 TB
SageMaker Managed Spot Training
Resource Default
ml.c4.xlarge instances 4
ml.c4.2xlarge instances 4
ml.c4.4xlarge instances 4
ml.c4.8xlarge instances 4
ml.c5.xlarge instances 4
ml.c5.2xlarge instances 4
ml.c5.4xlarge instances 1
ml.c5.9xlarge instances 1
ml.c5.18xlarge instances 0
ml.c5n.xlarge instances 0
ml.c5n.2xlarge instances 0
ml.c5n.4xlarge instances 0
ml.c5n.9xlarge instances 0
ml.c5n.18xlarge instances 0
ml.g4dn.xlarge instances 0
ml.g4dn.2xlarge instances 0
ml.g4dn.4xlarge instances 0
ml.g4dn.8xlarge instances 0
ml.g4dn.12xlarge instances 0
ml.g4dn.16xlarge instances 0
ml.m4.xlarge instances 4
ml.m4.2xlarge instances 4
ml.m4.4xlarge instances 2
ml.m4.10xlarge instances 0
ml.m4.16xlarge instances 0
ml.m5.large instances 4
ml.m5.xlarge instances 4
ml.m5.2xlarge instances 4
ml.m5.4xlarge instances 2
ml.m5.12xlarge instances 0
ml.m5.24xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
ml.p3dn.24xlarge instances 0
ml.p4d.24xlarge instances 0
Number of instances across training jobs 4
Number of instances per training job 20
SageMaker Autopilot
Resource Default
Maximum dataset size in GB 5
Maximum number of parallel Autopilot Jobs 1
SageMaker Automatic Model Hyperparameter Tuning
Resource Default
Number of concurrent hyperparameter tuning jobs 100
Number of parallel training jobs per hyperparameter tuning job 10
Number of training jobs per hyperparameter tuning job 500
SageMaker Experiments (Lineage Tracking / Experiment Tracking)
Resource Default
Experiments 5,000
Trial components 20,000

Trial components in a single trial

50
Trials in a single experiment 300
Trials a single trial component can be associated with 500
Number of actions 3,000
Number of artifacts 6,000
Number of associations 6,000
Number of contexts 500
Note

Use AWS Support to request a service limit increase in order to use an instance with a default quota of 0.

SageMaker Hosting
Resource Default
ml.c4.large instances 0
ml.c4.xlarge instances 0
ml.c4.2xlarge instances 0
ml.c4.4xlarge instances 0
ml.c4.8xlarge instances 0
ml.c5.large instances 0
ml.c5.xlarge instances 0
ml.c5.2xlarge instances 0
ml.c5.4xlarge instances 0
ml.c5.9xlarge instances 0
ml.c5.12xlarge instances 0
ml.c5.18xlarge instances 0
ml.c5.24xlarge instances 0
ml.c5d.large instances 0
ml.c5d.xlarge instances 0
ml.c5d.2xlarge instances 0
ml.c5d.4xlarge instances 0
ml.c5d.9xlarge instances 0
ml.c5d.18xlarge instances 0
ml.c5n.large instances 0
ml.c5n.xlarge instances 0
ml.c5n.2xlarge instances 0
ml.c5n.4xlarge instances 0
ml.c5n.9xlarge instances 0
ml.c5n.18xlarge instances 0
ml.g4dn.xlarge instances 0
ml.g4dn.2xlarge instances 0
ml.g4dn.4xlarge instances 0
ml.g4dn.8xlarge instances 0
ml.g4dn.12xlarge instances 0
ml.g4dn.16xlarge instances 0
ml.m4.xlarge instances 2
ml.m4.2xlarge instances 0
ml.m4.4xlarge instances 0
ml.m4.10xlarge instances 0
ml.m4.16xlarge instances 0
ml.m5.large instances 2
ml.m5.xlarge instances 0
ml.m5.2xlarge instances 0
ml.m5.4xlarge instances 0
ml.m5.8xlarge instances 0
ml.m5.12xlarge instances 0
ml.m5.16xlarge instances 0
ml.m5.24xlarge instances 0
ml.m5d.large instances 0
ml.m5d.xlarge instances 0
ml.m5d.2xlarge instances 0
ml.m5d.4xlarge instances 0
ml.m5d.8xlarge instances 0
ml.m5d.12xlarge instances 0
ml.m5d.16xlarge instances 0
ml.m5d.24xlarge instances 0
ml.m5dn.large instances 0
ml.m5dn.xlarge instances 0
ml.m5dn.2xlarge instances 0
ml.m5dn.4xlarge instances 0
ml.m5dn.8xlarge instances 0
ml.m5dn.12xlarge instances 0
ml.m5dn.16xlarge instances 0
ml.m5dn.24xlarge instances 0
ml.m5n.large instances 0
ml.m5n.xlarge instances 0
ml.m5n.2xlarge instances 0
ml.m5n.4xlarge instances 0
ml.m5n.8xlarge instances 0
ml.m5n.12xlarge instances 0
ml.m5n.16xlarge instances 0
ml.m5n.24xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
ml.r5.large instances 0
ml.r5.xlarge instances 0
ml.r5.2xlarge instances 0
ml.r5.4xlarge instances 0
ml.r5.8xlarge instances 0
ml.r5.12xlarge instances 0
ml.r5.16xlarge instances 0
ml.r5.24xlarge instances 0
ml.r5d.large instances 0
ml.r5d.xlarge instances 0
ml.r5d.2xlarge instances 0
ml.r5d.4xlarge instances 0
ml.r5d.8xlarge instances 0
ml.r5d.12xlarge instances 0
ml.r5d.16xlarge instances 0
ml.r5d.24xlarge instances 0
ml.r5dn.large instances 0
ml.r5dn.xlarge instances 0
ml.r5dn.2xlarge instances 0
ml.r5dn.4xlarge instances 0
ml.r5dn.8xlarge instances 0
ml.r5dn.12xlarge instances 0
ml.r5dn.16xlarge instances 0
ml.r5dn.24xlarge instances 0
ml.r5n.large instances 0
ml.r5n.xlarge instances 0
ml.r5n.2xlarge instances 0
ml.r5n.4xlarge instances 0
ml.r5n.8xlarge instances 0
ml.r5n.12xlarge instances 0
ml.r5n.16xlarge instances 0
ml.r5n.24xlarge instances 0
ml.t2.medium instances 2
ml.t2.large instances 0
ml.t2.xlarge instances 0
ml.t2.2xlarge instances 0
ml.t3.medium instances 2
ml.t3.large instances 0
ml.t3.xlarge instances 0
ml.t3.2xlarge instances 0
Number of instances across endpoints 2
Number of instances per endpoint 0
Number of accelerators per endpoint 4
Total TPS for all endpoints 10,000
Maximum payload size for endpoint invocation 6 MB
Inference timeout for endpoint invocation 60 seconds
SageMaker Batch Transform
Resource Default
ml.c4.xlarge instances 4
ml.c4.2xlarge instances 4
ml.c4.4xlarge instances 4
ml.c4.8xlarge instances 4
ml.c5.xlarge instances 4
ml.c5.2xlarge instances 4
ml.c5.4xlarge instances 1
ml.c5.9xlarge instances 1
ml.c5.18xlarge instances 1
ml.g4dn.xlarge 0
ml.g4dn.2xlarge 0
ml.g4dn.4xlarge 0
ml.g4dn.8xlarge 0
ml.g4dn.12xlarge 0
ml.g4dn.16xlarge 0
ml.m4.xlarge instances 4
ml.m4.2xlarge instances 4
ml.m4.4xlarge instances 2
ml.m4.10xlarge instances 1
ml.m4.16xlarge instances 1
ml.m5.large instances 4
ml.m5.xlarge instances 4
ml.m5.2xlarge instances 4
ml.m5.4xlarge instances 2
ml.m5.12xlarge instances 0
ml.m5.24xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
Number of instances per transform job 4
SageMaker Human Task UI
Resource Default
Number of human task UIs 100