Amazon SageMaker endpoints and quotas - AWS General Reference

Amazon SageMaker endpoints and quotas

The following are the service endpoints and service quotas for this service. To connect programmatically to an AWS service, you use an endpoint. In addition to the standard AWS endpoints, some AWS services offer FIPS endpoints in selected Regions. For more information, see AWS service endpoints. Service quotas, also referred to as limits, are the maximum number of service resources or operations for your AWS account. For more information, see AWS service quotas.

Service Endpoints

The following table provides a list of Region-specific endpoints that SageMaker supports for training and deploying models. This include creating and managing notebook instances, training jobs, model, endpoint configurations, and endpoints.

Region Name Region Endpoint Protocol
US East (Ohio) us-east-2

api.sagemaker.us-east-2.amazonaws.com

api-fips.sagemaker.us-east-2.amazonaws.com

HTTPS

HTTPS

US East (N. Virginia) us-east-1

api.sagemaker.us-east-1.amazonaws.com

api-fips.sagemaker.us-east-1.amazonaws.com

HTTPS

HTTPS

US West (N. California) us-west-1

api.sagemaker.us-west-1.amazonaws.com

api-fips.sagemaker.us-west-1.amazonaws.com

HTTPS

HTTPS

US West (Oregon) us-west-2

api.sagemaker.us-west-2.amazonaws.com

api-fips.sagemaker.us-west-2.amazonaws.com

HTTPS

HTTPS

Africa (Cape Town) af-south-1 api.sagemaker.af-south-1.amazonaws.com HTTPS
Asia Pacific (Hong Kong) ap-east-1 api.sagemaker.ap-east-1.amazonaws.com HTTPS
Asia Pacific (Mumbai) ap-south-1 api.sagemaker.ap-south-1.amazonaws.com HTTPS
Asia Pacific (Seoul) ap-northeast-2 api.sagemaker.ap-northeast-2.amazonaws.com HTTPS
Asia Pacific (Singapore) ap-southeast-1 api.sagemaker.ap-southeast-1.amazonaws.com HTTPS
Asia Pacific (Sydney) ap-southeast-2 api.sagemaker.ap-southeast-2.amazonaws.com HTTPS
Asia Pacific (Tokyo) ap-northeast-1 api.sagemaker.ap-northeast-1.amazonaws.com HTTPS
Canada (Central) ca-central-1 api.sagemaker.ca-central-1.amazonaws.com HTTPS
China (Beijing) cn-north-1 api.sagemaker.cn-north-1.amazonaws.com.cn HTTPS
China (Ningxia) cn-northwest-1 api.sagemaker.cn-northwest-1.amazonaws.com.cn HTTPS
Europe (Frankfurt) eu-central-1 api.sagemaker.eu-central-1.amazonaws.com HTTPS
Europe (Ireland) eu-west-1 api.sagemaker.eu-west-1.amazonaws.com HTTPS
Europe (London) eu-west-2 api.sagemaker.eu-west-2.amazonaws.com HTTPS
Europe (Milan) eu-south-1 api.sagemaker.eu-south-1.amazonaws.com HTTPS
Europe (Paris) eu-west-3 api.sagemaker.eu-west-3.amazonaws.com HTTPS
Europe (Stockholm) eu-north-1 api.sagemaker.eu-north-1.amazonaws.com HTTPS
Middle East (Bahrain) me-south-1 api.sagemaker.me-south-1.amazonaws.com HTTPS
South America (São Paulo) sa-east-1 api.sagemaker.sa-east-1.amazonaws.com HTTPS
AWS GovCloud (US-West) us-gov-west-1

api.sagemaker.us-gov-west-1.amazonaws.com

api-fips.sagemaker.us-gov-west-1.amazonaws.com

api.sagemaker.us-gov-west-1.amazonaws.com

HTTPS

HTTPS

HTTPS

The following table provides a list of Region-specific endpoints that Amazon SageMaker supports for making inference requests against models hosted in SageMaker.

Region Name Region Endpoint Protocol
US East (Ohio) us-east-2

runtime.sagemaker.us-east-2.amazonaws.com

runtime-fips.sagemaker.us-east-2.amazonaws.com

HTTPS

HTTPS

US East (N. Virginia) us-east-1

runtime.sagemaker.us-east-1.amazonaws.com

runtime-fips.sagemaker.us-east-1.amazonaws.com

HTTPS

HTTPS

US West (N. California) us-west-1

runtime.sagemaker.us-west-1.amazonaws.com

runtime-fips.sagemaker.us-west-1.amazonaws.com

HTTPS

HTTPS

US West (Oregon) us-west-2

runtime.sagemaker.us-west-2.amazonaws.com

runtime-fips.sagemaker.us-west-2.amazonaws.com

HTTPS

HTTPS

Africa (Cape Town) af-south-1 runtime.sagemaker.af-south-1.amazonaws.com HTTPS
Asia Pacific (Hong Kong) ap-east-1 runtime.sagemaker.ap-east-1.amazonaws.com HTTPS
Asia Pacific (Mumbai) ap-south-1 runtime.sagemaker.ap-south-1.amazonaws.com HTTPS
Asia Pacific (Seoul) ap-northeast-2 runtime.sagemaker.ap-northeast-2.amazonaws.com HTTPS
Asia Pacific (Singapore) ap-southeast-1 runtime.sagemaker.ap-southeast-1.amazonaws.com HTTPS
Asia Pacific (Sydney) ap-southeast-2 runtime.sagemaker.ap-southeast-2.amazonaws.com HTTPS
Asia Pacific (Tokyo) ap-northeast-1 runtime.sagemaker.ap-northeast-1.amazonaws.com HTTPS
Canada (Central) ca-central-1 runtime.sagemaker.ca-central-1.amazonaws.com HTTPS
China (Beijing) cn-north-1 runtime.sagemaker.cn-north-1.amazonaws.com.cn HTTPS
China (Ningxia) cn-northwest-1 runtime.sagemaker.cn-northwest-1.amazonaws.com.cn HTTPS
Europe (Frankfurt) eu-central-1 runtime.sagemaker.eu-central-1.amazonaws.com HTTPS
Europe (Ireland) eu-west-1 runtime.sagemaker.eu-west-1.amazonaws.com HTTPS
Europe (London) eu-west-2 runtime.sagemaker.eu-west-2.amazonaws.com HTTPS
Europe (Milan) eu-south-1 runtime.sagemaker.eu-south-1.amazonaws.com HTTPS
Europe (Paris) eu-west-3 runtime.sagemaker.eu-west-3.amazonaws.com HTTPS
Europe (Stockholm) eu-north-1 runtime.sagemaker.eu-north-1.amazonaws.com HTTPS
Middle East (Bahrain) me-south-1 runtime.sagemaker.me-south-1.amazonaws.com HTTPS
South America (São Paulo) sa-east-1 runtime.sagemaker.sa-east-1.amazonaws.com HTTPS
AWS GovCloud (US-West) us-gov-west-1

runtime.sagemaker.us-gov-west-1.amazonaws.com

runtime.sagemaker.us-gov-west-1.amazonaws.com

HTTPS

HTTPS

Service Quotas

Depending on your activities and resource usage over time, your SageMaker quotas might be different from the default SageMaker quotas listed in the following tables. The default quotas in this page are based on new accounts. If you encounter error messages that you've exceeded your quota, use AWS Support to request a service limit increase for SageMaker resources you want to scale up. For instructions on how to request a service limit increase, see Supported Regions and Quotas in the Amazon SageMaker Developer Guide.

SageMaker Studio
Resource Default
KernelGateway-ml.c5.large 0
KernelGateway-ml.c5.xlarge 0
KernelGateway-ml.c5.2xlarge 0
KernelGateway-ml.c5.4xlarge 0
KernelGateway-ml.c5.9xlarge 0
KernelGateway-ml.c5.12xlarge 0
KernelGateway-ml.c5.18xlarge 0
KernelGateway-ml.c5.24xlarge 0
KernelGateway-ml.g4dn.xlarge 0
KernelGateway-ml.g4dn.2xlarge 0
KernelGateway-ml.g4dn.4xlarge 0
KernelGateway-ml.g4dn.8xlarge 0
KernelGateway-ml.g4dn.12xlarge 0
KernelGateway-ml.g4dn.16xlarge 0
KernelGateway-ml.m5.large 0
KernelGateway-ml.m5.xlarge 0
KernelGateway-ml.m5.2xlarge 0
KernelGateway-ml.m5.4xlarge 1
KernelGateway-ml.m5.8xlarge 0
KernelGateway-ml.m5.12xlarge 0
KernelGateway-ml.m5.16xlarge 0
KernelGateway-ml.m5.24xlarge 0
KernelGateway-ml.p3.2xlarge 0

KernelGateway-ml.p3.8xlarge

0
KernelGateway-ml.p3.16xlarge 0

KernelGateway-ml.t3.medium

2

KernelGateway-ml.t3.large

0

KernelGateway-ml.t3.xlarge

0

KernelGateway-ml.t3.2xlarge

0

Max running apps per account

20

Number of user profiles per account

2
SageMaker Images
Resource Default
Number of SageMaker Images 250
Number of image versions per SageMaker image 1,000
SageMaker Notebooks
Resource Default
ml.t2.medium instances 2
ml.t2.large instances 0
ml.t2.xlarge instances 0
ml.t2.2xlarge instances 0
ml.t3.medium instances 2
ml.t3.large instances 0
ml.t3.xlarge instances 0
ml.t3.2xlarge instances 0
ml.m4.xlarge instances 0
ml.m4.2xlarge instances 0
ml.m4.4xlarge instances 0
ml.m4.10xlarge instances 0
ml.m4.16xlarge instances 0
ml.m5.xlarge instances 0
ml.m5.2xlarge instances 0
ml.m5.4xlarge instances 0
ml.m5.12xlarge instances 0
ml.m5.24xlarge instances 0
ml.c4.xlarge instances 0
ml.c4.2xlarge instances 0
ml.c4.4xlarge instances 0
ml.c4.8xlarge instances 0
ml.c5.xlarge instances 0
ml.c5.2xlarge instances 0
ml.c5.4xlarge instances 0
ml.c5.9xlarge instances 0
ml.c5.18xlarge instances 0
ml.c5d.xlarge instances 0
ml.c5d.2xlarge instances 0
ml.c5d.4xlarge instances 0
ml.c5d.9xlarge instances 0
ml.c5d.18xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
ml.eia1.medium instances 0
ml.eia1.large instances 0
ml.eia1.xlarge instances 0
ml.eia2.medium instances 0
ml.eia2.large instances 0
ml.eia2.xlarge instances 0
Number of accelerators 0
Number of notebook instances 4
EBS volume size in GB for an instance 102400
SageMaker Ground Truth
Resource Default
Total labeling jobs 1
Total streaming labeling jobs 0
Max dataset objects per labeling job 10,000
Number of workteams 25
SageMaker Projects
Resource Default
Number of projects 500
SageMaker Pipelines
Resource Default
Number of pipelines 500
SageMaker Pipeline Executions
Resource Default
Number of pipeline executions 20
SageMaker Feature Store
Resource Default
Number of feature groups 10
SageMaker Processing
Resource Default
ml.c4.xlarge 4
ml.c4.2xlarge 4
ml.c4.4xlarge 4
ml.c4.8xlarge 4
ml.c5.xlarge 4
ml.c5.2xlarge 4
ml.c5.4xlarge 1
ml.c5.9xlarge 1
ml.c5.18xlarge 1
ml.m4.xlarge 4
ml.m4.2xlarge 4
ml.m4.4xlarge 2
ml.m4.10xlarge 1
ml.m4.16xlarge 1
ml.m5.large 4
ml.m5.xlarge 4
ml.m5.2xlarge 4
ml.m5.4xlarge 2
ml.m5.12xlarge 0
ml.m5.24xlarge 0
ml.p2.xlarge 0
ml.p2.8xlarge 0
ml.p2.16xlarge 0
ml.p3.2xlarge 0
ml.p3.8xlarge 0
ml.p3.16xlarge 0
ml.r5.large 4
ml.r5.xlarge 4
ml.r5.2xlarge 4
ml.r5.4xlarge 1
ml.r5.8xlarge 1
ml.r5.12xlarge 1
ml.r5.16xlarge 1
ml.r5.24xlarge 0
ml.t3.medium 4
ml.t3.large 4
ml.t3.xlarge 2
ml.t3.2xlarge 0
Longest run time for a processing job 5 days
Number of instances across processing jobs 4
Number of instances per processing job 20
Size of EBS volume for an instance 1 TB
Note

In case of SageMaker training, on-demand and spot instance quotas are tracked and modified separately. For example, with the default quotas, you can run up to 20 training jobs with ml.m4.xlarge on-demand instances and up to 20 training jobs with ml.m4.xlarge spot instances simultaneously.

SageMaker Training
Resource Default
ml.c4.xlarge instances 4
ml.c4.2xlarge instances 4
ml.c4.4xlarge instances 4
ml.c4.8xlarge instances 4
ml.c5.xlarge instances 4
ml.c5.2xlarge instances 4
ml.c5.4xlarge instances 1
ml.c5.9xlarge instances 1
ml.c5.18xlarge instances 0
ml.c5n.xlarge instances 0
ml.c5n.2xlarge instances 0
ml.c5n.4xlarge instances 0
ml.c5n.9xlarge instances 0
ml.c5n.18xlarge instances 0
ml.g4dn.xlarge instances 0
ml.g4dn.2xlarge instances 0
ml.g4dn.4xlarge instances 0
ml.g4dn.8xlarge instances 0
ml.g4dn.12xlarge instances 0
ml.g4dn.16xlarge instances 0
ml.m4.xlarge instances 4
ml.m4.2xlarge instances 4
ml.m4.4xlarge instances 2
ml.m4.10xlarge instances 0
ml.m4.16xlarge instances 0
ml.m5.large instances 4
ml.m5.xlarge instances 4
ml.m5.2xlarge instances 4
ml.m5.4xlarge instances 20
ml.m5.12xlarge instances 0
ml.m5.24xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
ml.p3dn.24xlarge instances 0
ml.p4d.24xlarge instances 0
The longest run time for a training job 5 days
Number of instances across training jobs 4
Number of instances per training job 20
Size of EBS volume for an instance 1 TB
SageMaker Managed Spot Training
Resource Default
ml.c4.xlarge instances 4
ml.c4.2xlarge instances 4
ml.c4.4xlarge instances 4
ml.c4.8xlarge instances 4
ml.c5.xlarge instances 4
ml.c5.2xlarge instances 4
ml.c5.4xlarge instances 1
ml.c5.9xlarge instances 1
ml.c5.18xlarge instances 0
ml.c5n.xlarge instances 0
ml.c5n.2xlarge instances 0
ml.c5n.4xlarge instances 0
ml.c5n.9xlarge instances 0
ml.c5n.18xlarge instances 0
ml.g4dn.xlarge instances 0
ml.g4dn.2xlarge instances 0
ml.g4dn.4xlarge instances 0
ml.g4dn.8xlarge instances 0
ml.g4dn.12xlarge instances 0
ml.g4dn.16xlarge instances 0
ml.m4.xlarge instances 4
ml.m4.2xlarge instances 4
ml.m4.4xlarge instances 2
ml.m4.10xlarge instances 0
ml.m4.16xlarge instances 0
ml.m5.large instances 4
ml.m5.xlarge instances 4
ml.m5.2xlarge instances 4
ml.m5.4xlarge instances 2
ml.m5.12xlarge instances 0
ml.m5.24xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
ml.p3dn.24xlarge instances 0
ml.p4d.24xlarge instances 0
Number of instances across training jobs 4
Number of instances per training job 20
SageMaker Autopilot
Resource Default
Maximum dataset size in GB 5
Maximum number of parallel Autopilot Jobs 1
SageMaker Automatic Model Hyperparameter Tuning
Resource Default
Number of concurrent hyperparameter tuning jobs 100
Number of parallel training jobs per hyperparameter tuning job 10
Number of training jobs per hyperparameter tuning job 500
SageMaker Experiments (Lineage Tracking / Experiment Tracking)
Resource Default
Number of trials 300
Number of experiments 5,000
Number of trial components for Experiments 50
Number of trial associations for Experiment Trial Components 500
Number of trial components for Experiment Trial Components 20,000
Number of actions 3,000
Number of artifacts 6,000
Number of associations 6,000
Number of contexts 500
SageMaker Hosting
Resource Default
ml.c4.large instances 0
ml.c4.xlarge instances 0
ml.c4.2xlarge instances 0
ml.c4.4xlarge instances 0
ml.c4.8xlarge instances 0
ml.c5.large instances 0
ml.c5.xlarge instances 0
ml.c5.2xlarge instances 0
ml.c5.4xlarge instances 0
ml.c5.9xlarge instances 0
ml.c5.12xlarge instances 0
ml.c5.18xlarge instances 0
ml.c5.24xlarge instances 0
ml.c5d.large instances 0
ml.c5d.xlarge instances 0
ml.c5d.2xlarge instances 0
ml.c5d.4xlarge instances 0
ml.c5d.9xlarge instances 0
ml.c5d.18xlarge instances 0
ml.c5n.large instances 0
ml.c5n.xlarge instances 0
ml.c5n.2xlarge instances 0
ml.c5n.4xlarge instances 0
ml.c5n.9xlarge instances 0
ml.c5n.18xlarge instances 0
ml.g4dn.xlarge instances 0
ml.g4dn.2xlarge instances 0
ml.g4dn.4xlarge instances 0
ml.g4dn.8xlarge instances 0
ml.g4dn.12xlarge instances 0
ml.g4dn.16xlarge instances 0
ml.m4.xlarge instances 2
ml.m4.2xlarge instances 0
ml.m4.4xlarge instances 0
ml.m4.10xlarge instances 0
ml.m4.16xlarge instances 0
ml.m5.large instances 2
ml.m5.xlarge instances 0
ml.m5.2xlarge instances 0
ml.m5.4xlarge instances 0
ml.m5.8xlarge instances 0
ml.m5.12xlarge instances 0
ml.m5.16xlarge instances 0
ml.m5.24xlarge instances 0
ml.m5d.large instances 0
ml.m5d.xlarge instances 0
ml.m5d.2xlarge instances 0
ml.m5d.4xlarge instances 0
ml.m5d.8xlarge instances 0
ml.m5d.12xlarge instances 0
ml.m5d.16xlarge instances 0
ml.m5d.24xlarge instances 0
ml.m5dn.large instances 0
ml.m5dn.xlarge instances 0
ml.m5dn.2xlarge instances 0
ml.m5dn.4xlarge instances 0
ml.m5dn.8xlarge instances 0
ml.m5dn.12xlarge instances 0
ml.m5dn.16xlarge instances 0
ml.m5dn.24xlarge instances 0
ml.m5n.large instances 0
ml.m5n.xlarge instances 0
ml.m5n.2xlarge instances 0
ml.m5n.4xlarge instances 0
ml.m5n.8xlarge instances 0
ml.m5n.12xlarge instances 0
ml.m5n.16xlarge instances 0
ml.m5n.24xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
ml.r5.large instances 0
ml.r5.xlarge instances 0
ml.r5.2xlarge instances 0
ml.r5.4xlarge instances 0
ml.r5.8xlarge instances 0
ml.r5.12xlarge instances 0
ml.r5.16xlarge instances 0
ml.r5.24xlarge instances 0
ml.r5d.large instances 0
ml.r5d.xlarge instances 0
ml.r5d.2xlarge instances 0
ml.r5d.4xlarge instances 0
ml.r5d.8xlarge instances 0
ml.r5d.12xlarge instances 0
ml.r5d.16xlarge instances 0
ml.r5d.24xlarge instances 0
ml.r5dn.large instances 0
ml.r5dn.xlarge instances 0
ml.r5dn.2xlarge instances 0
ml.r5dn.4xlarge instances 0
ml.r5dn.8xlarge instances 0
ml.r5dn.12xlarge instances 0
ml.r5dn.16xlarge instances 0
ml.r5dn.24xlarge instances 0
ml.r5n.large instances 0
ml.r5n.xlarge instances 0
ml.r5n.2xlarge instances 0
ml.r5n.4xlarge instances 0
ml.r5n.8xlarge instances 0
ml.r5n.12xlarge instances 0
ml.r5n.16xlarge instances 0
ml.r5n.24xlarge instances 0
ml.t2.medium instances 2
ml.t2.large instances 0
ml.t2.xlarge instances 0
ml.t2.2xlarge instances 0
ml.t3.medium instances 2
ml.t3.large instances 0
ml.t3.xlarge instances 0
ml.t3.2xlarge instances 0
Number of instances across endpoints 2
Number of instances per endpoint 0
Number of accelerators per endpoint 4
Total TPS for all endpoints 10,000
Maximum payload size for endpoint invocation 6 MB
Inference timeout for endpoint invocation 60 seconds
SageMaker Batch Transform
Resource Default
ml.c4.xlarge instances 4
ml.c4.2xlarge instances 4
ml.c4.4xlarge instances 4
ml.c4.8xlarge instances 4
ml.c5.xlarge instances 4
ml.c5.2xlarge instances 4
ml.c5.4xlarge instances 1
ml.c5.9xlarge instances 1
ml.c5.18xlarge instances 1
ml.m4.xlarge instances 4
ml.m4.2xlarge instances 4
ml.m4.4xlarge instances 2
ml.m4.10xlarge instances 1
ml.m4.16xlarge instances 1
ml.m5.large instances 4
ml.m5.xlarge instances 4
ml.m5.2xlarge instances 4
ml.m5.4xlarge instances 2
ml.m5.12xlarge instances 0
ml.m5.24xlarge instances 0
ml.p2.xlarge instances 0
ml.p2.8xlarge instances 0
ml.p2.16xlarge instances 0
ml.p3.2xlarge instances 0
ml.p3.8xlarge instances 0
ml.p3.16xlarge instances 0
Number of instances per transform job 4
SageMaker Human Task UI
Resource Default
Number of human task UIs 100