AWS::SageMaker::InferenceComponent InferenceComponentComputeResourceRequirements - AWS CloudFormation

Defines the compute resources to allocate to run a model, plus any adapter models, that you assign to an inference component. These resources include CPU cores, accelerators, and memory.


To declare this entity in your AWS CloudFormation template, use the following syntax:
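
A minimal YAML sketch of the declaration. It assumes the four properties described on this page carry the names used in the SageMaker inference component API (`MaxMemoryRequiredInMb`, `MinMemoryRequiredInMb`, `NumberOfAcceleratorDevicesRequired`, `NumberOfCpuCoresRequired`). The types shown are the ones listed for each property below.

```yaml
# Property names are assumed from the SageMaker API; types are taken from this page.
MaxMemoryRequiredInMb: Integer
MinMemoryRequiredInMb: Integer
NumberOfAcceleratorDevicesRequired: Number
NumberOfCpuCoresRequired: Number
```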



MaxMemoryRequiredInMb

The maximum amount of memory, in MB, to allocate to run a model that you assign to an inference component.

Required: No

Type: Integer

Minimum: 128

Update requires: No interruption


MinMemoryRequiredInMb

The minimum amount of memory, in MB, to allocate to run a model that you assign to an inference component.

Required: No

Type: Integer

Minimum: 128

Update requires: No interruption


NumberOfAcceleratorDevicesRequired

The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and AWS Inferentia devices.

Required: No

Type: Number

Minimum: 1

Update requires: No interruption


NumberOfCpuCoresRequired

The number of CPU cores to allocate to run a model that you assign to an inference component.

Required: No

Type: Number

Minimum: 0

Update requires: No interruption
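
As an illustration, the fragment below sketches how these properties might be filled in. The values are hypothetical and chosen only to satisfy the minimums listed above (memory of at least 128 MB, at least 1 accelerator). The enclosing key name `ComputeResourceRequirements` and the property names are assumptions based on the SageMaker inference component API, not taken from this page.

```yaml
# Hypothetical values; the key name and property names are assumed, not confirmed by this page.
ComputeResourceRequirements:
  MinMemoryRequiredInMb: 1024            # must be 128 or greater
  MaxMemoryRequiredInMb: 4096            # must be 128 or greater
  NumberOfAcceleratorDevicesRequired: 1  # minimum 1 when specified
  NumberOfCpuCoresRequired: 2            # minimum 0 per this page
```

Setting the maximum memory at or above the minimum, as in this sketch, is likely the sensible choice. This page does not state that the relationship is enforced.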