AWS::SageMaker::InferenceComponent InferenceComponentComputeResourceRequirements - AWS CloudFormation

AWS::SageMaker::InferenceComponent InferenceComponentComputeResourceRequirements

Defines the compute resources to allocate to run a model that you assign to an inference component. These resources include CPU cores, accelerators, and memory.

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

Properties

MaxMemoryRequiredInMb

The maximum MB of memory to allocate to run a model that you assign to an inference component.

Required: No

Type: Integer

Minimum: 128

Update requires: No interruption

MinMemoryRequiredInMb

The minimum MB of memory to allocate to run a model that you assign to an inference component.

Required: No

Type: Integer

Minimum: 128

Update requires: No interruption

NumberOfAcceleratorDevicesRequired

The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and AWS Inferentia.

Required: No

Type: Number

Minimum: 1

Update requires: No interruption

NumberOfCpuCoresRequired

The number of CPU cores to allocate to run a model that you assign to an inference component.

Required: No

Type: Number

Minimum: 0

Update requires: No interruption