AWS::SageMaker::ModelPackage AdditionalInferenceSpecificationDefinition - AWS CloudFormation

AWS::SageMaker::ModelPackage AdditionalInferenceSpecificationDefinition

A structure of additional Inference Specification. Additional Inference Specification specifies details about inference jobs that can be run with models based on this model package

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

JSON

{ "Containers" : [ ModelPackageContainerDefinition, ... ], "Description" : String, "Name" : String, "SupportedContentTypes" : [ String, ... ], "SupportedRealtimeInferenceInstanceTypes" : [ String, ... ], "SupportedResponseMIMETypes" : [ String, ... ], "SupportedTransformInstanceTypes" : [ String, ... ] }

Properties

Containers

The Amazon ECR registry path of the Docker image that contains the inference code.

Required: Yes

Type: List of ModelPackageContainerDefinition

Maximum: 15

Update requires: No interruption

Description

A description of the additional Inference specification

Required: No

Type: String

Maximum: 1024

Pattern: [\p{L}\p{M}\p{Z}\p{S}\p{N}\p{P}]*

Update requires: No interruption

Name

A unique name to identify the additional inference specification. The name must be unique within the list of your additional inference specifications for a particular model package.

Required: Yes

Type: String

Minimum: 1

Maximum: 63

Pattern: ^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}$

Update requires: No interruption

SupportedContentTypes

The supported MIME types for the input data.

Required: No

Type: List of String

Update requires: No interruption

SupportedRealtimeInferenceInstanceTypes

A list of the instance types that are used to generate inferences in real-time.

Required: No

Type: List of String

Update requires: No interruption

SupportedResponseMIMETypes

The supported MIME types for the output data.

Required: No

Type: List of String

Update requires: No interruption

SupportedTransformInstanceTypes

A list of the instance types on which a transformation job can be run or on which an endpoint can be deployed.

Required: No

Type: List of String

Update requires: No interruption