interface InferenceSpecificationProperty
Language | Type name |
---|---|
.NET | Amazon.CDK.AWS.Sagemaker.CfnModelPackage.InferenceSpecificationProperty |
Java | software.amazon.awscdk.services.sagemaker.CfnModelPackage.InferenceSpecificationProperty |
Python | aws_cdk.aws_sagemaker.CfnModelPackage.InferenceSpecificationProperty |
TypeScript | @aws-cdk/aws-sagemaker » CfnModelPackage » InferenceSpecificationProperty |
Defines how to perform inference generation after a training job is run.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import * as sagemaker from '@aws-cdk/aws-sagemaker';
declare const modelInput: any;
const inferenceSpecificationProperty: sagemaker.CfnModelPackage.InferenceSpecificationProperty = {
  containers: [{
    image: 'image',
    // the properties below are optional
    containerHostname: 'containerHostname',
    environment: {
      environmentKey: 'environment',
    },
    framework: 'framework',
    frameworkVersion: 'frameworkVersion',
    imageDigest: 'imageDigest',
    modelDataUrl: 'modelDataUrl',
    modelInput: modelInput,
    nearestModelName: 'nearestModelName',
    productId: 'productId',
  }],
  supportedContentTypes: ['supportedContentTypes'],
  supportedResponseMimeTypes: ['supportedResponseMimeTypes'],
  // the properties below are optional
  supportedRealtimeInferenceInstanceTypes: ['supportedRealtimeInferenceInstanceTypes'],
  supportedTransformInstanceTypes: ['supportedTransformInstanceTypes'],
};
Properties
Name | Type | Description |
---|---|---|
containers | IResolvable | IResolvable | Model [] | The Amazon ECR registry path of the Docker image that contains the inference code. |
supportedContentTypes | string[] | The supported MIME types for the input data. |
supportedResponseMimeTypes | string[] | The supported MIME types for the output data. |
supportedRealtimeInferenceInstanceTypes? | string[] | A list of the instance types that are used to generate inferences in real-time. |
supportedTransformInstanceTypes? | string[] | A list of the instance types on which a transformation job can be run or on which an endpoint can be deployed. |
containers
Type: IResolvable | IResolvable | Model []
The Amazon ECR registry path of the Docker image that contains the inference code.
supportedContentTypes
Type: string[]
The supported MIME types for the input data.
supportedResponseMimeTypes
Type: string[]
The supported MIME types for the output data.
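As a rough illustration of what the two MIME-type lists might hold, the sketch below fills them with common values and runs a loose format check. The `MimeTypeLists` interface and the `isMimeType` helper are hypothetical and defined locally for this example; they are not part of the CDK, and the check is only an assumed "type/subtype" pattern, not a rule enforced by SageMaker.

```typescript
// Hypothetical local shape mirroring the two required MIME-type properties.
interface MimeTypeLists {
  supportedContentTypes: string[];
  supportedResponseMimeTypes: string[];
}

// Loose syntactic check for "type/subtype".
// This pattern is an assumption for illustration, not a CDK or SageMaker rule.
function isMimeType(value: string): boolean {
  return /^[a-z0-9!#$&^_.+-]+\/[a-z0-9!#$&^_.+-]+$/i.test(value);
}

// Example values only; replace them with the types your container accepts and returns.
const mimeTypes: MimeTypeLists = {
  supportedContentTypes: ['text/csv', 'application/json'],
  supportedResponseMimeTypes: ['application/json'],
};

// True when every entry in both lists looks like a MIME type.
const allValid: boolean = mimeTypes.supportedContentTypes
  .concat(mimeTypes.supportedResponseMimeTypes)
  .every((value) => isMimeType(value));
```

A check like this would flag a placeholder string left over from the generated example, since a placeholder contains no `/` separator.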
supportedRealtimeInferenceInstanceTypes?
Type: string[] (optional)
A list of the instance types that are used to generate inferences in real-time.
This parameter is required for unversioned models, and optional for versioned models.
supportedTransformInstanceTypes?
Type: string[] (optional)
A list of the instance types on which a transformation job can be run or on which an endpoint can be deployed.
This parameter is required for unversioned models, and optional for versioned models.
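Because both instance-type lists are required for unversioned models and optional for versioned ones, it can help to check for them before synthesizing a stack. The following is a minimal sketch under stated assumptions: the `InstanceTypeLists` interface and the `missingInstanceTypes` function are hypothetical names defined here, the `isVersioned` flag must be supplied by the caller, and treating an empty list as missing is an assumption of this example rather than documented behavior.

```typescript
// Hypothetical local shape mirroring the two optional instance-type properties.
interface InstanceTypeLists {
  supportedRealtimeInferenceInstanceTypes?: string[];
  supportedTransformInstanceTypes?: string[];
}

// Returns the names of the instance-type properties that are absent or empty
// for an unversioned model package. Versioned packages may omit both,
// so the result is always empty for them.
function missingInstanceTypes(spec: InstanceTypeLists, isVersioned: boolean): string[] {
  if (isVersioned) {
    return [];
  }
  const missing: string[] = [];
  const realtime = spec.supportedRealtimeInferenceInstanceTypes;
  const transform = spec.supportedTransformInstanceTypes;
  // Assumption: an empty list is treated the same as an omitted property.
  if (realtime === undefined || realtime.length === 0) {
    missing.push('supportedRealtimeInferenceInstanceTypes');
  }
  if (transform === undefined || transform.length === 0) {
    missing.push('supportedTransformInstanceTypes');
  }
  return missing;
}

// Example: an unversioned package that only lists a real-time instance type.
// 'ml.m5.large' is an example value; choose the types your model supports.
const problems = missingInstanceTypes(
  { supportedRealtimeInferenceInstanceTypes: ['ml.m5.large'] },
  false,
);
```

Here `problems` would contain only `'supportedTransformInstanceTypes'`, which the caller could report before deployment.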