RecommendationJobContainerConfig - Amazon SageMaker

RecommendationJobContainerConfig

Specifies mandatory fields for running an Inference Recommender job directly in the CreateInferenceRecommendationsJob API. The fields specified in ContainerConfig override the corresponding fields in the model package. Use ContainerConfig if you want to specify these fields for the recommendation job but don't want to edit them in your model package.
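The override behavior described above can be pictured as a dictionary merge. This is a minimal sketch for illustration only, not the service's implementation. The field values and the helper name effective_fields are hypothetical.

```python
# Hypothetical values recorded in a model package (illustration only).
model_package_fields = {
    "Domain": "COMPUTER_VISION",
    "Framework": "PYTORCH",
    "FrameworkVersion": "1.12.0",
    "Task": "IMAGE_CLASSIFICATION",
}

# Hypothetical fields supplied in ContainerConfig for one recommendation job.
container_config = {
    "FrameworkVersion": "1.13.1",
    "NearestModelName": "resnet50",
}


def effective_fields(package_fields, overrides):
    """Return a copy of the package fields with ContainerConfig values taking precedence."""
    merged = dict(package_fields)  # copy, so the model package values stay untouched
    merged.update(overrides)       # ContainerConfig wins on any shared key
    return merged


effective = effective_fields(model_package_fields, container_config)
print(effective["FrameworkVersion"])  # 1.13.1
```

The model package dictionary is left unchanged, which mirrors the stated purpose of ContainerConfig: adjusting fields for the job without editing the model package.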

Contents

DataInputConfig

Specifies the name and shape of the expected data inputs for your trained model, in JSON dictionary form. SageMaker Neo uses this field to optimize your model. For more information, see DataInputConfig.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 1024.

Pattern: [\S\s]+

Required: No
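As a rough sketch, a DataInputConfig value can be produced by serializing a name-to-shape mapping to a JSON string and checking it against the length constraint above. The input name input_1, its shape, and the helper build_data_input_config are assumptions made for illustration. The expected names and shapes depend on your model and framework.

```python
import json


def build_data_input_config(shapes):
    """Serialize an input-name -> shape mapping and enforce the 1..1024 length limit."""
    encoded = json.dumps(shapes)
    if not 1 <= len(encoded) <= 1024:
        raise ValueError("DataInputConfig must be 1 to 1024 characters long")
    return encoded


# Hypothetical single image input: batch of 1, 3 channels, 224 x 224 pixels.
data_input_config = build_data_input_config({"input_1": [1, 3, 224, 224]})
print(data_input_config)  # {"input_1": [1, 3, 224, 224]}
```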

Domain

The machine learning domain of the model and its components.

Valid Values: COMPUTER_VISION | NATURAL_LANGUAGE_PROCESSING | MACHINE_LEARNING

Type: String

Required: No

Framework

The machine learning framework of the container image.

Valid Values: TENSORFLOW | PYTORCH | XGBOOST | SAGEMAKER-SCIKIT-LEARN

Type: String

Required: No

FrameworkVersion

The framework version of the container image.

Type: String

Length Constraints: Minimum length of 3. Maximum length of 10.

Pattern: [0-9]\.[A-Za-z0-9.-]+

Required: No
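A client-side check of the length and pattern constraints above might look like the following sketch. It assumes the pattern must match the entire string, which this page does not state explicitly. The helper name is_valid_framework_version is hypothetical.

```python
import re

# Pattern copied from the constraint above.
FRAMEWORK_VERSION_PATTERN = re.compile(r"[0-9]\.[A-Za-z0-9.-]+")


def is_valid_framework_version(version):
    """Check the 3..10 length limit and the pattern (assumed to be a full match)."""
    if not 3 <= len(version) <= 10:
        return False
    return FRAMEWORK_VERSION_PATTERN.fullmatch(version) is not None


print(is_valid_framework_version("1.13.1"))  # True
print(is_valid_framework_version("2"))       # False
```

Under the full-match assumption, the pattern allows only a single digit before the first period.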

NearestModelName

The name of a pre-trained machine learning model benchmarked by Amazon SageMaker Inference Recommender that matches your model.

Valid Values: efficientnetb7 | unet | xgboost | faster-rcnn-resnet101 | nasnetlarge | vgg16 | inception-v3 | mask-rcnn | sagemaker-scikit-learn | densenet201-gluon | resnet18v2-gluon | xception | densenet201 | yolov4 | resnet152 | bert-base-cased | xceptionV1-keras | resnet50 | retinanet

Type: String

Required: No

PayloadConfig

Specifies the SamplePayloadUrl and all other sample payload-related fields.

Type: RecommendationJobPayloadConfig object

Required: No

SupportedEndpointType

The endpoint type to receive recommendations for. By default this is null, and the results of the inference recommendation job return a combined list of both real-time and serverless benchmarks. By specifying a value for this field, you can receive a longer list of benchmarks for the desired endpoint type.

Type: String

Valid Values: RealTime | Serverless

Required: No

SupportedInstanceTypes

A list of the instance types that are used to generate inferences in real time.

Type: Array of strings

Required: No

SupportedResponseMIMETypes

The supported MIME types for the output data.

Type: Array of strings

Length Constraints: Maximum length of 1024.

Pattern: ^[-\w]+\/.+$

Required: No
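The pattern above can be checked locally before submitting a job. The sketch below assumes the 1024-character limit applies to each string in the array. The helper name is_valid_mime_type is hypothetical.

```python
import re

# Pattern copied from the constraint above: a type token, a slash, then a non-empty subtype.
MIME_TYPE_PATTERN = re.compile(r"^[-\w]+\/.+$")


def is_valid_mime_type(value):
    """Check one MIME type string against the length limit and the pattern."""
    return len(value) <= 1024 and MIME_TYPE_PATTERN.match(value) is not None


supported_response_mime_types = ["application/json", "text/csv"]
print(all(is_valid_mime_type(m) for m in supported_response_mime_types))  # True
print(is_valid_mime_type("json"))  # False
```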

Task

The machine learning task that the model accomplishes.

Valid Values: IMAGE_CLASSIFICATION | OBJECT_DETECTION | TEXT_GENERATION | IMAGE_SEGMENTATION | FILL_MASK | CLASSIFICATION | REGRESSION | OTHER

Type: String

Required: No
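Taken together, the fields on this page could be assembled into a plain dictionary and checked against the documented Valid Values lists before being passed to an SDK call. This is only a sketch: the S3 URL, the instance types, and the helper invalid_enum_fields are illustrative assumptions, and the code does not call any AWS API.

```python
# Valid values copied from the field descriptions on this page.
VALID_VALUES = {
    "Domain": {"COMPUTER_VISION", "NATURAL_LANGUAGE_PROCESSING", "MACHINE_LEARNING"},
    "Framework": {"TENSORFLOW", "PYTORCH", "XGBOOST", "SAGEMAKER-SCIKIT-LEARN"},
    "SupportedEndpointType": {"RealTime", "Serverless"},
    "Task": {
        "IMAGE_CLASSIFICATION", "OBJECT_DETECTION", "TEXT_GENERATION",
        "IMAGE_SEGMENTATION", "FILL_MASK", "CLASSIFICATION", "REGRESSION", "OTHER",
    },
}


def invalid_enum_fields(config):
    """Return the names of enumerated fields whose value is not in the documented list."""
    return [
        field
        for field, allowed in VALID_VALUES.items()
        if field in config and config[field] not in allowed
    ]


# Hypothetical ContainerConfig; every field is optional.
container_config = {
    "Domain": "COMPUTER_VISION",
    "Framework": "PYTORCH",
    "FrameworkVersion": "1.13.1",
    "NearestModelName": "resnet50",
    "PayloadConfig": {"SamplePayloadUrl": "s3://example-bucket/payload.tar.gz"},
    "SupportedEndpointType": "RealTime",
    "SupportedInstanceTypes": ["ml.c5.xlarge", "ml.m5.large"],
    "SupportedResponseMIMETypes": ["application/json"],
    "Task": "IMAGE_CLASSIFICATION",
}

print(invalid_enum_fields(container_config))  # []
```

Because the enumerated values are case-sensitive strings, a local check like this can catch a value such as "realtime" before the request is sent.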

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: