InferenceComponentSpecification
Details about the resources to deploy with this inference component, including the model, container, and compute resources.
Contents
- ComputeResourceRequirements
-
The compute resources allocated to run the model assigned to the inference component.
Type: InferenceComponentComputeResourceRequirements object
Required: Yes
- Container
-
Defines a container that provides the runtime environment for a model that you deploy with an inference component.
Type: InferenceComponentContainerSpecification object
Required: No
- ModelName
-
The name of an existing SageMaker model object in your account that you want to deploy with the inference component.
Type: String
Length Constraints: Maximum length of 63.
Pattern:
^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?
Required: No
- StartupParameters
-
Settings that take effect while the model container starts up.
Type: InferenceComponentStartupParameters object
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: