interface AsyncInferenceClientConfigProperty
Language | Type name |
---|---|
.NET | Amazon.CDK.AWS.Sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty |
Java | software.amazon.awscdk.services.sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty |
Python | aws_cdk.aws_sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty |
TypeScript | @aws-cdk/aws-sagemaker » CfnEndpointConfig » AsyncInferenceClientConfigProperty |
Configures the behavior of the client used by SageMaker to interact with the model container during asynchronous inference.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import * as sagemaker from '@aws-cdk/aws-sagemaker';
const asyncInferenceClientConfigProperty: sagemaker.CfnEndpointConfig.AsyncInferenceClientConfigProperty = {
maxConcurrentInvocationsPerInstance: 123,
};
Properties
Name | Type | Description |
---|---|---|
max | number | The maximum number of concurrent requests sent by the SageMaker client to the model container. |
maxConcurrentInvocationsPerInstance?
Type:
number
(optional)
The maximum number of concurrent requests sent by the SageMaker client to the model container.
If no value is provided, SageMaker will choose an optimal value for you.