ProductionVariantServerlessConfig - Amazon SageMaker

ProductionVariantServerlessConfig

Specifies the serverless configuration for an endpoint variant.

Contents

MaxConcurrency

The maximum number of concurrent invocations your serverless endpoint can process.

Type: Integer

Valid Range: Minimum value of 1. Maximum value of 200.

Required: Yes

MemorySizeInMB

The memory size of your serverless endpoint, in MB. Valid values are in 1 GB increments: 1024 MB, 2048 MB, 3072 MB, 4096 MB, 5120 MB, or 6144 MB.

Type: Integer

Valid Range: Minimum value of 1024. Maximum value of 6144.

Required: Yes
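
The constraints above can be checked on the client side before a request is sent. The following is a minimal sketch in Python, not part of the API itself: the helper name build_serverless_config and the two module-level constants are hypothetical and introduced here only for illustration. It returns a plain dictionary with the two required fields, which is the shape one would typically supply as the ServerlessConfig value of a production variant.

```python
# Allowed values taken from the field descriptions above.
VALID_MEMORY_SIZES_MB = (1024, 2048, 3072, 4096, 5120, 6144)
MAX_CONCURRENCY_RANGE = (1, 200)


def build_serverless_config(memory_size_in_mb: int, max_concurrency: int) -> dict:
    """Hypothetical helper: validate both required fields and return the config dict."""
    # MemorySizeInMB must be one of the documented 1 GB increments.
    if memory_size_in_mb not in VALID_MEMORY_SIZES_MB:
        raise ValueError(
            f"MemorySizeInMB must be one of {VALID_MEMORY_SIZES_MB}, got {memory_size_in_mb}"
        )
    # MaxConcurrency must fall inside the documented inclusive range.
    low, high = MAX_CONCURRENCY_RANGE
    if not low <= max_concurrency <= high:
        raise ValueError(
            f"MaxConcurrency must be between {low} and {high}, got {max_concurrency}"
        )
    return {
        "MemorySizeInMB": memory_size_in_mb,
        "MaxConcurrency": max_concurrency,
    }


# Example values chosen arbitrarily for illustration.
config = build_serverless_config(memory_size_in_mb=2048, max_concurrency=5)
print(config)
```

Validating locally is a design choice rather than a requirement: the service enforces the same ranges, so the helper only surfaces mistakes earlier, such as a memory size of 1536 that lies inside the valid range but is not a 1 GB increment.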

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: