RecommendationJobStoppingConditions

Specifies conditions for stopping a job. When a job reaches a stopping condition limit, SageMaker ends the job.

Contents

FlatInvocations

Specifies whether to stop a load test when the number of invocations (TPS) peaks and flattens, which indicates that the instance has reached capacity. The default value is Stop. To let the load test continue after invocations have flattened, set the value to Continue.

Type: String

Valid Values: Continue | Stop

Required: No

MaxInvocations

The maximum number of requests per minute expected for the endpoint.

Type: Integer

Required: No

ModelLatencyThresholds

The latency thresholds for the model. Model latency is the time a model takes to respond, as viewed from SageMaker. It includes the local communication time taken to send the request to the model's container and fetch the response, plus the time taken to complete the inference in the container.

Type: Array of ModelLatencyThreshold objects

Array Members: Fixed number of 1 item.

Required: No
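
The three fields above can be combined into a single structure. The following is a minimal sketch, not an official sample: `build_stopping_conditions` is a hypothetical helper, and the `Percentile` and `ValueInMilliseconds` keys are assumed from the separate ModelLatencyThreshold data type, which this page does not describe. It builds a plain dictionary and checks the constraints listed on this page (the valid values for FlatInvocations and the single-item limit on ModelLatencyThresholds) without calling any AWS service.

```python
from typing import Any, Dict, Optional

# Valid values for FlatInvocations, as listed on this page.
VALID_FLAT_INVOCATIONS = {"Continue", "Stop"}


def build_stopping_conditions(
    max_invocations: Optional[int] = None,
    latency_percentile: Optional[str] = None,
    latency_ms: Optional[int] = None,
    flat_invocations: Optional[str] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: assemble a RecommendationJobStoppingConditions dict.

    All fields are optional, so unset arguments are simply left out.
    """
    conditions: Dict[str, Any] = {}

    if max_invocations is not None:
        conditions["MaxInvocations"] = max_invocations

    # ModelLatencyThresholds is an array with exactly one item.
    # The key names below are assumed from the ModelLatencyThreshold type.
    if latency_percentile is not None or latency_ms is not None:
        if latency_percentile is None or latency_ms is None:
            raise ValueError("latency_percentile and latency_ms must be set together")
        conditions["ModelLatencyThresholds"] = [
            {"Percentile": latency_percentile, "ValueInMilliseconds": latency_ms}
        ]

    if flat_invocations is not None:
        if flat_invocations not in VALID_FLAT_INVOCATIONS:
            raise ValueError("flat_invocations must be 'Continue' or 'Stop'")
        conditions["FlatInvocations"] = flat_invocations

    return conditions


# Example values are illustrative only.
example = build_stopping_conditions(
    max_invocations=300,
    latency_percentile="P95",
    latency_ms=100,
    flat_invocations="Continue",
)
```

A dictionary built this way would typically be supplied as the stopping conditions of an Inference Recommender job request; check the request syntax of the SDK you use before relying on the exact key names.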
