ReservedCapacityOffering - Amazon SageMaker

ReservedCapacityOffering

Details about a reserved capacity offering for a training plan offering.

For more information about how to reserve GPU capacity for your SageMaker HyperPod clusters using Amazon SageMaker Training Plan, see CreateTrainingPlan .

Contents

InstanceCount

The number of instances in the reserved capacity offering.

Type: Integer

Valid Range: Minimum value of 0. Maximum value of 256.

Required: Yes

InstanceType

The instance type for the reserved capacity offering.

Type: String

Valid Values: ml.p4d.24xlarge | ml.p5.48xlarge | ml.p5e.48xlarge | ml.p5en.48xlarge | ml.trn1.32xlarge | ml.trn2.48xlarge | ml.p6-b200.48xlarge | ml.p4de.24xlarge | ml.p6e-gb200.36xlarge | ml.p5.4xlarge

Required: Yes

AvailabilityZone

The availability zone for the reserved capacity offering.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 32.

Pattern: [a-z]+\-[0-9a-z\-]+

Required: No

DurationHours

The number of whole hours in the total duration for this reserved capacity offering.

Type: Long

Valid Range: Minimum value of 0. Maximum value of 87600.

Required: No

DurationMinutes

The additional minutes beyond whole hours in the total duration for this reserved capacity offering.

Type: Long

Valid Range: Minimum value of 0. Maximum value of 59.

Required: No

EndTime

The end time of the reserved capacity offering.

Type: Timestamp

Required: No

ReservedCapacityType

The type of reserved capacity offering.

Type: String

Valid Values: UltraServer | Instance

Required: No

StartTime

The start time of the reserved capacity offering.

Type: Timestamp

Required: No

UltraServerCount

The number of UltraServers included in this reserved capacity offering.

Type: Integer

Valid Range: Minimum value of 1.

Required: No

UltraServerType

The type of UltraServer included in this reserved capacity offering, such as ml.u-p6e-gb200x72.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 64.

Pattern: ml.[a-z0-9\-.]+

Required: No

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: