AWS services or capabilities described in AWS Documentation may vary by region/location. Click Getting Started with Amazon AWS to see specific differences applicable to the China (Beijing) Region.
Defines the configuration for managed tier checkpointing in a HyperPod cluster. Managed tier checkpointing uses multiple storage tiers, including cluster CPU memory, to provide faster checkpoint operations and improved fault tolerance for large-scale model training. The system automatically saves checkpoints at high frequency to memory and periodically persists them to durable storage, like Amazon S3.
Namespace: Amazon.SageMaker.Model
Assembly: AWSSDK.SageMaker.dll
Version: 3.x.y.z
public class ClusterTieredStorageConfig
The ClusterTieredStorageConfig type exposes the following members
Name | Description | |
---|---|---|
![]() |
ClusterTieredStorageConfig() |
Name | Type | Description | |
---|---|---|---|
![]() |
InstanceMemoryAllocationPercentage | System.Int32 |
Gets and sets the property InstanceMemoryAllocationPercentage. The percentage (int) of cluster memory to allocate for checkpointing. |
![]() |
Mode | Amazon.SageMaker.ClusterConfigMode |
Gets and sets the property Mode.
Specifies whether managed tier checkpointing is enabled or disabled for the HyperPod
cluster. When set to |
.NET:
Supported in: 8.0 and newer, Core 3.1
.NET Standard:
Supported in: 2.0
.NET Framework:
Supported in: 4.5 and newer, 3.5