Amazon SageMaker Debugger in Amazon SageMaker Studio - Amazon SageMaker

Amazon SageMaker Debugger in Amazon SageMaker Studio

Use the Amazon SageMaker Debugger dashboards in Amazon SageMaker Studio to analyze your model performance and system bottlenecks while running training jobs on Amazon Elastic Compute Cloud (Amazon EC2) instances. Gain insights into your training jobs and improve your model training performance and accuracy with the Debugger dashboards. By default, Debugger monitors system metrics (CPU, GPU, CPU and GPU memory, network, and data I/O) every 500 milliseconds and basic output tensors (loss and accuracy) every 500 iterations for training jobs. You can also further customize Debugger configuration parameter values and adjust the saving intervals through the Studio UI or using the Amazon SageMaker Python SDK.

Important

If you're using existing Studio apps, restart them to use the new features. For instructions on how to restart and update your Studio environment, see Update Amazon SageMaker Studio.


            An example of a Studio Debugger dashboard