View training job metrics
You can view the metrics emitted by your Amazon SageMaker training jobs in either the Amazon CloudWatch console or the SageMaker AI console.
Monitor training job metrics (CloudWatch console)
You can monitor the metrics that a training job emits in real time in the CloudWatch console.
To monitor training job metrics (CloudWatch console)
- Open the CloudWatch console at https://console.aws.amazon.com/cloudwatch.
- Choose Metrics, then choose /aws/sagemaker/TrainingJobs.
- Choose TrainingJobName.
- On the All metrics tab, choose the names of the training metrics that you want to monitor.
- On the Graphed metrics tab, configure the graph options. For more information about using CloudWatch graphs, see Graph Metrics in the Amazon CloudWatch User Guide.
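The same metrics can also be read programmatically. The following is a minimal sketch, not an official recipe. It assumes the metrics sit in the /aws/sagemaker/TrainingJobs namespace under a TrainingJobName dimension, mirroring the console steps above. The helper names (build_metric_query, fetch_metric_points), the job name, and the metric name train:loss are hypothetical. The client is passed in as an argument so the functions can be exercised without AWS access.

```python
from datetime import datetime, timedelta, timezone


def build_metric_query(job_name, metric_name, hours=3, period=60):
    """Build keyword arguments for a CloudWatch GetMetricStatistics call.

    Assumption: the training job's metrics are published under the
    TrainingJobName dimension, as the console steps suggest.
    """
    end = datetime.now(timezone.utc)
    return {
        "Namespace": "/aws/sagemaker/TrainingJobs",
        "MetricName": metric_name,
        "Dimensions": [{"Name": "TrainingJobName", "Value": job_name}],
        "StartTime": end - timedelta(hours=hours),
        "EndTime": end,
        "Period": period,  # seconds per datapoint
        "Statistics": ["Average"],
    }


def fetch_metric_points(cloudwatch, job_name, metric_name, hours=3, period=60):
    """Return (timestamp, average) pairs sorted by timestamp.

    `cloudwatch` is any object exposing get_metric_statistics, for example
    a boto3 CloudWatch client.
    """
    response = cloudwatch.get_metric_statistics(
        **build_metric_query(job_name, metric_name, hours, period)
    )
    # CloudWatch does not guarantee datapoint order, so sort explicitly.
    points = sorted(response["Datapoints"], key=lambda p: p["Timestamp"])
    return [(p["Timestamp"], p["Average"]) for p in points]


# Example usage (requires boto3 and AWS credentials; names are hypothetical):
#   import boto3
#   cw = boto3.client("cloudwatch")
#   for ts, value in fetch_metric_points(cw, "my-training-job", "train:loss"):
#       print(ts.isoformat(), value)
```

Passing the client in, instead of creating it inside the function, keeps the query logic easy to test with a stub and leaves region and credential handling to the caller.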
Monitor training job metrics (SageMaker AI console)
You can monitor the metrics that a training job emits in real time by using the SageMaker AI console.
To monitor training job metrics (SageMaker AI console)
- Open the SageMaker AI console at https://console.aws.amazon.com/sagemaker.
- Choose Training jobs, then choose the training job whose metrics you want to see.
- Choose TrainingJobName.
- In the Monitor section, you can review the graphs of instance utilization and algorithm metrics.
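For a quick check outside the console, the most recent value of each algorithm metric can be read from the training job description. This is a minimal sketch under stated assumptions: it relies on the FinalMetricDataList field of the DescribeTrainingJob response, which reports only the last value per metric and not the time series shown in the Monitor graphs. The helper name final_metrics and the job name are hypothetical.

```python
def final_metrics(sagemaker_client, job_name):
    """Return {metric name: last reported value} for one training job.

    `sagemaker_client` is any object exposing describe_training_job,
    for example a boto3 SageMaker client.
    """
    response = sagemaker_client.describe_training_job(TrainingJobName=job_name)
    # The field can be missing when the job has not emitted any metrics yet,
    # so fall back to an empty list.
    return {
        metric["MetricName"]: metric["Value"]
        for metric in response.get("FinalMetricDataList", [])
    }


# Example usage (requires boto3 and AWS credentials; job name is hypothetical):
#   import boto3
#   sm = boto3.client("sagemaker")
#   for name, value in final_metrics(sm, "my-training-job").items():
#       print(f"{name}: {value}")
```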