Monitoring AWS Glue Spark jobs

Spark Metrics available in AWS Glue Studio

The Metrics tab shows the metrics collected during a job run when profiling is turned on. For Spark jobs, the tab shows the following graphs:

  • ETL Data Movement

  • Memory Profile: Driver and Executors

Choose View additional metrics to show the following graphs:

  • ETL Data Movement

  • Memory Profile: Driver and Executors

  • Data Shuffle Across Executors

  • CPU Load: Driver and Executors

  • Job Execution: Active Executors, Completed Stages & Maximum Needed Executors

Data for these graphs is pushed to CloudWatch metrics if the job is configured to collect metrics. For more information about how to turn on metrics and interpret the graphs, see Job monitoring and debugging.
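As a rough illustration of reading the same data outside the console, the following Python sketch builds a CloudWatch query for one job metric. It rests on several assumptions that are not stated on this page: that the metrics live in a `Glue` namespace with `JobName`, `JobRunId`, and `Type` dimensions, that `glue.driver.aggregate.bytesRead` is one of the published metric names, and that the `--enable-metrics` job argument turns collection on. The job name `my-etl-job` and all helper names are hypothetical.

```python
from datetime import datetime, timedelta, timezone

# Assumption: AWS Glue job metrics are published under this CloudWatch namespace.
GLUE_NAMESPACE = "Glue"


def metrics_job_arguments():
    """Return job arguments that request metric collection (assumed flag and value)."""
    return {"--enable-metrics": "true"}


def build_metric_query(job_name, metric_name, metric_type,
                       hours=1, end=None, job_run_id="ALL"):
    """Build keyword arguments for CloudWatch get_metric_statistics.

    metric_type is assumed to be "count" (aggregated values) or "gauge"
    (point-in-time values); job_run_id "ALL" is assumed to cover every run.
    """
    end = end or datetime.now(timezone.utc)
    return {
        "Namespace": GLUE_NAMESPACE,
        "MetricName": metric_name,
        "Dimensions": [
            {"Name": "JobName", "Value": job_name},
            {"Name": "JobRunId", "Value": job_run_id},
            {"Name": "Type", "Value": metric_type},
        ],
        "StartTime": end - timedelta(hours=hours),
        "EndTime": end,
        "Period": 60,  # seconds per datapoint
        # Sum suits counters; Average suits gauges such as memory usage.
        "Statistics": ["Sum"] if metric_type == "count" else ["Average"],
    }


def fetch_datapoints(query):
    """Run the query against CloudWatch; needs boto3 and AWS credentials."""
    import boto3  # imported lazily so the helpers above work offline

    cloudwatch = boto3.client("cloudwatch")
    response = cloudwatch.get_metric_statistics(**query)
    return sorted(response["Datapoints"], key=lambda point: point["Timestamp"])


if __name__ == "__main__":
    # Hypothetical job name; prints the query without calling AWS.
    print(build_metric_query("my-etl-job", "glue.driver.aggregate.bytesRead", "count"))
```

Passing the result of `build_metric_query` to `fetch_datapoints` would return the datapoints sorted by timestamp, provided the job was run with metrics collection turned on. Check the metric names and dimensions against the CloudWatch console for your account before relying on them.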

Example ETL data movement graph

The ETL Data Movement graph shows the following metrics:


Figure: The graph for ETL Data Movement in the Metrics tab of the AWS Glue console.

Example Memory profile graph

The Memory Profile graph shows the following metrics:


Figure: The graph for Memory Profile in the Metrics tab of the AWS Glue console.

Example Data shuffle across executors graph

The Data Shuffle Across Executors graph shows the following metrics:


Figure: The graph for Data Shuffle Across Executors in the Metrics tab of the AWS Glue console.

Example CPU load graph

The CPU Load graph shows the following metrics:


Figure: The graph for CPU Load in the Metrics tab of the AWS Glue console.

Example Job execution graph

The Job Execution graph shows the following metrics:


Figure: The graph for Job Execution in the Metrics tab of the AWS Glue console.