Menu
Amazon EMR
Amazon EMR Release Guide

Ganglia

The Ganglia open source project is a scalable, distributed system designed to monitor clusters and grids while minimizing the impact on their performance. When you enable Ganglia on your cluster, you can generate reports and view the performance of the cluster as a whole, as well as inspect the performance of individual node instances. Ganglia is also configured to ingest and visualize Hadoop and Spark metrics. For more information about the Ganglia open-source project, go to http://ganglia.info/.

When you view the Ganglia web UI in a browser, you see an overview of the cluster’s performance, with graphs detailing the load, memory usage, CPU utilization, and network traffic of the cluster. Below the cluster statistics are graphs for each individual server in the cluster.

The following table lists the version of Ganglia included in the latest release of Amazon EMR, along with the components that Amazon EMR installs with Ganglia.

For the version of components installed with Ganglia in this release, see Release 5.16.0 Component Versions.

Ganglia Version Information for emr-5.16.0

Amazon EMR Release Label Ganglia Version Components Installed With Ganglia

emr-5.16.0

Ganglia 3.7.2

emrfs, emr-goodies, ganglia-monitor, ganglia-metadata-collector, ganglia-web, hadoop-client, hadoop-hdfs-datanode, hadoop-hdfs-library, hadoop-hdfs-namenode, hadoop-kms-server, hadoop-yarn-nodemanager, hadoop-yarn-resourcemanager, hadoop-yarn-timeline-server, webserver