Troubleshoot a slow Amazon EMR cluster
This section walks you through the process of troubleshooting a cluster that is still running, but is taking a long time to return results. For more information about what to do if the cluster has terminated with an error code, see Troubleshoot an Amazon EMR cluster that has failed with an error code
Amazon EMR enables you to specify the number and kind of instances in the cluster. These specifications are the primary means of affecting the speed with which your data processing completes. One thing you might consider is re-running the cluster, this time specifying EC2 instances with greater resources, or specifying a larger number of instances in the cluster. For more information, see Configure Amazon EMR cluster hardware and networking.
The following topics walk you through the process of identifying alternative causes of a slow cluster.
Topics
- Step 1: Gather data about the issue with the Amazon EMR cluster
- Step 2: Check the EMR cluster environment
- Step 3: Examine the log files for the Amazon EMR cluster
- Step 4: Check Amazon EMR cluster and instance health
- Step 5: Check for suspended groups
- Step 6: Review configuration settings for the Amazon EMR cluster
- Step 7: Examine input data for the Amazon EMR cluster