Amazon Elastic MapReduce
Developer Guide (API Version 2009-03-31)

Hadoop Configuration (AMI 1.0)

The following Amazon EMR default configuration settings are appropriate for most workloads.

If your cluster tasks are memory-intensive, you can improve performance by running fewer tasks per core node and reducing the job tracker heap size. These and other settings for memory-intensive workloads are described in Hadoop Memory-Intensive Configuration Settings (Legacy AMI 1.0.1 and earlier).
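The trade-off behind running fewer tasks per node can be sketched with simple arithmetic. The snippet below is only an illustration: it assumes the combined heap given to task JVMs on a node stays fixed while the slot count is lowered, which is a simplification rather than documented Amazon EMR behavior. The function name `heap_per_task_mb` is hypothetical; the numbers are the m1.xlarge defaults listed on this page (8 map slots, 4 reduce slots, -Xmx1600m).

```python
def heap_per_task_mb(total_child_heap_mb, map_slots, reduce_slots):
    """Split a fixed heap budget (MB) evenly across all task slots on a node.

    Hypothetical helper: assumes the budget is constant and shared equally.
    """
    slots = map_slots + reduce_slots
    if slots <= 0:
        raise ValueError("at least one task slot is required")
    return total_child_heap_mb // slots


# Budget implied by the m1.xlarge defaults: (8 map + 4 reduce) slots x 1600 MB.
default_budget_mb = (8 + 4) * 1600

# Halving the slot count doubles the heap that each task could be given.
print(heap_per_task_mb(default_budget_mb, 8, 4))  # 1600
print(heap_per_task_mb(default_budget_mb, 4, 2))  # 3200
```

Under that assumption, halving the slots on an m1.xlarge would leave room to raise the value in mapred.child.java.opts from -Xmx1600m to roughly -Xmx3200m.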

The following tables list the default configuration settings for each EC2 instance type in clusters launched with Amazon EMR AMI version 1.0. For more information about the AMI versions supported by Amazon EMR, see Choose an Amazon Machine Image (AMI).

m1.small

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 768
HADOOP_NAMENODE_HEAPSIZE 256
HADOOP_TASKTRACKER_HEAPSIZE 512
HADOOP_DATANODE_HEAPSIZE 128
mapred.child.java.opts -Xmx725m
mapred.tasktracker.map.tasks.maximum 2
mapred.tasktracker.reduce.tasks.maximum 1

m1.medium

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 1536
HADOOP_NAMENODE_HEAPSIZE 512
HADOOP_TASKTRACKER_HEAPSIZE 256
HADOOP_DATANODE_HEAPSIZE 256
mapred.child.java.opts -Xmx1152m
mapred.tasktracker.map.tasks.maximum 2
mapred.tasktracker.reduce.tasks.maximum 1

m1.large

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 3072
HADOOP_NAMENODE_HEAPSIZE 1024
HADOOP_TASKTRACKER_HEAPSIZE 1536
HADOOP_DATANODE_HEAPSIZE 256
mapred.child.java.opts -Xmx1600m
mapred.tasktracker.map.tasks.maximum 4
mapred.tasktracker.reduce.tasks.maximum 2

m1.xlarge

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 9216
HADOOP_NAMENODE_HEAPSIZE 3072
HADOOP_TASKTRACKER_HEAPSIZE 3072
HADOOP_DATANODE_HEAPSIZE 512
mapred.child.java.opts -Xmx1600m
mapred.tasktracker.map.tasks.maximum 8
mapred.tasktracker.reduce.tasks.maximum 4

c1.medium

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 768
HADOOP_NAMENODE_HEAPSIZE 256
HADOOP_TASKTRACKER_HEAPSIZE 512
HADOOP_DATANODE_HEAPSIZE 256
mapred.child.java.opts -Xmx362m
mapred.tasktracker.map.tasks.maximum 4
mapred.tasktracker.reduce.tasks.maximum 2

c1.xlarge

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 3072
HADOOP_NAMENODE_HEAPSIZE 1024
HADOOP_TASKTRACKER_HEAPSIZE 1536
HADOOP_DATANODE_HEAPSIZE 512
mapred.child.java.opts -Xmx747m
mapred.tasktracker.map.tasks.maximum 8
mapred.tasktracker.reduce.tasks.maximum 4

m2.xlarge

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 12288
HADOOP_NAMENODE_HEAPSIZE 4096
HADOOP_TASKTRACKER_HEAPSIZE 3072
HADOOP_DATANODE_HEAPSIZE 512
mapred.child.java.opts -Xmx2048m
mapred.tasktracker.map.tasks.maximum 4
mapred.tasktracker.reduce.tasks.maximum 2

m2.2xlarge

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 24576
HADOOP_NAMENODE_HEAPSIZE 8192
HADOOP_TASKTRACKER_HEAPSIZE 3072
HADOOP_DATANODE_HEAPSIZE 1024
mapred.child.java.opts -Xmx3200m
mapred.tasktracker.map.tasks.maximum 8
mapred.tasktracker.reduce.tasks.maximum 4

m2.4xlarge

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 49152
HADOOP_NAMENODE_HEAPSIZE 16384
HADOOP_TASKTRACKER_HEAPSIZE 3072
HADOOP_DATANODE_HEAPSIZE 2048
mapred.child.java.opts -Xmx3733m
mapred.tasktracker.map.tasks.maximum 16
mapred.tasktracker.reduce.tasks.maximum 8

cg1.4xlarge

Parameter Value
HADOOP_JOBTRACKER_HEAPSIZE 10240
HADOOP_NAMENODE_HEAPSIZE 5120
HADOOP_TASKTRACKER_HEAPSIZE 512
HADOOP_DATANODE_HEAPSIZE 512
mapred.child.java.opts -Xmx1024m
mapred.tasktracker.map.tasks.maximum 12
mapred.tasktracker.reduce.tasks.maximum 3
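
To compare the instance types at a glance, the task-related defaults above can be collected into one lookup structure. The following is a minimal sketch, not part of any Amazon EMR tooling: the names `AMI_1_0_DEFAULTS` and `max_child_heap_mb` are hypothetical, and the values are transcribed from the tables on this page. It computes the combined heap that task JVMs could request if every map and reduce slot on a node were busy at once.

```python
# Per instance type: (map slots, reduce slots, child JVM heap in MB),
# transcribed from the AMI 1.0 tables on this page.
AMI_1_0_DEFAULTS = {
    "m1.small": (2, 1, 725),
    "m1.medium": (2, 1, 1152),
    "m1.large": (4, 2, 1600),
    "m1.xlarge": (8, 4, 1600),
    "c1.medium": (4, 2, 362),
    "c1.xlarge": (8, 4, 747),
    "m2.xlarge": (4, 2, 2048),
    "m2.2xlarge": (8, 4, 3200),
    "m2.4xlarge": (16, 8, 3733),
    "cg1.4xlarge": (12, 3, 1024),
}


def max_child_heap_mb(instance_type):
    """Combined task JVM heap (MB) if all map and reduce slots are in use."""
    map_slots, reduce_slots, child_heap_mb = AMI_1_0_DEFAULTS[instance_type]
    return (map_slots + reduce_slots) * child_heap_mb


if __name__ == "__main__":
    for name in AMI_1_0_DEFAULTS:
        print(f"{name:12s} {max_child_heap_mb(name):6d} MB")
```

For example, `max_child_heap_mb("m1.small")` evaluates to (2 + 1) x 725 = 2175. The figure covers task JVM heap only; the daemon heap sizes (job tracker, name node, task tracker, data node) come on top of it.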