Amazon Elastic MapReduce
Developer Guide (API Version 2009-03-31)
« PreviousNext »
View the PDF for this guide.Go to the AWS Discussion Forum for this product.Go to the Kindle Store to download this guide in Kindle format.Did this page help you?  Yes | No |  Tell us about it...

Hadoop Configuration (AMI 2.2)

The following Amazon Elastic MapReduce (Amazon EMR) default configuration settings for clusters launched with Amazon EMR AMI 2.2 are appropriate for most workloads.

If your cluster tasks are memory-intensive, you can enhance performance by using fewer tasks per core node and reducing your job tracker heap size.

The following tables list the default configuration settings for each EC2 instance type in clusters launched with the Amazon EMR AMI version 2.2. For more information about the AMI versions supported by Amazon EMR, see Choose a Machine Image .

m1.small

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 576
HADOOP_NAMENODE_HEAPSIZE 192
HADOOP_TASKTRACKER_HEAPSIZE 192
HADOOP_DATANODE_HEAPSIZE 96
mapred.child.java.opts-Xmx288m
mapred.tasktracker.map.tasks.maximum 2
mapred.tasktracker.reduce.tasks.maximum 1

m1.medium

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 1152
HADOOP_NAMENODE_HEAPSIZE 384
HADOOP_TASKTRACKER_HEAPSIZE 192
HADOOP_DATANODE_HEAPSIZE 192
mapred.child.java.opts-Xmx576m
mapred.tasktracker.map.tasks.maximum 2
mapred.tasktracker.reduce.tasks.maximum 1

m1.large

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 2304
HADOOP_NAMENODE_HEAPSIZE 768
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx864m
mapred.tasktracker.map.tasks.maximum 3
mapred.tasktracker.reduce.tasks.maximum 1

m1.xlarge

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 6912
HADOOP_NAMENODE_HEAPSIZE 2304
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx768m
mapred.tasktracker.map.tasks.maximum 8
mapred.tasktracker.reduce.tasks.maximum 3

c1.medium

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 576
HADOOP_NAMENODE_HEAPSIZE 192
HADOOP_TASKTRACKER_HEAPSIZE 192
HADOOP_DATANODE_HEAPSIZE 96
mapred.child.java.opts -Xmx288m
mapred.tasktracker.map.tasks.maximum 2
mapred.tasktracker.reduce.tasks.maximum 1

c1.xlarge

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 2304
HADOOP_NAMENODE_HEAPSIZE 768
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx384m
mapred.tasktracker.map.tasks.maximum 7
mapred.tasktracker.reduce.tasks.maximum 2

m2.xlarge

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 9216
HADOOP_NAMENODE_HEAPSIZE 3072
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx2304m
mapred.tasktracker.map.tasks.maximum 3
mapred.tasktracker.reduce.tasks.maximum 1

m2.2xlarge

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 18432
HADOOP_NAMENODE_HEAPSIZE 6144
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx2688m
mapred.tasktracker.map.tasks.maximum 6
mapred.tasktracker.reduce.tasks.maximum 2

m2.4xlarge

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 36864
HADOOP_NAMENODE_HEAPSIZE 12288
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx2304m
mapred.tasktracker.map.tasks.maximum 14
mapred.tasktracker.reduce.tasks.maximum 4

cc1.4xlarge

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 7680
HADOOP_NAMENODE_HEAPSIZE 3840
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx912m
mapred.tasktracker.map.tasks.maximum 12
mapred.tasktracker.reduce.tasks.maximum 3

cc2.8xlarge

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 30114
HADOOP_NAMENODE_HEAPSIZE 12288
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx1536m
mapred.tasktracker.map.tasks.maximum 24
mapred.tasktracker.reduce.tasks.maximum 6

cg1.4xlarge

ParameterValue
HADOOP_JOBTRACKER_HEAPSIZE 7680
HADOOP_NAMENODE_HEAPSIZE 3840
HADOOP_TASKTRACKER_HEAPSIZE 384
HADOOP_DATANODE_HEAPSIZE 384
mapred.child.java.opts -Xmx864m
mapred.tasktracker.map.tasks.maximum 12
mapred.tasktracker.reduce.tasks.maximum 3