Amazon Elastic MapReduce
Developer Guide (API Version 2009-03-31)

Hadoop 0.20 Streaming Configuration

Hadoop 0.20 and later supports the three streaming parameters described in the following table, in addition to the version 0.18 parameters.

Parameter       Description
-files          Specifies comma-separated files to copy to the MapReduce cluster.
-archives       Specifies comma-separated archives to unpack on the compute machines.
-D KEY=VALUE    Sets a Hadoop configuration variable. KEY is the name of a Hadoop configuration variable, such as mapred.map.tasks, and VALUE is the new value.

The -files and -archives parameters are similar to the -cacheFile and -cacheArchive parameters of Hadoop 0.18, except that they accept comma-separated values.
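
As an illustration of how the three parameters combine, the following Python sketch assembles them into an argument list for a streaming job. The helper name build_generic_args, the file and archive names, and the value 4 are hypothetical and are not part of Amazon Elastic MapReduce or Hadoop. Only the -files, -archives, and -D KEY=VALUE forms come from the table above.

```python
def build_generic_args(files=(), archives=(), conf=None):
    """Return Hadoop 0.20 streaming arguments as a list of strings.

    Hypothetical helper for illustration only.
    """
    args = []
    if files:
        # -files takes one comma-separated value, not one flag per file.
        args += ["-files", ",".join(files)]
    if archives:
        # -archives also takes a single comma-separated value.
        args += ["-archives", ",".join(archives)]
    for key, value in (conf or {}).items():
        # Each configuration variable gets its own -D KEY=VALUE pair.
        args += ["-D", f"{key}={value}"]
    return args


# Hypothetical file and archive names, chosen only for this example.
example = build_generic_args(
    files=["lookup.txt", "stopwords.txt"],
    archives=["dictionaries.zip"],
    conf={"mapred.map.tasks": 4},
)
print(" ".join(example))
```

The resulting list can be supplied wherever the arguments for a streaming job are specified. In stock Hadoop streaming, these generic parameters are expected to appear before streaming-specific parameters such as -input or -mapper.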