Amazon EMR
Amazon EMR Release Guide

Configuring Tez

You can customize Tez by setting values using the tez-site configuration classification when you create your cluster, which configures settings in the tez-site.xml configuration file. For more information, see TezConfiguration in the Apache Tez documentation. To change Hive or Pig to use the Tez execution engine, use the hive-site and pig-properties configuration classifications as appropriate. Examples are shown below.

Example: Customizing the Tez Root Logging Level and Setting Tez as the Execution Engine for Hive and Pig

The example create-cluster command shown below creates a cluster with Tez, Hive, and Pig installed. The command references a file stored in Amazon S3, myConfig.json, which specifies properties for the tez-site classification that sets tez.am.log.level to DEBUG, and sets the execution engine to Tez for Hive and Pig using the hive-site and pig-properties configuration classifications.

Note

Linux line continuation characters (\) are included for readability. They can be removed or used in Linux commands. For Windows, remove them or replace with a caret (^).

aws emr create-cluster --release-label emr-5.19.0 \ --applications Name=Tez Name=Hive Name=Pig --ec2-attributes KeyName=myKey \ --instance-type m4.large --instance-count 3 \ --configurations https://s3.amazonaws.com/mybucket/myfolder/myConfig.json --use-default-roles

Example contents of myConfig.json are shown below.

[ { "Classification": "tez-site", "Properties": { "tez.am.log.level": "DEBUG" } }, { "Classification": "hive-site", "Properties": { "hive.execution.engine": "tez" } }, { "Classification": "pig-properties", "Properties": { "exectype": "tez" } } ]