Apache Oozie
Use the Apache Oozie Workflow Scheduler to manage and coordinate Hadoop jobs. For more
information, see http://oozie.apache.org/
The Oozie native web interface is not supported on Amazon EMR. To use a front-end interface for Oozie, try the Hue Oozie application. For more information, see Hue. Oozie is included with Amazon EMR release version 5.0.0 and later. Oozie is included as a sandbox application in earlier releases. For more information, see Amazon EMR 4.x release versions.
If you use a custom Amazon Linux AMI based on an Amazon Linux AMI with a creation date of 2018-08-11, the Oozie server fails to start. If you use Oozie, create a custom AMI based on an Amazon Linux AMI ID with a different creation date. You can use the following AWS CLI command to return a list of Image IDs for all HVM Amazon Linux AMIs with a 2018.03 version, along with the release date, so that you can choose an appropriate Amazon Linux AMI as your base. Replace MyRegion with your Region identifier, such as us-west-2.
aws ec2 --region
MyRegion
describe-images --owner amazon --query 'Images[?Name!=`null`]|[?starts_with(Name, `amzn-ami-hvm-2018.03`) == `true`].[CreationDate,ImageId,Name]' --output text | sort -rk1
The following table lists the version of Oozie included in the latest release of the Amazon EMR 7.x series, along with the components that Amazon EMR installs with Oozie.
For the version of components installed with Oozie in this release, see Release 7.3.0 Component Versions.
Amazon EMR Release Label | Oozie Version | Components Installed With Oozie |
---|---|---|
emr-7.3.0 |
Oozie 5.2.1 |
emrfs, emr-ddb, emr-goodies, emr-kinesis, emr-s3-dist-cp, hadoop-client, hadoop-mapred, hadoop-hdfs-datanode, hadoop-hdfs-library, hadoop-hdfs-namenode, hadoop-httpfs-server, hadoop-kms-server, hadoop-yarn-nodemanager, hadoop-yarn-resourcemanager, hadoop-yarn-timeline-server, oozie-client, oozie-server, tez-on-yarn, tez-on-worker |
The following table lists the version of Oozie included in the latest release of the Amazon EMR 6.x series, along with the components that Amazon EMR installs with Oozie.
For the version of components installed with Oozie in this release, see Release 6.15.0 Component Versions.
Amazon EMR Release Label | Oozie Version | Components Installed With Oozie |
---|---|---|
emr-6.15.0 |
Oozie 5.2.1 |
emrfs, emr-ddb, emr-goodies, emr-kinesis, emr-s3-dist-cp, hadoop-client, hadoop-mapred, hadoop-hdfs-datanode, hadoop-hdfs-library, hadoop-hdfs-namenode, hadoop-httpfs-server, hadoop-kms-server, hadoop-yarn-nodemanager, hadoop-yarn-resourcemanager, hadoop-yarn-timeline-server, oozie-client, oozie-server, tez-on-yarn, tez-on-worker |
The following table lists the version of Oozie included in the latest release of the Amazon EMR 5.x series, along with the components that Amazon EMR installs with Oozie.
For the version of components installed with Oozie in this release, see Release 5.36.2 Component Versions.
Amazon EMR Release Label | Oozie Version | Components Installed With Oozie |
---|---|---|
emr-5.36.2 |
Oozie 5.2.1 |
emrfs, emr-ddb, emr-goodies, emr-kinesis, emr-s3-dist-cp, hadoop-client, hadoop-mapred, hadoop-hdfs-datanode, hadoop-hdfs-library, hadoop-hdfs-namenode, hadoop-httpfs-server, hadoop-kms-server, hadoop-yarn-nodemanager, hadoop-yarn-resourcemanager, hadoop-yarn-timeline-server, oozie-client, oozie-server, tez-on-yarn |