Application support and limitations - Amazon EMR

Application support and limitations

Supported applications

The integration between Amazon EMR and Apache Ranger in which EMR installs Ranger plugins currently supports the following applications:

  • Apache Spark (Available with EMR 5.32+ and EMR 6.3+)

  • Apache Hive (Available with EMR 5.32+ and EMR 6.3+)

  • S3 Access through EMRFS (Available with EMR 5.32+ and EMR 6.3+)

The following applications can be installed on an EMR cluster and may need to be configured to meet your security needs:

  • Apache Hadoop (Available with EMR 5.32+ and EMR 6.3+ including YARN and HDFS)

  • Apache Livy (Available with EMR 5.32+ and EMR 6.3+)

  • Apache Zeppelin (Available with EMR 5.32+ and EMR 6.3+)

  • Apache Hue (Available with EMR 5.32+ and EMR 6.3+)

  • Ganglia (Available with EMR 5.32+ and EMR 6.3+)

  • HCatalog (Available with EMR 5.32+ and EMR 6.3+)

  • Mahout (Available with EMR 5.32+ and EMR 6.3+)

  • MXNet (Available with EMR 5.32+ and EMR 6.3+)

  • TensorFlow (Available with EMR 5.32+ and EMR 6.3+)

  • Tez (Available with EMR 5.32+ and EMR 6.3+)

  • Trino (Available with EMR 6.7+)

  • ZooKeeper (Available with EMR 5.32+ and EMR 6.3+)

Important

Applications listed above are the only applications that are currently supported. To ensure cluster security, you are allowed to create an EMR cluster with only the applications in the above list when Apache Ranger is enabled.

Other applications are currently not supported. To ensure the security of your cluster, attempting to install other applications will cause the rejection your cluster.

Supported Features

The following Amazon EMR features can be used with Amazon EMR and Apache Ranger:

  • Encryption at rest and in transit

  • Kerberos authentication (required)

  • Instance groups, instance fleets, and Spot Instances

  • Reconfiguration of applications on a running cluster

  • EMRFS server-side encryption (SSE)

Note

Amazon EMR encryption settings govern SSE. For more information, see Encryption Options.

Application limitations

There are several limitations to keep in mind when you integrate Amazon EMR and Apache Ranger:

  • You cannot currently use the console to create a security configuration that specifies the AWS Ranger integration option in the AWS GovCloud (US) Region. Security configuration can be done using the CLI.

  • Kerberos must be installed on your cluster.

  • Application UIs (user interfaces) such as the YARN Resource Manager UI, HDFS NameNode UI, and Livy UI are not set with authentication by default.

  • The HDFS default permissions umask are configured so that objects created are set to world wide readable by default.

  • Amazon EMR doesn't support high-availability (multiple primary) mode with Apache Ranger.

  • For additional limitations, see limitations for each application.

Note

Amazon EMR encryption settings govern SSE. For more information, see Encryption Options.

Plugin limitations

Each plugin has specific limitations. For the Apache Hive plugin's limitations, see Apache Hive plugin limitations. For the Apache Spark plugin's limitations, see Apache Spark plugin limitations. For the EMRFS S3 plugin's limitations, see EMRFS S3 plugin limitations.