Amazon EMR - AWS GovCloud (US)

Amazon EMR

Amazon EMR is a cloud big data platform for running large-scale distributed data processing jobs, interactive SQL queries, and machine learning (ML) applications using open-source analytics frameworks such as Apache Spark, Apache Hive, and Presto.

For information related to Release history, refer to Amazon EMR Release Information.

How Amazon EMR Differs for AWS GovCloud (US)

  • MapR distributions are currently not supported in AWS GovCloud (US) Regions.

  • In AWS GovCloud (US) Regions, you launch all Amazon EMR job flows in Amazon Virtual Private Cloud (Amazon VPC). For information about configuring an Amazon VPC that can run a job flow, see Select an Amazon VPC and Subnet for the Cluster.

  • Launching a job flow with debugging is not currently supported in AWS GovCloud (US) Regions.

  • Auto-termination for idle clusters using an auto-termination policy is not available in AWS GovCloud (US) Regions.

  • Amazon EMR Studio is not available in AWS GovCloud (US) Regions.

  • Amazon EMR on EKS on Fargate is not available in AWS GovCloud (US) Regions.

  • Amazon EMR notebooks are not available in AWS GovCloud (US) Regions.

  • Amazon EMR with AWS Lake Formation is not available in AWS GovCloud (US) Regions.

Documentation for Amazon EMR

Amazon EMR documentation.

Export-Controlled Content

For AWS Services architected within the AWS GovCloud (US) Regions, the following list explains how certain components of data may leave the AWS GovCloud (US) Regions in the normal course of the service offerings. The list can be used as a guide to help meet applicable customer compliance obligations. Data not included in the following list remains within the AWS GovCloud (US) Regions.

  • Amazon EMR metadata is not permitted to contain export-controlled data. This metadata includes all configuration data that you enter when creating and maintaining your job flows.

  • Do not enter export-controlled data in Amazon EMR when doing the following:

    • Naming a job flow

    • Specifying a file location

    • Naming a bootstrap action

    • Providing arguments

    • Resource tags

  • Export-controlled data should not be printed to your logs. (Amazon EMR metadata and logs are not permitted to contain export-controlled data.)

If you are processing export-controlled data with this service, use the SSL (HTTPS) endpoint to maintain export compliance. For more information, see Service Endpoints.