Amazon EMR - AWS GovCloud (US)

Amazon EMR

Amazon EMR is a web service that makes it easy to process large amounts of data efficiently. Amazon EMR combines Hadoop processing with several AWS products for tasks such as web indexing, data mining, log file analysis, machine learning, scientific simulation, and data warehousing.

For release history, see Amazon EMR Release Information.

How Amazon EMR Differs for AWS GovCloud (US)

  • MapR distributions are not currently supported in AWS GovCloud (US) Regions.

  • In AWS GovCloud (US) Regions, you launch all Amazon EMR job flows in Amazon Virtual Private Cloud (Amazon VPC). For information about configuring an Amazon VPC that can run a job flow, see Select an Amazon VPC and Subnet for the Cluster.

  • Launching a job flow with debugging is not currently supported in AWS GovCloud (US) Regions.
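
The VPC requirement in the list above can be sketched as a request for boto3's `run_job_flow` call. This is a minimal sketch, not a definitive launch configuration: the job flow name, subnet ID, release label, instance types, and role names are placeholder assumptions, and the helper `build_job_flow_request` is hypothetical. It only builds the request dictionary, so it can be inspected without AWS credentials. No debugging step is included, because debugging is not supported in these Regions.

```python
def build_job_flow_request(name, subnet_id, release_label="emr-6.15.0"):
    """Build keyword arguments for boto3's EMR run_job_flow call.

    All values here are illustrative assumptions; substitute your own.
    """
    # Every job flow must run in a VPC, so a subnet ID is mandatory here.
    if not subnet_id.startswith("subnet-"):
        raise ValueError("a VPC subnet ID is required, e.g. 'subnet-...'")
    return {
        "Name": name,  # metadata: must not contain export-controlled data
        "ReleaseLabel": release_label,  # assumed label; check availability
        "Instances": {
            "Ec2SubnetId": subnet_id,  # places the cluster in your VPC subnet
            "MasterInstanceType": "m5.xlarge",
            "SlaveInstanceType": "m5.xlarge",
            "InstanceCount": 3,
            "KeepJobFlowAliveWhenNoSteps": False,
        },
        "JobFlowRole": "EMR_EC2_DefaultRole",
        "ServiceRole": "EMR_DefaultRole",
    }


request = build_job_flow_request("nightly-batch", "subnet-0123456789abcdef0")

# To submit the request (requires boto3 and valid credentials):
# import boto3
# emr = boto3.client("emr", region_name="us-gov-west-1")
# response = emr.run_job_flow(**request)
```

Building the request separately from the API call makes it easy to review the metadata fields before anything is sent to the service.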

Documentation for Amazon EMR

Amazon EMR documentation.

Export-Controlled Content

For AWS Services architected within the AWS GovCloud (US) Regions, the table below explains how certain components of data may leave the Regions in the normal course of the Service Offerings. The table can be used as a guide to help meet applicable customer compliance obligations.

Data in the following service attributes will not leave the AWS GovCloud (US) Regions in the normal course of the Service Offerings:

  • All input and output data that is entered, stored, and processed in Amazon EMR can contain export-controlled data.

Data in the following service attributes may leave the AWS GovCloud (US) Regions in the normal course of the Service Offerings:

  • Amazon EMR metadata is not permitted to contain export-controlled data. This metadata includes all configuration data that you enter when creating and maintaining your job flows.

  • Do not enter export-controlled data in Amazon EMR when doing the following:

    • Naming a job flow

    • Specifying a file location

    • Naming a bootstrap action

    • Providing arguments

    • Adding resource tags

  • Do not print export-controlled data to your logs. Amazon EMR metadata and logs are not permitted to contain export-controlled data.
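
One way to act on the metadata restrictions above is to scan job flow metadata before submitting it. The following is a minimal sketch under stated assumptions: the helper `find_flagged_fields`, the field names, and the marker string are all hypothetical, and a simple substring match is not a substitute for a real compliance review.

```python
def find_flagged_fields(metadata, markers):
    """Return the names of metadata fields that contain any marker string."""
    flagged = []
    for field, value in metadata.items():
        # Normalize tags (dict) and arguments (list) into a list of strings.
        if isinstance(value, dict):
            texts = [str(x) for pair in value.items() for x in pair]
        elif isinstance(value, (list, tuple)):
            texts = [str(x) for x in value]
        else:
            texts = [str(value)]
        # Case-insensitive substring match against every marker.
        if any(m.lower() in t.lower() for t in texts for m in markers):
            flagged.append(field)
    return flagged


# Hypothetical metadata covering the fields named in the list above.
metadata = {
    "name": "nightly-batch",
    "log_uri": "s3://example-bucket/logs/",
    "bootstrap_action_name": "install-deps",
    "arguments": ["--input", "s3://example-bucket/project-falcon/in/"],
    "tags": {"team": "analytics"},
}

# "project-falcon" stands in for a term you consider export-controlled.
flagged = find_flagged_fields(metadata, markers=["project-falcon"])
print(flagged)  # ['arguments']
```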

If you are processing export-controlled data with this service, use the SSL (HTTPS) endpoint to maintain export compliance. For a list of endpoints, see Service Endpoints.
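
The HTTPS requirement can be enforced in client code with a small check. This is a sketch, not an authoritative configuration: the hostname pattern `elasticmapreduce.<region>.amazonaws.com` is an assumption that should be confirmed against Service Endpoints, and both helper functions are hypothetical.

```python
from urllib.parse import urlparse


def emr_endpoint_url(region):
    """Return an HTTPS endpoint URL for Amazon EMR (assumed hostname pattern)."""
    return f"https://elasticmapreduce.{region}.amazonaws.com"


def require_https(url):
    """Return the URL unchanged, or raise ValueError if it is not HTTPS."""
    if urlparse(url).scheme != "https":
        raise ValueError(f"endpoint must use HTTPS: {url}")
    return url


endpoint = require_https(emr_endpoint_url("us-gov-west-1"))

# To use the endpoint (requires boto3 and valid credentials):
# import boto3
# emr = boto3.client("emr", region_name="us-gov-west-1", endpoint_url=endpoint)
```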