Plan, configure and launch Amazon EMR clusters
This section explains configuration options and instructions for planning, configuring, and launching clusters using Amazon EMR. Before you launch a cluster, you make choices about your system based on the data that you're processing and your requirements for cost, speed, capacity, availability, security, and manageability. Your choices include:
-
What region to run a cluster in, where and how to store data, and how to output results. See Configure cluster location and data storage.
-
Whether you are running Amazon EMR clusters on Outposts or Local Zones. See EMR clusters on AWS Outposts or EMR clusters on AWS Local Zones.
-
Whether a cluster is long-running or transient, and what software it runs. See Configuring a cluster to continue or terminate after step execution and Configure applications when you launch your cluster.
-
Whether a cluster has a single primary node or three primary nodes. See Plan and configure primary nodes in your Amazon EMR cluster.
-
The hardware and networking options that optimize cost, performance, and availability for your application. See Configure cluster hardware and networking.
-
How to set up clusters so you can manage them more easily, and monitor activity, performance, and health. See Configure cluster logging and debugging and Tag and categorize cluster resources.
-
How to authenticate and authorize access to cluster resources, and how to encrypt data. See Security in Amazon EMR.
-
How to integrate with other software and services. See Drivers and third-party application integration.