Step 2: Launch Your Sample Amazon EMR Cluster
In this step, you launch your sample cluster by using the Amazon EMR console. Before you perform this step, make sure that you meet the requirements in Step 1: Set Up Prerequisites for Your Sample Cluster.
Using Quick Cluster Configuration Overview
The following table describes the fields on the Quick cluster configuration page of the Amazon EMR console and the default value of each field when you launch a cluster.
|Console field||Default value||Description|
|Cluster name||My cluster|
The cluster name is an optional, descriptive name for your cluster that does not need to be unique.
This option specifies whether to enable or disable logging. When logging is enabled, Amazon EMR writes detailed log data to a folder in either an S3 bucket that is chosen for you or a bucket that you specify. Logging is an immutable property that can be enabled only when you create the cluster.
This option specifies the path to a folder in an S3 bucket where you want Amazon EMR to write log data. The following example shows a path to an S3 bucket for AWS account ID 111122223333 in the us-east-1 region: s3://aws-logs-111122223333-us-east-1/elasticmapreduce/.
If the folder in the specified path does not exist in the bucket, it is created for you. You can specify a different folder by typing or browsing to a different location.
This option specifies whether to launch an ongoing cluster (Cluster launch mode) or a transient cluster (Step execution launch mode).
This option specifies the vendor from which you want to select the software release and applications for your cluster.
This option specifies the software and Amazon EMR platform components, such as EMRFS, to install on your cluster. Amazon EMR uses the release to initialize the Amazon EC2 instances on which your cluster runs. These releases are specific to Amazon EMR and can be used only in the context of running your Amazon EMR cluster. The latest release label is selected by default.
All applications (for Cluster launch mode); Core Hadoop (for Step execution launch mode)
This option determines the applications to install on your cluster. If you chose Cluster launch mode, you can select the applications to install. If you chose Step execution launch mode, the list of applications is determined by the steps that you added.
This option determines the Amazon EC2 instance type that Amazon EMR initializes for the instances that run in your cluster.
|Number of instances||3|
This option determines the number of Amazon EC2 instances to initialize. Each instance corresponds to a node in the Amazon EMR cluster. You must have at least one node.
|EC2 key pair||Select an option|
This option specifies the Amazon EC2 key pair to use when connecting to the nodes in your cluster using Secure Shell (SSH). If you do not select a key pair, you cannot connect to the nodes using SSH.
This option configures permissions for your Amazon EMR cluster. These permissions are granted using policies that are applied to two IAM roles: the role for the Amazon EMR service and the role for your Amazon EC2 instance profile.
With Default permissions, the IAM roles use the following AWS managed policies: AmazonElasticMapReduceRole for the Amazon EMR service and AmazonElasticMapReduceforEC2Role for your instance profile. You can choose View policy for EMR role or View policy for EC2 instance profile to view these policies.
With Custom permissions, you must select existing roles. The policies attached to those roles determine the permissions for Amazon EMR and your Amazon EC2 instance profile.
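The log folder path described in the table follows a predictable pattern built from your AWS account ID and Region. The following Python sketch reproduces that pattern so that you can check the path you expect to see in the console. The function name `default_log_uri` is hypothetical and is not part of any AWS SDK; the 12-digit check reflects the standard length of an AWS account ID.

```python
def default_log_uri(account_id: str, region: str) -> str:
    """Build a log folder path in the pattern shown in the table example.

    Hypothetical helper for illustration only; it does not call AWS
    and does not verify that the bucket exists.
    """
    # AWS account IDs are 12 digits long.
    if not (account_id.isdigit() and len(account_id) == 12):
        raise ValueError("expected a 12-digit AWS account ID")
    if not region:
        raise ValueError("expected a Region name such as us-east-1")
    return f"s3://aws-logs-{account_id}-{region}/elasticmapreduce/"


# The example values from the table.
print(default_log_uri("111122223333", "us-east-1"))
# prints: s3://aws-logs-111122223333-us-east-1/elasticmapreduce/
```

If you specify your own folder instead, use that path rather than the generated one.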
Launch the Sample Cluster
Perform the following steps to launch your sample cluster. Unless otherwise specified in the procedure, use the default values as described in the preceding table.
To launch an Amazon EMR cluster
Sign in to the AWS Management Console and open the Amazon EMR console at https://console.aws.amazon.com/elasticmapreduce/.
Choose Create cluster.
On the Quick cluster configuration page, accept the default values except for the following fields:
Choose Create cluster.
Proceed to the next step.
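This tutorial uses the console, but the same fields can also be expressed as a request to the Amazon EMR API. The following Python sketch maps the console fields from the preceding table to the parameters of the boto3 `run_job_flow` operation. It only builds the parameter dictionary and does not contact AWS. The function name `quick_cluster_request` is hypothetical, the release label, instance type, and key pair name in the usage example are placeholders, and the role names `EMR_DefaultRole` and `EMR_EC2_DefaultRole` are assumed to already exist in your account. The sketch leaves out the application list.

```python
from typing import Any, Dict, Optional


def quick_cluster_request(
    log_uri: str,
    release_label: str,
    instance_type: str,
    ec2_key_name: Optional[str] = None,
    name: str = "My cluster",
    instance_count: int = 3,
    keep_alive: bool = True,
) -> Dict[str, Any]:
    """Return run_job_flow parameters that mirror the console fields.

    Hypothetical helper for illustration; it does not call AWS.
    """
    # The table states that a cluster must have at least one node.
    if instance_count < 1:
        raise ValueError("a cluster needs at least one node")

    instances: Dict[str, Any] = {
        "MasterInstanceType": instance_type,
        "SlaveInstanceType": instance_type,
        "InstanceCount": instance_count,
        # True keeps the cluster running after its steps finish (ongoing);
        # False lets it terminate when the steps complete (transient).
        "KeepJobFlowAliveWhenNoSteps": keep_alive,
    }
    # Without a key pair, SSH connections to the nodes are not possible.
    if ec2_key_name:
        instances["Ec2KeyName"] = ec2_key_name

    return {
        "Name": name,
        "LogUri": log_uri,
        "ReleaseLabel": release_label,
        "Instances": instances,
        # Assumed role names; replace them if you use custom roles.
        "ServiceRole": "EMR_DefaultRole",
        "JobFlowRole": "EMR_EC2_DefaultRole",
    }


# Placeholder values; substitute your own release, instance type, and key pair.
params = quick_cluster_request(
    log_uri="s3://aws-logs-111122223333-us-east-1/elasticmapreduce/",
    release_label="emr-5.36.0",
    instance_type="m5.xlarge",
    ec2_key_name="my-key-pair",
)
print(params["Name"], params["Instances"]["InstanceCount"])
```

With boto3 installed and credentials configured, the request could then be sent with `boto3.client("emr").run_job_flow(**params)`. Review the values first, because launching a cluster incurs charges.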