AWS SDK Version 3 for .NET
API Reference

AWS services or capabilities described in AWS Documentation may vary by region/location. Click Getting Started with Amazon AWS to see specific differences applicable to the China (Beijing) Region.

Classes

NameDescription
Class AddInstanceFleetRequest

Container for the parameters to the AddInstanceFleet operation. Adds an instance fleet to a running cluster.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x.

Class AddInstanceFleetResponse

This is the response object from the AddInstanceFleet operation.

Class AddInstanceGroup

Class representing creating a new instance group.

Class AddInstanceGroupsRequest

Container for the parameters to the AddInstanceGroups operation. Adds one or more instance groups to a running cluster.

Class AddInstanceGroupsResponse

Output from an AddInstanceGroups call.

Class AddJobFlowStepsRequest

Container for the parameters to the AddJobFlowSteps operation. AddJobFlowSteps adds new steps to a running cluster. A maximum of 256 steps are allowed in each job flow.

If your cluster is long-running (such as a Hive data warehouse) or complex, you may require more than 256 steps to process your data. You can bypass the 256-step limitation in various ways, including using SSH to connect to the master node and submitting queries directly to the software running on the master node, such as Hive and Hadoop. For more information on how to do this, see Add More than 256 Steps to a Cluster in the Amazon EMR Management Guide.

A step specifies the location of a JAR file stored either on the master node of the cluster or in Amazon S3. Each step is performed by the main function of the main class of the JAR file. The main class can be specified either in the manifest of the JAR or by using the MainFunction parameter of the step.

Amazon EMR executes each step in the order listed. For a step to be considered complete, the main function must exit with a zero exit code and all Hadoop jobs started while the step was running must have completed and run successfully.

You can only add steps to a cluster that is in one of the following states: STARTING, BOOTSTRAPPING, RUNNING, or WAITING.

Class AddJobFlowStepsResponse

The output for the AddJobFlowSteps operation.

Class AddTagsRequest

Container for the parameters to the AddTags operation. Adds tags to an Amazon EMR resource. Tags make it easier to associate clusters in various ways, such as grouping clusters to track your Amazon EMR resource allocation costs. For more information, see Tagging Amazon EMR Resources.

Class AddTagsResponse

This output indicates the result of adding tags to a resource.

Class Application

An application is any Amazon or third-party software that you can add to the cluster. This structure contains a list of strings that indicates the software to use with the cluster and accepts a user argument list. Amazon EMR accepts and forwards the argument list to the corresponding installation script as bootstrap action argument. For more information, see Using the MapR Distribution for Hadoop. Currently supported values are:

  • "mapr-m3" - launch the cluster using MapR M3 Edition.

  • "mapr-m5" - launch the cluster using MapR M5 Edition.

  • "mapr" with the user arguments specifying "--edition,m3" or "--edition,m5" - launch the cluster using MapR M3 or M5 Edition, respectively.

In Amazon EMR releases 4.x and later, the only accepted parameter is the application name. To pass arguments to applications, you supply a configuration for each application.

Class AutoScalingPolicy

An automatic scaling policy for a core instance group or task instance group in an Amazon EMR cluster. An automatic scaling policy defines how an instance group dynamically adds and terminates EC2 instances in response to the value of a CloudWatch metric. See PutAutoScalingPolicy.

Class AutoScalingPolicyDescription

An automatic scaling policy for a core instance group or task instance group in an Amazon EMR cluster. The automatic scaling policy defines how an instance group dynamically adds and terminates EC2 instances in response to the value of a CloudWatch metric. See PutAutoScalingPolicy.

Class AutoScalingPolicyStateChangeReason

The reason for an AutoScalingPolicyStatus change.

Class AutoScalingPolicyStatus

The status of an automatic scaling policy.

Class BootstrapActionConfig

Configuration of a bootstrap action.

Class BootstrapActionDetail

Reports the configuration of a bootstrap action in a cluster (job flow).

Class BootstrapActions

Class that provides helper methods for constructing predefined bootstrap actions.

Class CancelStepsInfo

Specification of the status of a CancelSteps request. Available only in Amazon EMR version 4.8.0 and later, excluding version 5.0.0.

Class CancelStepsRequest

Container for the parameters to the CancelSteps operation. Cancels a pending step or steps in a running cluster. Available only in Amazon EMR versions 4.8.0 and later, excluding version 5.0.0. A maximum of 256 steps are allowed in each CancelSteps request. CancelSteps is idempotent but asynchronous; it does not guarantee a step will be canceled, even if the request is successfully submitted. You can only cancel steps that are in a PENDING state.

Class CancelStepsResponse

The output for the CancelSteps operation.

Class CloudWatchAlarmDefinition

The definition of a CloudWatch metric alarm, which determines when an automatic scaling activity is triggered. When the defined alarm conditions are satisfied, scaling activity begins.

Class Cluster

The detailed description of the cluster.

Class ClusterStateChangeReason

The reason that the cluster changed to its current state.

Class ClusterStatus

The detailed status of the cluster.

Class ClusterSummary

The summary description of the cluster.

Class ClusterTimeline

Represents the timeline of the cluster's lifecycle.

Class Command

An entity describing an executable that runs on a cluster.

Class Configuration

Amazon EMR releases 4.x or later.

An optional configuration specification to be used when provisioning cluster instances, which can include configurations for applications and software bundled with Amazon EMR. A configuration consists of a classification, properties, and optional nested configurations. A classification refers to an application-specific configuration file. Properties are the settings you want to change in that file. For more information, see Configuring Applications.

Class ConfigureDaemons
Class ConfigureHadoop
Class CreateSecurityConfigurationRequest

Container for the parameters to the CreateSecurityConfiguration operation. Creates a security configuration, which is stored in the service and can be specified when a cluster is created.

Class CreateSecurityConfigurationResponse

This is the response object from the CreateSecurityConfiguration operation.

Class DeleteSecurityConfigurationRequest

Container for the parameters to the DeleteSecurityConfiguration operation. Deletes a security configuration.

Class DeleteSecurityConfigurationResponse

This is the response object from the DeleteSecurityConfiguration operation.

Class DescribeClusterRequest

Container for the parameters to the DescribeCluster operation. Provides cluster-level details including status, hardware and software configuration, VPC settings, and so on. For information about the cluster steps, see ListSteps.

Class DescribeClusterResponse

This output contains the description of the cluster.

Class DescribeJobFlowsRequest

Container for the parameters to the DescribeJobFlows operation. This API is deprecated and will eventually be removed. We recommend you use ListClusters, DescribeCluster, ListSteps, ListInstanceGroups and ListBootstrapActions instead.

DescribeJobFlows returns a list of job flows that match all of the supplied parameters. The parameters can include a list of job flow IDs, job flow states, and restrictions on job flow creation date and time.

Regardless of supplied parameters, only job flows created within the last two months are returned.

If no parameters are supplied, then job flows matching either of the following criteria are returned:

  • Job flows created and completed in the last two weeks

  • Job flows created within the last two months that are in one of the following states: RUNNING, WAITING, SHUTTING_DOWN, STARTING

Amazon EMR can return a maximum of 512 job flow descriptions.

Class DescribeJobFlowsResponse

The output for the DescribeJobFlows operation.

Class DescribeSecurityConfigurationRequest

Container for the parameters to the DescribeSecurityConfiguration operation. Provides the details of a security configuration by returning the configuration JSON.

Class DescribeSecurityConfigurationResponse

This is the response object from the DescribeSecurityConfiguration operation.

Class DescribeStepRequest

Container for the parameters to the DescribeStep operation. Provides more detail about the cluster step.

Class DescribeStepResponse

This output contains the description of the cluster step.

Class EbsBlockDevice

Configuration of requested EBS block device associated with the instance group.

Class EbsBlockDeviceConfig

Configuration of requested EBS block device associated with the instance group with count of volumes that will be associated to every instance.

Class EbsConfiguration

The Amazon EBS configuration of a cluster instance.

Class EbsVolume

EBS block device that's attached to an EC2 instance.

Class Ec2InstanceAttributes

Provides information about the EC2 instances in a cluster grouped by category. For example, key name, subnet ID, IAM instance profile, and so on.

Class FailureDetails

The details of the step failure. The service attempts to detect the root cause for many common failures.

Class HadoopJarStepConfig

A job flow step consisting of a JAR file whose main function will be executed. The main function submits a job for Hadoop to execute and waits for the job to finish or fail.

Class HadoopStepConfig

A cluster step consisting of a JAR file whose main function will be executed. The main function submits a job for Hadoop to execute and waits for the job to finish or fail.

Class Instance

Represents an EC2 instance provisioned as part of cluster.

Class InstanceFleet

Describes an instance fleet, which is a group of EC2 instances that host a particular node type (master, core, or task) in an Amazon EMR cluster. Instance fleets can consist of a mix of instance types and On-Demand and Spot instances, which are provisioned to meet a defined target capacity.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InstanceFleetConfig

The configuration that defines an instance fleet.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InstanceFleetModifyConfig

Configuration parameters for an instance fleet modification request.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InstanceFleetProvisioningSpecifications

The launch specification for Spot instances in the fleet, which determines the defined duration and provisioning timeout behavior.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InstanceFleetStateChangeReason

Provides status change reason details for the instance fleet.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InstanceFleetStatus

The status of the instance fleet.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InstanceFleetTimeline

Provides historical timestamps for the instance fleet, including the time of creation, the time it became ready to run jobs, and the time of termination.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InstanceGroup

This entity represents an instance group, which is a group of instances that have common purpose. For example, CORE instance group is used for HDFS.

Class InstanceGroupConfig

Configuration defining a new instance group.

Class InstanceGroupDetail

Detailed information about an instance group.

Class InstanceGroupModifyConfig

Modify an instance group size.

Class InstanceGroupStateChangeReason

The status change reason details for the instance group.

Class InstanceGroupStatus

The details of the instance group status.

Class InstanceGroupTimeline

The timeline of the instance group lifecycle.

Class InstanceResizePolicy

Custom policy for requesting termination protection or termination of specific instances when shrinking an instance group.

Class InstanceStateChangeReason

The details of the status change reason for the instance.

Class InstanceStatus

The instance status details.

Class InstanceTimeline

The timeline of the instance lifecycle.

Class InstanceTypeConfig

An instance type configuration for each instance type in an instance fleet, which determines the EC2 instances Amazon EMR attempts to provision to fulfill On-Demand and Spot target capacities. There can be a maximum of 5 instance type configurations in a fleet.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InstanceTypeSpecification

The configuration specification for each instance type in an instance fleet.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class InternalServerErrorException

ElasticMapReduce exception

Class InternalServerException

ElasticMapReduce exception

Class InvalidRequestException

ElasticMapReduce exception

Class JobFlowDetail

A description of a cluster (job flow).

Class JobFlowExecutionStatusDetail

Describes the status of the cluster (job flow).

Class JobFlowInstancesConfig

A description of the Amazon EC2 instance on which the cluster (job flow) runs. A valid JobFlowInstancesConfig must contain either InstanceGroups or InstanceFleets, which is the recommended configuration. They cannot be used together. You may also have MasterInstanceType, SlaveInstanceType, and InstanceCount (all three must be present), but we don't recommend this configuration.

Class JobFlowInstancesDetail

Specify the type of Amazon EC2 instances that the cluster (job flow) runs on.

Class KeyValue

A key value pair.

Class ListBootstrapActionsRequest

Container for the parameters to the ListBootstrapActions operation. Provides information about the bootstrap actions associated with a cluster.

Class ListBootstrapActionsResponse

This output contains the bootstrap actions detail.

Class ListClustersRequest

Container for the parameters to the ListClusters operation. Provides the status of all clusters visible to this AWS account. Allows you to filter the list of clusters based on certain criteria; for example, filtering by cluster creation date and time or by status. This call returns a maximum of 50 clusters per call, but returns a marker to track the paging of the cluster list across multiple ListClusters calls.

Class ListClustersResponse

This contains a ClusterSummaryList with the cluster details; for example, the cluster IDs, names, and status.

Class ListInstanceFleetsRequest

Container for the parameters to the ListInstanceFleets operation. Lists all available details about the instance fleets in a cluster.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class ListInstanceFleetsResponse

This is the response object from the ListInstanceFleets operation.

Class ListInstanceGroupsRequest

Container for the parameters to the ListInstanceGroups operation. Provides all available details about the instance groups in a cluster.

Class ListInstanceGroupsResponse

This input determines which instance groups to retrieve.

Class ListInstancesRequest

Container for the parameters to the ListInstances operation. Provides information for all active EC2 instances and EC2 instances terminated in the last 30 days, up to a maximum of 2,000. EC2 instances in any of the following states are considered active: AWAITING_FULFILLMENT, PROVISIONING, BOOTSTRAPPING, RUNNING.

Class ListInstancesResponse

This output contains the list of instances.

Class ListSecurityConfigurationsRequest

Container for the parameters to the ListSecurityConfigurations operation. Lists all the security configurations visible to this account, providing their creation dates and times, and their names. This call returns a maximum of 50 clusters per call, but returns a marker to track the paging of the cluster list across multiple ListSecurityConfigurations calls.

Class ListSecurityConfigurationsResponse

This is the response object from the ListSecurityConfigurations operation.

Class ListStepsRequest

Container for the parameters to the ListSteps operation. Provides a list of steps for the cluster in reverse order unless you specify stepIds with the request.

Class ListStepsResponse

This output contains the list of steps returned in reverse order. This means that the last step is the first element in the list.

Class MetricDimension

A CloudWatch dimension, which is specified using a Key (known as a Name in CloudWatch), Value pair. By default, Amazon EMR uses one dimension whose Key is JobFlowID and Value is a variable representing the cluster ID, which is ${emr.clusterId}. This enables the rule to bootstrap when the cluster ID becomes available.

Class ModifyInstanceFleetRequest

Container for the parameters to the ModifyInstanceFleet operation. Modifies the target On-Demand and target Spot capacities for the instance fleet with the specified InstanceFleetID within the cluster specified using ClusterID. The call either succeeds or fails atomically.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class ModifyInstanceFleetResponse

This is the response object from the ModifyInstanceFleet operation.

Class ModifyInstanceGroup
Class ModifyInstanceGroupsRequest

Container for the parameters to the ModifyInstanceGroups operation. ModifyInstanceGroups modifies the number of nodes and configuration settings of an instance group. The input parameters include the new target instance count for the group and the instance group ID. The call will either succeed or fail atomically.

Class ModifyInstanceGroupsResponse

This is the response object from the ModifyInstanceGroups operation.

Class PlacementType

The Amazon EC2 Availability Zone configuration of the cluster (job flow).

Class PutAutoScalingPolicyRequest

Container for the parameters to the PutAutoScalingPolicy operation. Creates or updates an automatic scaling policy for a core instance group or task instance group in an Amazon EMR cluster. The automatic scaling policy defines how an instance group dynamically adds and terminates EC2 instances in response to the value of a CloudWatch metric.

Class PutAutoScalingPolicyResponse

This is the response object from the PutAutoScalingPolicy operation.

Class RemoveAutoScalingPolicyRequest

Container for the parameters to the RemoveAutoScalingPolicy operation. Removes an automatic scaling policy from a specified instance group within an EMR cluster.

Class RemoveAutoScalingPolicyResponse

This is the response object from the RemoveAutoScalingPolicy operation.

Class RemoveTagsRequest

Container for the parameters to the RemoveTags operation. Removes tags from an Amazon EMR resource. Tags make it easier to associate clusters in various ways, such as grouping clusters to track your Amazon EMR resource allocation costs. For more information, see Tagging Amazon EMR Resources.

The following example removes the stack tag with value Prod from a cluster:

Class RemoveTagsResponse

This output indicates the result of removing tags from a resource.

Class ResizeJobFlowStep

This class provides some helper methods for creating a Resize Job Flow step as part of your job flow. The resize step can be used to automatically adjust the composition of your cluster while it is running. For example, if you have a large workflow with different compute requirements, you can use this step to automatically add a task instance group before your most compute intensive step.

AWSCredentials credentials = new BasicAWSCredentials(accessKey, secretKey);
IAmazonElasticMapReduce emr = new AmazonElasticMapReduceClient(credentials);
            
var resize = new ResizeJobFlowStep
{
    OnArrested = OnArrested.Continue,
    OnFailure = OnFailure.Continue
};
                    
resize.AddResizeAction(new AddInstanceGroup
{
    InstanceGroup = "core",
    InstanceCount = 10
});
                    
resize.AddResizeAction(new AddInstanceGroup
{
    InstanceGroup = "task",
    InstanceCount = 10,
    WithInstanceType = "m1.small"
});
                    
HadoopJarStepConfig config = resize.ToHadoopJarStepConfig();
            
StepConfig resizeJobFlow = new StepConfig {
     Name = "Resize job flow",
     ActionOnFailure = "TERMINATE_JOB_FLOW",
     HadoopJarStep = config,
};
            
RunJobFlowRequest request = new RunJobFlowRequest {
    Name = "Resize job flow",
    Steps = new List { resizeJobFlow },
    LogUri = "s3://log-bucket/",
    Instances = new JobFlowInstancesConfig {
        Ec2KeyName = "keypair",
        HadoopVersion = "0.20",
        InstanceCount = 5,
        KeepJobFlowAliveWhenNoSteps = true,
        MasterInstanceType = "m1.small",
        SlaveInstanceType = "m1.small"
    }
};
            
RunJobFlowResponse response = emr.RunJobFlow(request);
             

Class RunJobFlowRequest

Container for the parameters to the RunJobFlow operation. RunJobFlow creates and starts running a new cluster (job flow). The cluster runs the steps specified. After the steps complete, the cluster stops and the HDFS partition is lost. To prevent loss of data, configure the last step of the job flow to store results in Amazon S3. If the JobFlowInstancesConfig KeepJobFlowAliveWhenNoSteps parameter is set to TRUE, the cluster transitions to the WAITING state rather than shutting down after the steps have completed.

For additional protection, you can set the JobFlowInstancesConfig TerminationProtected parameter to TRUE to lock the cluster and prevent it from being terminated by API call, user intervention, or in the event of a job flow error.

A maximum of 256 steps are allowed in each job flow.

If your cluster is long-running (such as a Hive data warehouse) or complex, you may require more than 256 steps to process your data. You can bypass the 256-step limitation in various ways, including using the SSH shell to connect to the master node and submitting queries directly to the software running on the master node, such as Hive and Hadoop. For more information on how to do this, see Add More than 256 Steps to a Cluster in the Amazon EMR Management Guide.

For long running clusters, we recommend that you periodically store your results.

The instance fleets configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions. The RunJobFlow request can contain InstanceFleets parameters or InstanceGroups parameters, but not both.

Class RunJobFlowResponse

The result of the RunJobFlow operation.

Class ScalingAction

The type of adjustment the automatic scaling activity makes when triggered, and the periodicity of the adjustment.

Class ScalingConstraints

The upper and lower EC2 instance limits for an automatic scaling policy. Automatic scaling activities triggered by automatic scaling rules will not cause an instance group to grow above or below these limits.

Class ScalingRule

A scale-in or scale-out rule that defines scaling activity, including the CloudWatch metric alarm that triggers activity, how EC2 instances are added or removed, and the periodicity of adjustments. The automatic scaling policy for an instance group can comprise one or more automatic scaling rules.

Class ScalingTrigger

The conditions that trigger an automatic scaling activity.

Class ScriptBootstrapActionConfig

Configuration of the script to run during a bootstrap action.

Class SecurityConfigurationSummary

The creation date and time, and name, of a security configuration.

Class SetTerminationProtectionRequest

Container for the parameters to the SetTerminationProtection operation. SetTerminationProtection locks a cluster (job flow) so the EC2 instances in the cluster cannot be terminated by user intervention, an API call, or in the event of a job-flow error. The cluster still terminates upon successful completion of the job flow. Calling SetTerminationProtection on a cluster is similar to calling the Amazon EC2 DisableAPITermination API on all EC2 instances in a cluster.

SetTerminationProtection is used to prevent accidental termination of a cluster and to ensure that in the event of an error, the instances persist so that you can recover any data stored in their ephemeral instance storage.

To terminate a cluster that has been locked by setting SetTerminationProtection to true, you must first unlock the job flow by a subsequent call to SetTerminationProtection in which you set the value to false.

For more information, seeManaging Cluster Termination in the Amazon EMR Management Guide.

Class SetTerminationProtectionResponse

This is the response object from the SetTerminationProtection operation.

Class SetVisibleToAllUsersRequest

Container for the parameters to the SetVisibleToAllUsers operation. Sets whether all AWS Identity and Access Management (IAM) users under your account can access the specified clusters (job flows). This action works on running clusters. You can also set the visibility of a cluster when you launch it using the VisibleToAllUsers parameter of RunJobFlow. The SetVisibleToAllUsers action can be called only by an IAM user who created the cluster or the AWS account that owns the cluster.

Class SetVisibleToAllUsersResponse

This is the response object from the SetVisibleToAllUsers operation.

Class ShrinkPolicy

Policy for customizing shrink operations. Allows configuration of decommissioning timeout and targeted instance shrinking.

Class SimpleScalingPolicyConfiguration

An automatic scaling configuration, which describes how the policy adds or removes instances, the cooldown period, and the number of EC2 instances that will be added each time the CloudWatch metric alarm condition is satisfied.

Class SpotProvisioningSpecification

The launch specification for Spot instances in the instance fleet, which determines the defined duration and provisioning timeout behavior.

The instance fleet configuration is available only in Amazon EMR versions 4.8.0 and later, excluding 5.0.x versions.

Class Step

This represents a step in a cluster.

Class StepConfig

Specification of a cluster (job flow) step.

Class StepDetail

Combines the execution state and configuration of a step.

Class StepExecutionStatusDetail

The execution state of a step.

Class StepFactory

This class provides helper methods for creating common Elastic MapReduce step types. To use StepFactory, you should construct it with the appropriate bucket for your region. The official bucket format is ".elasticmapreduce", so us-east-1 would use the bucket "us-east-1.elasticmapreduce".

Class StepFactory.HiveVersion

The available Hive versions. For reference: http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/UsingEMR_SupportedHiveVersions.html

Class StepStateChangeReason

The details of the step state change reason.

Class StepStatus

The execution status details of the cluster step.

Class StepSummary

The summary of the cluster step.

Class StepTimeline

The timeline of the cluster step lifecycle.

Class StreamingStep

Class that makes it easy to define Hadoop Streaming steps.

See also: Hadoop Streaming

AWSCredentials credentials = new BasicAWSCredentials(accessKey, secretKey);
IAmazonElasticMapReduce emr = new AmazonElasticMapReduceClient(credentials);
            
HadoopJarStepConfig config = new StreamingStep {
    Inputs = new List { "s3://elasticmapreduce/samples/wordcount/input" },
    Output = "s3://my-bucket/output/",
    Mapper = "s3://elasticmapreduce/samples/wordcount/wordSplitter.py",
    Reducer = "aggregate"
}.ToHadoopJarStepConfig();
            
StepConfig wordCount = new StepConfig {
    Name = "Word Count",
    ActionOnFailure = "TERMINATE_JOB_FLOW",
    HadoopJarStep = config
};
            
RunJobFlowRequest request = new RunJobFlowRequest {
    Name = "Word Count",
    Steps = new List { wordCount },
    LogUri = "s3://log-bucket/",
    Instances = new JobFlowInstancesConfig {
        Ec2KeyName = "keypair",
        HadoopVersion = "0.20",
        InstanceCount = 5,
        KeepJobFlowAliveWhenNoSteps = true,
        MasterInstanceType = "m1.small",
        SlaveInstanceType = "m1.small"
   }
};
            
RunJobFlowResponse response = emr.RunJobFlow(request);
             

Class SupportedProductConfig

The list of supported product configurations which allow user-supplied arguments. EMR accepts these arguments and forwards them to the corresponding installation script as bootstrap action arguments.

Class Tag

A key/value pair containing user-defined metadata that you can associate with an Amazon EMR resource. Tags make it easier to associate clusters in various ways, such as grouping clusters to track your Amazon EMR resource allocation costs. For more information, see Tagging Amazon EMR Resources.

Class TerminateJobFlowsRequest

Container for the parameters to the TerminateJobFlows operation. TerminateJobFlows shuts a list of clusters (job flows) down. When a job flow is shut down, any step not yet completed is canceled and the EC2 instances on which the cluster is running are stopped. Any log files not already saved are uploaded to Amazon S3 if a LogUri was specified when the cluster was created.

The maximum number of clusters allowed is 10. The call to TerminateJobFlows is asynchronous. Depending on the configuration of the cluster, it may take up to 1-5 minutes for the cluster to completely terminate and release allocated resources, such as Amazon EC2 instances.

Class TerminateJobFlowsResponse

This is the response object from the TerminateJobFlows operation.

Class VolumeSpecification

EBS volume specifications such as volume type, IOPS, and size (GiB) that will be requested for the EBS volume attached to an EC2 instance in the cluster.

Interfaces

NameDescription
Interface ResizeAction

Enums

NameDescription
Enum ConfigFile

Valid config files.

Enum Daemon

List of Hadoop daemons which can be configured.

Enum OnArrested

The action to take if your step is waiting for the instance group to start and it enters the Arrested state.

Fail - Fail the step. Wait - Continue waiting until the instance group is no longer arrested (requires manual intervention). Continue - Proceed onto the next step.

Enum OnFailure

Action to take if there is a failure modifying your cluster composition. Fail - Fail the step. Continue - Proceed on to the next step.