Table Of Contents

Feedback

User Guide

First time using the AWS CLI? See the User Guide for help getting started.

Note: You are viewing the documentation for an older major version of the AWS CLI (version 1).

AWS CLI version 2, the latest major version of AWS CLI, is now stable and recommended for general use. To view this page for the AWS CLI version 2, click here. For more information see the AWS CLI version 2 installation instructions and migration guide.

[ aws . sagemaker ]

create-processing-job

Description

Creates a processing job.

See also: AWS API Documentation

See 'aws help' for descriptions of global parameters.

Synopsis

  create-processing-job
[--processing-inputs <value>]
[--processing-output-config <value>]
--processing-job-name <value>
--processing-resources <value>
[--stopping-condition <value>]
--app-specification <value>
[--environment <value>]
[--network-config <value>]
--role-arn <value>
[--tags <value>]
[--experiment-config <value>]
[--cli-input-json <value>]
[--generate-cli-skeleton <value>]

Options

--processing-inputs (list)

For each input, data is downloaded from S3 into the processing container before the processing job begins running if "S3InputMode" is set to File .

Shorthand Syntax:

InputName=string,S3Input={S3Uri=string,LocalPath=string,S3DataType=string,S3InputMode=string,S3DataDistributionType=string,S3CompressionType=string} ...

JSON Syntax:

[
  {
    "InputName": "string",
    "S3Input": {
      "S3Uri": "string",
      "LocalPath": "string",
      "S3DataType": "ManifestFile"|"S3Prefix",
      "S3InputMode": "Pipe"|"File",
      "S3DataDistributionType": "FullyReplicated"|"ShardedByS3Key",
      "S3CompressionType": "None"|"Gzip"
    }
  }
  ...
]

--processing-output-config (structure)

Output configuration for the processing job.

JSON Syntax:

{
  "Outputs": [
    {
      "OutputName": "string",
      "S3Output": {
        "S3Uri": "string",
        "LocalPath": "string",
        "S3UploadMode": "Continuous"|"EndOfJob"
      }
    }
    ...
  ],
  "KmsKeyId": "string"
}

--processing-job-name (string)

The name of the processing job. The name must be unique within an AWS Region in the AWS account.

--processing-resources (structure)

Identifies the resources, ML compute instances, and ML storage volumes to deploy for a processing job. In distributed training, you specify more than one instance.

Shorthand Syntax:

ClusterConfig={InstanceCount=integer,InstanceType=string,VolumeSizeInGB=integer,VolumeKmsKeyId=string}

JSON Syntax:

{
  "ClusterConfig": {
    "InstanceCount": integer,
    "InstanceType": "ml.t3.medium"|"ml.t3.large"|"ml.t3.xlarge"|"ml.t3.2xlarge"|"ml.m4.xlarge"|"ml.m4.2xlarge"|"ml.m4.4xlarge"|"ml.m4.10xlarge"|"ml.m4.16xlarge"|"ml.c4.xlarge"|"ml.c4.2xlarge"|"ml.c4.4xlarge"|"ml.c4.8xlarge"|"ml.p2.xlarge"|"ml.p2.8xlarge"|"ml.p2.16xlarge"|"ml.p3.2xlarge"|"ml.p3.8xlarge"|"ml.p3.16xlarge"|"ml.c5.xlarge"|"ml.c5.2xlarge"|"ml.c5.4xlarge"|"ml.c5.9xlarge"|"ml.c5.18xlarge"|"ml.m5.large"|"ml.m5.xlarge"|"ml.m5.2xlarge"|"ml.m5.4xlarge"|"ml.m5.12xlarge"|"ml.m5.24xlarge"|"ml.r5.large"|"ml.r5.xlarge"|"ml.r5.2xlarge"|"ml.r5.4xlarge"|"ml.r5.8xlarge"|"ml.r5.12xlarge"|"ml.r5.16xlarge"|"ml.r5.24xlarge",
    "VolumeSizeInGB": integer,
    "VolumeKmsKeyId": "string"
  }
}

--stopping-condition (structure)

The time limit for how long the processing job is allowed to run.

Shorthand Syntax:

MaxRuntimeInSeconds=integer

JSON Syntax:

{
  "MaxRuntimeInSeconds": integer
}

--app-specification (structure)

Configures the processing job to run a specified Docker container image.

Shorthand Syntax:

ImageUri=string,ContainerEntrypoint=string,string,ContainerArguments=string,string

JSON Syntax:

{
  "ImageUri": "string",
  "ContainerEntrypoint": ["string", ...],
  "ContainerArguments": ["string", ...]
}

--environment (map)

Sets the environment variables in the Docker container.

Shorthand Syntax:

KeyName1=string,KeyName2=string

JSON Syntax:

{"string": "string"
  ...}

--network-config (structure)

Networking options for a processing job.

Shorthand Syntax:

EnableInterContainerTrafficEncryption=boolean,EnableNetworkIsolation=boolean,VpcConfig={SecurityGroupIds=[string,string],Subnets=[string,string]}

JSON Syntax:

{
  "EnableInterContainerTrafficEncryption": true|false,
  "EnableNetworkIsolation": true|false,
  "VpcConfig": {
    "SecurityGroupIds": ["string", ...],
    "Subnets": ["string", ...]
  }
}

--role-arn (string)

The Amazon Resource Name (ARN) of an IAM role that Amazon SageMaker can assume to perform tasks on your behalf.

--tags (list)

(Optional) An array of key-value pairs. For more information, see Using Cost Allocation Tags in the AWS Billing and Cost Management User Guide .

Shorthand Syntax:

Key=string,Value=string ...

JSON Syntax:

[
  {
    "Key": "string",
    "Value": "string"
  }
  ...
]

--experiment-config (structure)

Configuration for the experiment.

Shorthand Syntax:

ExperimentName=string,TrialName=string,TrialComponentDisplayName=string

JSON Syntax:

{
  "ExperimentName": "string",
  "TrialName": "string",
  "TrialComponentDisplayName": "string"
}

--cli-input-json (string) Performs service operation based on the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, the CLI values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally.

--generate-cli-skeleton (string) Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value input, prints a sample input JSON that can be used as an argument for --cli-input-json. If provided with the value output, it validates the command inputs and returns a sample output JSON for that command.

See 'aws help' for descriptions of global parameters.

Output

ProcessingJobArn -> (string)

The Amazon Resource Name (ARN) of the processing job.