Amazon SageMaker
Developer Guide

TransformInput

Describes the input source of a transform job and the way the transform job consumes it.

Contents

CompressionType

Compressing data helps save on storage space. If your transform data is compressed, specify the compression type.and Amazon SageMaker will automatically decompress the data for the transform job accordingly. The default value is None.

Type: String

Valid Values: None | Gzip

Required: No

ContentType

The multipurpose internet mail extension (MIME) type of the data. Amazon SageMaker uses the MIME type with each http call to transfer data to the transform job.

Type: String

Length Constraints: Maximum length of 256.

Required: No

DataSource

Describes the location of the channel data, meaning the S3 location of the input data that the model can consume.

Type: TransformDataSource object

Required: Yes

SplitType

The method to use to split the transform job's data into smaller batches. The default value is None. If you don't want to split the data, specify None. If you want to split records on a newline character boundary, specify Line. To split records according to the RecordIO format, specify RecordIO.

Amazon SageMaker will send maximum number of records per batch in each request up to the MaxPayloadInMB limit. For more information, see RecordIO data format.

Note

For information about the RecordIO format, see Data Format.

Type: String

Valid Values: None | Line | RecordIO

Required: No

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following:

On this page: