DatasetDefinition
Configuration for Dataset Definition inputs. The Dataset Definition input must specify exactly one of the AthenaDatasetDefinition or RedshiftDatasetDefinition types.
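The "exactly one of" rule can be checked on the client before a request is sent. The following is a minimal sketch, assuming the DatasetDefinition is held as a plain Python dict keyed by the member names listed on this page. The helper name `check_exactly_one` is hypothetical and not part of any AWS SDK.

```python
# Hypothetical client-side helper; not part of any AWS SDK.
SOURCE_KEYS = ("AthenaDatasetDefinition", "RedshiftDatasetDefinition")


def check_exactly_one(dataset_definition):
    """Return the name of the single source type present, else raise ValueError."""
    present = [
        key for key in SOURCE_KEYS
        if dataset_definition.get(key) is not None
    ]
    if len(present) != 1:
        raise ValueError(
            "DatasetDefinition must specify exactly one of %s; found %d"
            % (" or ".join(SOURCE_KEYS), len(present))
        )
    return present[0]


# The inner members of each source type are omitted here on purpose;
# they are described on the pages for those types.
example = {"AthenaDatasetDefinition": {}}
print(check_exactly_one(example))
```

The check only counts which source members are set. It does not validate the contents of the Athena or Redshift configuration.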
Contents
- AthenaDatasetDefinition
-
Configuration for Athena Dataset Definition input.
Type: AthenaDatasetDefinition object
Required: No
- DataDistributionType
-
Whether the generated dataset is FullyReplicated or ShardedByS3Key (default).
Type: String
Valid Values: FullyReplicated | ShardedByS3Key
Required: No
- InputMode
-
Whether to use File or Pipe input mode. In File (default) mode, Amazon SageMaker copies the data from the input source onto the local Amazon Elastic Block Store (Amazon EBS) volumes before starting your training algorithm. This is the most commonly used input mode. In Pipe mode, Amazon SageMaker streams input data from the source directly to your algorithm without using the EBS volume.
Type: String
Valid Values: Pipe | File
Required: No
- LocalPath
-
The local path where you want Amazon SageMaker to download the Dataset Definition inputs to run a processing job. LocalPath is an absolute path to the input data. This is a required parameter when AppManaged is False (default).
Type: String
Length Constraints: Maximum length of 256.
Pattern: .*
Required: No
- RedshiftDatasetDefinition
-
Configuration for Redshift Dataset Definition input.
Type: RedshiftDatasetDefinition object
Required: No
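The constraints on the scalar members above (valid values and the LocalPath length limit) can be checked locally. The following is a minimal sketch, assuming the same plain-dict representation. The function name `validate_optional_fields` and the sample path are illustrative assumptions. The leading-slash test for "absolute path" also assumes a POSIX-style container file system.

```python
# Hypothetical validator for the scalar members documented above.
VALID_DISTRIBUTION_TYPES = {"FullyReplicated", "ShardedByS3Key"}
VALID_INPUT_MODES = {"Pipe", "File"}
MAX_LOCAL_PATH_LENGTH = 256


def validate_optional_fields(dataset_definition):
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []

    distribution = dataset_definition.get("DataDistributionType")
    if distribution is not None and distribution not in VALID_DISTRIBUTION_TYPES:
        problems.append("DataDistributionType has invalid value %r" % distribution)

    input_mode = dataset_definition.get("InputMode")
    if input_mode is not None and input_mode not in VALID_INPUT_MODES:
        problems.append("InputMode has invalid value %r" % input_mode)

    local_path = dataset_definition.get("LocalPath")
    if local_path is not None:
        if len(local_path) > MAX_LOCAL_PATH_LENGTH:
            problems.append("LocalPath exceeds %d characters" % MAX_LOCAL_PATH_LENGTH)
        # Assumption: "absolute" means a POSIX path starting with "/".
        if not local_path.startswith("/"):
            problems.append("LocalPath must be an absolute path")

    return problems


sample = {
    "DataDistributionType": "ShardedByS3Key",
    "InputMode": "File",
    "LocalPath": "/opt/ml/processing/input",  # illustrative value only
}
print(validate_optional_fields(sample))
```

Returning a list of problems rather than raising on the first one lets a caller report every issue in a single pass. The service remains the authority on what it accepts.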