AWS services or capabilities described in AWS Documentation may vary by region/location. Click Getting Started with Amazon AWS to see specific differences applicable to the China (Beijing) Region.
DataSourcereferences data that can be used to perform CreateMLModel, CreateEvaluation, or CreateBatchPrediction operations.
CreateDataSourceFromS3 is an asynchronous operation. In response to
Amazon Machine Learning (Amazon ML) immediately returns and sets the
PENDING. After the
DataSource is created and ready
for use, Amazon ML sets the
Status parameter to
PENDING status can
only be used to perform CreateMLModel, CreateEvaluation or CreateBatchPrediction
If Amazon ML cannot accept the input source, it sets the
FAILED and includes an error message in the
of the GetDataSource operation response.
The observation data used in a
DataSource should be ready to use; that
is, it should have a consistent structure, and missing data values should be kept
to a minimum. The observation data must reside in one or more CSV files in an Amazon
Simple Storage Service (Amazon S3) bucket, along with a schema that describes the
data items by name and type. The same schema must be used for all of the data files
referenced by the
DataSource has been created, it's ready to use in evaluations
and batch predictions. If you plan to use the
DataSource to train an
DataSource requires another item: a recipe.
A recipe describes the observation variables that participate in training an
A recipe describes how each input variable will be used in training. Will the variable
be included or excluded from training? Will the variable be manipulated, for example,
combined with another variable, or split apart into word combinations? The recipe
provides answers to these questions. For more information, see the Amazon
Machine Learning Developer Guide.
Version: (assembly version)
public class CreateDataSourceFromS3Request : AmazonMachineLearningRequest IRequestEvents
The CreateDataSourceFromS3Request type exposes the following members
Gets and sets the property ComputeStatistics.
The compute statistics for a
Gets and sets the property DataSourceId.
A user-supplied identifier that uniquely identifies the
Gets and sets the property DataSourceName.
A user-supplied name or description of the
Gets and sets the property DataSpec.
The data specification of a
Supported in: 4.5, 4.0, 3.5