Preparing and Importing Data
Amazon Personalize uses data that you provide to train a model. You can import records in bulk, incrementally, or both. With incremental imports, you can add individual historical records, data from live events, or both, depending on your business requirements.
This section provides information about importing historical data into Amazon Personalize. For information about recording live interactions data, see Recording Events.
To import your historical training data into Amazon Personalize, complete the following steps:
- Create an empty dataset group. Dataset groups are domain-specific containers for related datasets. For more information, see Step 1: Creating a Dataset Group.
- For each type of dataset you are using, create an empty dataset with an associated schema. Datasets are Amazon Personalize containers for data, and schemas specify the contents of a dataset. For more information, see Step 2: Creating a Dataset and a Schema.
- Import your data:
  - Import bulk records stored in an Amazon S3 bucket using a dataset import job. See Importing Bulk Records.
  - Import records incrementally using the AWS SDK for Python or the AWS Command Line Interface (AWS CLI). See Importing Records Incrementally.
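The steps above can be sketched in Python for the bulk import path. This is a minimal sketch, not a definitive implementation: it assumes an Interactions dataset with only the three required fields, and the helper names (`wait_until_active`, `import_bulk_interactions`), the resource naming scheme, the S3 path, and the IAM role ARN are all hypothetical placeholders. The `personalize` argument is expected to be a Boto3 client created with `boto3.client("personalize")`.

```python
import json
import time

# Minimal Interactions schema in Avro format. The three fields shown are an
# assumption for illustration; add the fields that match your own data.
INTERACTIONS_SCHEMA = {
    "type": "record",
    "name": "Interactions",
    "namespace": "com.amazonaws.personalize.schema",
    "fields": [
        {"name": "USER_ID", "type": "string"},
        {"name": "ITEM_ID", "type": "string"},
        {"name": "TIMESTAMP", "type": "long"},
    ],
    "version": "1.0",
}


def wait_until_active(describe, key, arn_kwargs, delay=10, max_attempts=60):
    """Poll a describe_* call until the resource reports status ACTIVE."""
    for _ in range(max_attempts):
        status = describe(**arn_kwargs)[key]["status"]
        if status == "ACTIVE":
            return
        if status.endswith("FAILED"):
            raise RuntimeError(f"{key} entered status {status}")
        time.sleep(delay)
    raise TimeoutError(f"{key} did not become ACTIVE")


def import_bulk_interactions(personalize, name, s3_path, role_arn, delay=10):
    """Create a dataset group, schema, and dataset, then start a bulk import.

    Returns the ARN of the dataset import job (hypothetical helper).
    """
    # Step 1: create an empty dataset group and wait for it to become ACTIVE.
    group_arn = personalize.create_dataset_group(name=name)["datasetGroupArn"]
    wait_until_active(
        personalize.describe_dataset_group,
        "datasetGroup",
        {"datasetGroupArn": group_arn},
        delay=delay,
    )

    # Step 2: register the schema, then create an empty dataset that uses it.
    schema_arn = personalize.create_schema(
        name=f"{name}-interactions-schema",
        schema=json.dumps(INTERACTIONS_SCHEMA),
    )["schemaArn"]
    dataset_arn = personalize.create_dataset(
        name=f"{name}-interactions",
        schemaArn=schema_arn,
        datasetGroupArn=group_arn,
        datasetType="Interactions",
    )["datasetArn"]
    wait_until_active(
        personalize.describe_dataset,
        "dataset",
        {"datasetArn": dataset_arn},
        delay=delay,
    )

    # Step 3: start a dataset import job that reads bulk records from Amazon S3.
    return personalize.create_dataset_import_job(
        jobName=f"{name}-import",
        datasetArn=dataset_arn,
        dataSource={"dataLocation": s3_path},
        roleArn=role_arn,
    )["datasetImportJobArn"]
```

A call might look like `import_bulk_interactions(boto3.client("personalize"), "demo", "s3://amzn-s3-demo-bucket/interactions.csv", role_arn)`, where the bucket and role are placeholders for resources that Amazon Personalize has permission to read. The import job runs asynchronously, so the returned ARN can be polled with `describe_dataset_import_job` in the same way. Incremental imports use a separate `personalize-events` client, which exposes `put_events`, `put_users`, and `put_items`.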