Data Nodes - AWS Data Pipeline

Data Nodes

In AWS Data Pipeline, a data node defines the location and type of data that a pipeline activity uses as input or output. AWS Data Pipeline supports the following types of data nodes:

DynamoDBDataNode

A DynamoDB table that contains data for HiveActivity or EmrActivity to use.

SqlDataNode

An SQL table and database query that represent data for a pipeline activity to use.

Note

SqlDataNode replaces MySqlDataNode, which was used previously. Use SqlDataNode instead.

RedshiftDataNode

An Amazon Redshift table that contains data for RedshiftCopyActivity to use.

S3DataNode

An Amazon S3 location that contains one or more files for a pipeline activity to use.
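
As a rough illustration of how a data node might appear in a pipeline definition, the following Python sketch builds two data node objects and prints them as JSON. It is a minimal sketch under stated assumptions, not a definitive template: the `data_node` helper is hypothetical, the `directoryPath` and `tableName` field names and the top-level `objects` list are assumptions not stated on this page, and the bucket and table names are placeholders.

```python
import json

# The four data node types listed above.
DATA_NODE_TYPES = {
    "DynamoDBDataNode",
    "SqlDataNode",
    "RedshiftDataNode",
    "S3DataNode",
}


def data_node(node_id, node_type, **fields):
    """Build one pipeline object describing a data node (hypothetical helper)."""
    if node_type not in DATA_NODE_TYPES:
        raise ValueError(f"unsupported data node type: {node_type}")
    node = {"id": node_id, "name": node_id, "type": node_type}
    node.update(fields)  # type-specific fields, passed through unchanged
    return node


# Placeholder values; the field names are assumptions, not taken from this page.
s3_input = data_node(
    "InputData", "S3DataNode", directoryPath="s3://example-bucket/input/"
)
ddb_source = data_node(
    "SourceTable", "DynamoDBDataNode", tableName="ExampleTable"
)

# Assumed layout: a definition holds its objects in a top-level "objects" list.
definition = {"objects": [s3_input, ddb_source]}
print(json.dumps(definition, indent=2))
```

The helper rejects any type outside the four listed above, so passing the older `MySqlDataNode` name raises a `ValueError` rather than producing a definition silently.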