Data Nodes - AWS Data Pipeline

AWS Data Pipeline is no longer available to new customers. Existing customers of AWS Data Pipeline can continue to use the service as normal. Learn more

Data Nodes

In AWS Data Pipeline, a data node defines the location and type of data that a pipeline activity uses as input or output. AWS Data Pipeline supports the following types of data nodes:

DynamoDBDataNode

A DynamoDB table that contains data for HiveActivity or EmrActivity to use.

SqlDataNode

An SQL table and database query that represent data for a pipeline activity to use.

Note

Previously, MySqlDataNode was used. Use SqlDataNode instead.

RedshiftDataNode

An Amazon Redshift table that contains data for RedshiftCopyActivity to use.

S3DataNode

An Amazon S3 location that contains one or more files for a pipeline activity to use.