AWS Glue DataBrew - AWS Prescriptive Guidance

AWS Glue DataBrew

AWS Glue DataBrew differs from AWS Glue ETL in that you don't have to write code to work with it. DataBrew has its own console view, separate from the AWS Glue console. It works with the following services:

  • AWS Data Exchange

  • AWS Glue Data Catalog

  • AWS Lake Formation

  • Amazon RDS

  • Amazon Redshift

  • Amazon S3

DataBrew is based on the following six core concepts:

Project

The entire data preparation workspace in DataBrew

Dataset

A set of data

Recipe

A set of instructions containing many steps; each step can contain many actions

Job

A set of instructions to run a recipe on a dataset (a recipe job) or to generate a data profile (a profile job)

Data lineage

The tracking of data in a visual interface to identify its origin

Data profile

A summary view of the shape of your data
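
DataBrew itself requires no code, but the way these concepts relate to one another can be seen in how the same resources might be created through the AWS SDK for Python (boto3). The following is a minimal sketch, not a recommended workflow. The resource names, the bucket, the object key, the role ARN, and the choice of the UPPER_CASE action on a column named region are all hypothetical placeholders. The payloads are built as plain dictionaries so that they can be inspected without an AWS account. The submit function is the only part that calls AWS, and it assumes that boto3 is installed and that credentials are configured.

```python
import json


def build_requests(bucket, key, role_arn):
    """Build request payloads for the DataBrew concepts (all names are hypothetical)."""
    # Dataset: a pointer to the data, here an object in Amazon S3.
    dataset = {
        "Name": "sales-dataset",
        "Input": {"S3InputDefinition": {"Bucket": bucket, "Key": key}},
    }
    # Recipe: an ordered list of steps; this sketch has a single step.
    recipe = {
        "Name": "sales-recipe",
        "Steps": [
            {
                "Action": {
                    "Operation": "UPPER_CASE",  # assumed example transformation
                    "Parameters": {"sourceColumn": "region"},
                }
            }
        ],
    }
    # Project: the workspace that ties a dataset to a recipe.
    project = {
        "Name": "sales-project",
        "DatasetName": dataset["Name"],
        "RecipeName": recipe["Name"],
        "RoleArn": role_arn,
    }
    # Job: here a profile job, which produces a data profile of the dataset.
    profile_job = {
        "Name": "sales-profile-job",
        "DatasetName": dataset["Name"],
        "OutputLocation": {"Bucket": bucket},
        "RoleArn": role_arn,
    }
    return {
        "dataset": dataset,
        "recipe": recipe,
        "project": project,
        "profile_job": profile_job,
    }


def submit(requests):
    """Create the resources in AWS. Requires boto3 and configured credentials."""
    import boto3  # third-party; imported here so the rest runs without it

    client = boto3.client("databrew")
    client.create_dataset(**requests["dataset"])
    client.create_recipe(**requests["recipe"])
    client.create_project(**requests["project"])
    client.create_profile_job(**requests["profile_job"])
    client.start_job_run(Name=requests["profile_job"]["Name"])


if __name__ == "__main__":
    payloads = build_requests(
        bucket="example-bucket",
        key="input/sales.csv",
        role_arn="arn:aws:iam::111122223333:role/ExampleDataBrewRole",
    )
    print(json.dumps(payloads, indent=2))
```

In this sketch, the project and the profile job refer to the dataset and the recipe by name, which mirrors how the concepts depend on each other in the console. Data lineage does not appear in the sketch because it is described above as a visual interface.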

To get started with DataBrew, use the AWS Glue DataBrew sample project tutorial.