Performing complex ETL activities using blueprints and workflows in AWS Glue
Some of your organization's complex extract, transform, and load (ETL) processes might best be implemented by using multiple, dependent AWS Glue jobs and crawlers. Using AWS Glue workflows, you can design a complex multi-job, multi-crawler ETL process that AWS Glue can run and track as single entity. After you create a workflow and specify the jobs, crawlers, and triggers in the workflow, you can run the workflow on demand or on a schedule.
Topics
- Overview of workflows in AWS Glue
- Creating and building out a workflow manually in AWS Glue
- Starting an AWS Glue workflow with an Amazon EventBridge event
- Viewing the EventBridge events that started a workflow
- Running and monitoring a workflow in AWS Glue
- Stopping a workflow run
- Repairing and resuming a workflow run
- Getting and setting workflow run properties in AWS Glue
- Querying workflows using the AWS Glue API
- Blueprint and workflow restrictions in AWS Glue
- Troubleshooting blueprint errors in AWS Glue
- Permissions for personas and roles for AWS Glue blueprints