Automation and access control
Automation
Pipeline automation is a crucial part of modern data-centric architecture design. To run your production system successfully, we recommend a data pipeline that has a start trigger, connecting steps, and a mechanism for separating failed stages from passed stages. It's also important to log failures without blocking the rest of the ETL process.
You can use AWS Glue workflows to create a pipeline. The pipeline supports all AWS Glue jobs, Amazon EventBridge triggers, and crawlers. You can create workflows from scratch or by using AWS Glue blueprints. A blueprint provides a framework for a common, reusable use case, such as a workflow that imports data from Amazon S3 into a DynamoDB table. You can use parameters to make the blueprint reusable.
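As a minimal sketch of such a pipeline, the following Python code builds the request parameters for one AWS Glue workflow with a start trigger, a connecting step, and a separate branch for failures. The workflow, crawler, and job names are hypothetical placeholders, and the layout (crawler, then ETL job, then a failure-logging job) is an assumption for illustration, not a prescribed design. The parameters follow the boto3 Glue `create_workflow` and `create_trigger` calls, which only the `deploy` function invokes.

```python
def build_workflow_requests(workflow, crawler, etl_job, failure_job):
    """Build request parameters for one Glue workflow and its three triggers.

    All names passed in are hypothetical; nothing here calls AWS.
    """
    create_workflow = {
        "Name": workflow,
        "Description": "Crawl, transform, and log failures separately",
    }
    triggers = [
        {
            # Start trigger: runs the crawler when the workflow is started.
            "Name": f"{workflow}-start",
            "WorkflowName": workflow,
            "Type": "ON_DEMAND",
            "Actions": [{"CrawlerName": crawler}],
        },
        {
            # Connecting step: run the ETL job only after the crawl succeeds.
            "Name": f"{workflow}-crawl-succeeded",
            "WorkflowName": workflow,
            "Type": "CONDITIONAL",
            "StartOnCreation": True,
            "Predicate": {
                "Conditions": [
                    {
                        "LogicalOperator": "EQUALS",
                        "CrawlerName": crawler,
                        "CrawlState": "SUCCEEDED",
                    }
                ]
            },
            "Actions": [{"JobName": etl_job}],
        },
        {
            # Failure branch: a failed or timed-out ETL job starts a separate
            # job that records the failure, apart from the success path.
            "Name": f"{workflow}-etl-failed",
            "WorkflowName": workflow,
            "Type": "CONDITIONAL",
            "StartOnCreation": True,
            "Predicate": {
                "Logical": "ANY",
                "Conditions": [
                    {"LogicalOperator": "EQUALS", "JobName": etl_job, "State": "FAILED"},
                    {"LogicalOperator": "EQUALS", "JobName": etl_job, "State": "TIMEOUT"},
                ],
            },
            "Actions": [{"JobName": failure_job}],
        },
    ]
    return create_workflow, triggers


def deploy(create_workflow, triggers):
    """Create the workflow and triggers; requires boto3 and AWS credentials."""
    import boto3  # imported lazily so the builder above runs without AWS access

    glue = boto3.client("glue")
    glue.create_workflow(**create_workflow)
    for trigger in triggers:
        glue.create_trigger(**trigger)
```

Calling `build_workflow_requests("sales-etl", "sales-crawler", "sales-transform", "sales-log-failure")` returns the parameters without contacting AWS, so they can be reviewed before `deploy` is run. This sketch adds a failure branch only for the ETL job; a crawler failure could be handled with a similar conditional trigger.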
If the data pipeline involves more services outside of AWS Glue, then we recommend that you use AWS Step Functions
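To illustrate what orchestration with Step Functions might look like, the following Python sketch builds an Amazon States Language definition that runs an AWS Glue job and routes failures to a separate state. The job name, the Lambda function name, and the state names are hypothetical, and the choice of a Lambda function as the non-Glue service is an assumption made only for this example.

```python
import json


def build_state_machine_definition(glue_job, failure_function):
    """Return an Amazon States Language definition as a Python dict.

    glue_job and failure_function are hypothetical resource names.
    """
    return {
        "Comment": "Run a Glue job and record failures on a separate path",
        "StartAt": "RunGlueJob",
        "States": {
            "RunGlueJob": {
                "Type": "Task",
                # The .sync suffix makes the state wait for the job run to finish.
                "Resource": "arn:aws:states:::glue:startJobRun.sync",
                "Parameters": {"JobName": glue_job},
                # Any error is caught and sent down the failure path.
                "Catch": [
                    {
                        "ErrorEquals": ["States.ALL"],
                        "ResultPath": "$.error",
                        "Next": "RecordFailure",
                    }
                ],
                "Next": "PipelineSucceeded",
            },
            "RecordFailure": {
                "Type": "Task",
                # A service outside AWS Glue: a Lambda function that logs the error.
                "Resource": "arn:aws:states:::lambda:invoke",
                "Parameters": {"FunctionName": failure_function, "Payload.$": "$"},
                "Next": "PipelineFailed",
            },
            "PipelineSucceeded": {"Type": "Succeed"},
            "PipelineFailed": {"Type": "Fail", "Error": "GlueJobFailed"},
        },
    }


if __name__ == "__main__":
    definition = build_state_machine_definition("sales-transform", "log-etl-failure")
    # The JSON text is what a state machine definition is created from.
    print(json.dumps(definition, indent=2))
```

The `Catch` block keeps the failed and passed paths separate: a successful run ends in `PipelineSucceeded`, while a failed run is logged first and then ends in `PipelineFailed`.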
Finally, we recommend using EventBridge
Access control
We recommend that you use AWS Identity and Access Management (IAM)