Creating a data lake from an AWS CloudTrail source
This tutorial guides you through the actions to take on the Lake Formation console to create and load your first data lake from an AWS CloudTrail source.
High-level steps for creating a data lake
1. Register an Amazon Simple Storage Service (Amazon S3) path as a data lake.
2. Grant Lake Formation permissions to write to the Data Catalog and to Amazon S3 locations in the data lake.
3. Create a database to organize the metadata tables in the Data Catalog.
4. Use a blueprint to create a workflow. Run the workflow to ingest data from a data source.
5. Set up your Lake Formation permissions to allow others to manage data in the Data Catalog and the data lake.
6. Set up Amazon Athena to query the data that you imported into your Amazon S3 data lake.
7. For some data store types, set up Amazon Redshift Spectrum to query the data that you imported into your Amazon S3 data lake.
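The registration, database, and permission steps above can also be performed with the AWS CLI instead of the console. A minimal sketch follows; the bucket name `your-datalake-bucket`, account ID `111122223333`, database name `lakeformation_cloudtrail`, and user `datalake_user` are all placeholders, and the commands assume your CLI credentials have Lake Formation administrator rights.

```shell
# Register an Amazon S3 path as a data lake location (Step 4),
# using the Lake Formation service-linked role for access.
aws lakeformation register-resource \
    --resource-arn arn:aws:s3:::your-datalake-bucket \
    --use-service-linked-role

# Create a database in the Data Catalog to hold the metadata tables (Step 6).
aws glue create-database \
    --database-input '{"Name": "lakeformation_cloudtrail"}'

# Grant data location permissions on the registered path (Step 5).
aws lakeformation grant-permissions \
    --principal DataLakePrincipalIdentifier=arn:aws:iam::111122223333:user/datalake_user \
    --permissions DATA_LOCATION_ACCESS \
    --resource '{"DataLocation": {"ResourceArn": "arn:aws:s3:::your-datalake-bucket"}}'

# Grant SELECT on all tables in the database (Step 10),
# so the data analyst can query them with Athena.
aws lakeformation grant-permissions \
    --principal DataLakePrincipalIdentifier=arn:aws:iam::111122223333:user/datalake_user \
    --permissions SELECT \
    --resource '{"Table": {"DatabaseName": "lakeformation_cloudtrail", "TableWildcard": {}}}'
```

These commands mirror the console actions one-to-one; the workflow creation from a blueprint (Steps 8 and 9) has no single CLI equivalent and is done on the console as described in this tutorial.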
Topics
- Intended audience
- Prerequisites
- Step 1: Create a data analyst user
- Step 2: Add permissions to read AWS CloudTrail logs to the workflow role
- Step 3: Create an Amazon S3 bucket for the data lake
- Step 4: Register an Amazon S3 path
- Step 5: Grant data location permissions
- Step 6: Create a database in the Data Catalog
- Step 7: Grant data permissions
- Step 8: Use a blueprint to create a workflow
- Step 9: Run the workflow
- Step 10: Grant SELECT on the tables
- Step 11: Query the data lake using Amazon Athena