Getting Started with AWS Lake Formation - AWS Lake Formation

Getting Started with AWS Lake Formation

To learn about Lake Formation, go through one of tutorials provided in this guide.

Note

You can go through both tutorials. However, some steps, such as creating users, are duplicated, and can be skipped in the second tutorial. You can use the users that you created in the first tutorial in the second tutorial. The order in which you go through the tutorials is not important.

Before you begin, make sure that you've completed the steps in Setting Up AWS Lake Formation.

The following are the general steps to create and use a data lake:

  1. Register an Amazon Simple Storage Service (Amazon S3) path as a data lake.

  2. Grant Lake Formation permissions to write to the Data Catalog and to Amazon S3 locations in the data lake.

  3. Create a database to organize the metadata tables in the Data Catalog.

  4. Use a blueprint to create a workflow. Run the workflow to ingest data from a data source.

  5. Set up your Lake Formation permissions to allow others to manage data in the Data Catalog and the data lake.

  6. Set up Amazon Athena to query the data that you imported into your Amazon S3 data lake.

  7. For some data store types, set up Amazon Redshift Spectrum to query the data that you imported into your Amazon S3 data lake.