You can enable automatic column statistics generation for all new Apache Iceberg tables and tables in non-OTF formats (Parquet, JSON, CSV, XML, ORC, ION) in the Data Catalog. After a table is created, you can also update its column statistics settings manually.
To update the Data Catalog settings and enable statistics generation at the catalog level, the IAM role you use must have the glue:UpdateCatalog permission or the AWS Lake Formation ALTER CATALOG permission on the root catalog. You can use the GetCatalog API to verify the catalog properties.
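The verification step can be sketched in Python with boto3. This is a minimal sketch, not a definitive implementation: it assumes the boto3 Glue client exposes `get_catalog` and that the response nests custom properties under `Catalog` → `CatalogProperties` → `CustomProperties`. The helper names and the example account ID are hypothetical.

```python
def custom_catalog_properties(get_catalog_response):
    """Return the custom properties from a GetCatalog-style response dict.

    Assumption: properties live under
    Catalog -> CatalogProperties -> CustomProperties.
    Missing levels yield an empty dict instead of raising.
    """
    catalog = get_catalog_response.get("Catalog") or {}
    properties = catalog.get("CatalogProperties") or {}
    return dict(properties.get("CustomProperties") or {})


def fetch_custom_catalog_properties(catalog_id):
    """Call GetCatalog for the given catalog and return its custom properties."""
    # Imported lazily so the parsing helper above works without boto3 installed.
    import boto3

    glue = boto3.client("glue")
    response = glue.get_catalog(CatalogId=catalog_id)
    return custom_catalog_properties(response)


# Example usage (hypothetical account ID; requires AWS credentials):
# for key, value in fetch_custom_catalog_properties("123456789012").items():
#     print(key, "=", value)
```

The caller needs permission to read the catalog. Inspect the printed properties to confirm that the statistics settings were applied.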
To enable automatic column statistics generation at the account level
1. Open the Lake Formation console at https://console.aws.amazon.com/lakeformation/.
2. On the left navigation bar, choose Catalogs.
3. On the Catalog summary page, choose Edit under Optimization configuration.
4. On the Table optimization configuration page, choose the Enable automatic statistics generation for the tables of the catalog option.
5. Choose an existing IAM role or create a new one that has the necessary permissions to run the column statistics task.
6. Choose Submit.
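The same setting can in principle be applied programmatically through the UpdateCatalog API that the glue:UpdateCatalog permission refers to. The following is a hedged sketch using boto3. The custom property keys `ColumnStatistics.Enabled` and `ColumnStatistics.RoleArn` are assumptions that this page does not state, as are the helper names, the account ID, and the role ARN. Confirm the keys against the AWS Glue API reference before relying on them.

```python
def build_catalog_input(role_arn, enabled=True):
    """Build a CatalogInput payload that toggles automatic column statistics.

    Assumption: the setting is carried in custom catalog properties named
    ColumnStatistics.Enabled and ColumnStatistics.RoleArn.
    """
    return {
        "CatalogProperties": {
            "CustomProperties": {
                "ColumnStatistics.Enabled": "true" if enabled else "false",
                "ColumnStatistics.RoleArn": role_arn,
            }
        }
    }


def enable_catalog_column_statistics(catalog_id, role_arn):
    """Send the payload to the root catalog with UpdateCatalog."""
    # Imported lazily so build_catalog_input can be used without boto3 installed.
    import boto3

    glue = boto3.client("glue")
    glue.update_catalog(
        CatalogId=catalog_id,
        CatalogInput=build_catalog_input(role_arn),
    )


# Example usage (hypothetical values; requires AWS credentials):
# enable_catalog_column_statistics(
#     "123456789012",
#     "arn:aws:iam::123456789012:role/ExampleColumnStatsRole",
# )
```

This sketch does not establish whether UpdateCatalog merges or replaces other catalog settings, so review the existing catalog configuration before running it against a real account.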