Iceberg table compression support by file format - Amazon Athena

Iceberg table compression support by file format

Apache Iceberg compression support in Athena depends on the engine version.

Iceberg compression support in Athena engine version 3

The following table summarizes the compression format support in Athena engine version 3 for storage file formats in Apache Iceberg. "Yes" or "No" in a cell apply equally to read and write operations except where noted. For the purposes of this table, CREATE TABLE, CTAS, and INSERT INTO are considered write operations. The default storage format for Iceberg in Athena engine version 3 is Parquet. The default compression format for Iceberg in Athena engine version 3 is ZSTD. For more information about using ZSTD compression levels in Athena, see Using ZSTD compression levels in Athena.

Avro ORC Parquet (default)
BZIP2 No No No
GZIP Yes No Yes
LZ4 No Yes No
SNAPPY Yes Yes Yes
ZLIB No Yes No
ZSTD Yes Yes Yes (default)
NONE Yes (specify None or Deflate) Yes Yes (specify None or Uncompressed)

Iceberg compression support in Athena engine version 2

The following table summarizes the compression format support in Athena engine version 2 for Apache Iceberg. "Yes" or "No" in a cell apply equally to read and write operations except where noted. For the purposes of this table, CREATE TABLE, CTAS, and INSERT INTO are considered write operations. The default storage format for Iceberg in Athena engine version 2 is Parquet. The default compression format for Iceberg in Athena engine version 2 is GZIP.

Avro

(Not supported)

ORC

(Not supported)

Parquet (default)
BZIP2 No No No
GZIP No No Yes (default)
LZ4 No No No
SNAPPY No No Yes
ZLIB No No No
ZSTD No No Yes
NONE No No Yes