Choose a SerDe for your data

The following table lists the data formats supported in Athena and their corresponding SerDe libraries.

Supported data formats and SerDes
Data format	Description	SerDe types supported in Athena
Amazon Ion	Amazon Ion is a richly-typed, self-describing data format that is a superset of JSON, developed and open-sourced by Amazon.	Use the Amazon Ion Hive SerDe.
Apache Avro	A format for storing data in Hadoop that uses JSON-based schemas for record values.	Use the Avro SerDe.
Apache Parquet	A format for columnar storage of data in Hadoop.	Use the Parquet SerDe and SNAPPY compression.
Apache WebServer logs	A format for storing logs in Apache WebServer.	Use the Grok SerDe or Regex SerDe.
CloudTrail logs	A format for storing logs in CloudTrail.	Use the Hive JSON SerDe. For more information, see Query AWS CloudTrail logs.
CSV (Comma-Separated Values)	For data in CSV, each line represents a data record, and each record consists of one or more fields, separated by commas.	Use the Lazy Simple SerDe for CSV, TSV, and custom-delimited files if your data does not include values enclosed in quotes or if it uses the `java.sql.Timestamp` format. Use the Open CSV SerDe for processing CSV when your data includes quotes in values or uses the UNIX numeric format for `TIMESTAMP` (for example, `1564610311`).
Custom-Delimited	For data in this format, each line represents a data record, and records are separated by a custom single-character delimiter.	Use the Lazy Simple SerDe for CSV, TSV, and custom-delimited files and specify a custom single-character delimiter.
JSON (JavaScript Object Notation)	For JSON data, each line represents a data record, and each record consists of attribute-value pairs and arrays, separated by commas.	Use the Hive JSON SerDe. Use the OpenX JSON SerDe.
Logstash logs	A format for storing logs in Logstash.	Use the Grok SerDe.
ORC (Optimized Row Columnar)	A format for optimized columnar storage of Hive data.	Use the ORC SerDe and ZLIB compression.
TSV (Tab-Separated Values)	For data in TSV, each line represents a data record, and each record consists of one or more fields, separated by tabs.	Use the Lazy Simple SerDe for CSV, TSV, and custom-delimited files and specify the separator character as `FIELDS TERMINATED BY '\t'`.

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Use SerDes

Use a SerDe to create a table