Product and service integrations
Use this section to learn which products and services integrate with DataBrew.
DataBrew works with the following AWS services for networking, management, and governance:
DataBrew works with the following AWS data lakes and data stores:
DataBrew supports the following file formats and extensions for uploading data.
Format | File extension (optional) | Extensions for compressed files (required) |
---|---|---|
Comma-separated values | | |
Microsoft Excel workbook | | No compression support |
JSON (JSON document and JSON lines) | | |
Apache ORC | | |
Apache Parquet | | |
DataBrew writes output files to Amazon S3, and supports the following file formats and extensions.
Format | File extension (uncompressed) | File extensions (compressed) |
---|---|---|
Comma-separated values | .csv | .csv.snappy, .csv.gz, .csv.lz4, .csv.bz2, .csv.deflate, .csv.br |
Tab-separated values | .csv | .tsv.snappy, .tsv.gz, .tsv.lz4, .tsv.bz2, .tsv.deflate, .tsv.br |
Apache Parquet | .parquet | .parquet.snappy, .parquet.gz, .parquet.lz4, .parquet.lzo, .parquet.br |
AWS Glue Parquet | Not supported | .glue.parquet.snappy |
Apache Avro | .avro | .avro.snappy, .avro.gz, .avro.lz4, .avro.bz2, .avro.deflate, .avro.br |
Apache ORC | .orc | .orc.snappy, .orc.lzo, .orc.zlib |
XML | .xml | .xml.snappy, .xml.gz, .xml.lz4, .xml.bz2, .xml.deflate, .xml.br |
JSON (JSON Lines format only) | .json | .json.snappy, .json.gz, .json.lz4, .json.bz2, .json.deflate, .json.br |
Tableau Hyper | Not supported | Not applicable |
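
When post-processing job output in Amazon S3, it can help to have the output table above available as a lookup. The following Python sketch transcribes the table into two dictionaries and returns every extension listed for a given format, each written with a leading dot. The names `UNCOMPRESSED`, `COMPRESSED`, and `output_extensions` are hypothetical helpers made up for this illustration; they are not part of DataBrew or any AWS SDK, and the format labels are the table's row names rather than API values.

```python
# Illustrative lookup built from the output-format table above.
# These names are assumptions for this sketch, not DataBrew API identifiers.

# Extension used when the output is written without compression.
UNCOMPRESSED = {
    "Comma-separated values": ".csv",
    "Tab-separated values": ".csv",  # as listed in the table
    "Apache Parquet": ".parquet",
    "Apache Avro": ".avro",
    "Apache ORC": ".orc",
    "XML": ".xml",
    "JSON": ".json",
}

# Base extension plus the compression suffixes listed for each format.
COMPRESSED = {
    "Comma-separated values": (".csv", ("snappy", "gz", "lz4", "bz2", "deflate", "br")),
    "Tab-separated values": (".tsv", ("snappy", "gz", "lz4", "bz2", "deflate", "br")),
    "Apache Parquet": (".parquet", ("snappy", "gz", "lz4", "lzo", "br")),
    "AWS Glue Parquet": (".glue.parquet", ("snappy",)),
    "Apache Avro": (".avro", ("snappy", "gz", "lz4", "bz2", "deflate", "br")),
    "Apache ORC": (".orc", ("snappy", "lzo", "zlib")),
    "XML": (".xml", ("snappy", "gz", "lz4", "bz2", "deflate", "br")),
    "JSON": (".json", ("snappy", "gz", "lz4", "bz2", "deflate", "br")),
}


def output_extensions(fmt):
    """Return every extension the table lists for one output format."""
    extensions = []
    plain = UNCOMPRESSED.get(fmt)
    if plain is not None:
        extensions.append(plain)
    base, codecs = COMPRESSED.get(fmt, ("", ()))
    extensions.extend(f"{base}.{codec}" for codec in codecs)
    return extensions


if __name__ == "__main__":
    for name in ("Apache ORC", "AWS Glue Parquet", "Tableau Hyper"):
        print(name, output_extensions(name))
```

Formats that have no extension listed in the table, such as Tableau Hyper, produce an empty list, so callers can treat an empty result as "no file extension to match on".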