Load Data Formats - Amazon Neptune

Load Data Formats

The Amazon Neptune Load API currently requires specific formats for incoming data. The following formats are available, and are listed with their identifiers for the Neptune loader API in parentheses.

  • CSV format (csv) for property graph / Gremlin

  • CSV format (csv) for property graph / openCypher

  • N-Triples (ntriples) format for RDF / SPARQL

  • N-Quads (nquads) format for RDF / SPARQL

  • RDF/XML (rdfxml) format for RDF / SPARQL

  • Turtle (turtle) format for RDF / SPARQL


All files must be encoded in UTF-8 format. If a file is not in UTF format, Neptune tries to load it anyway as UTF-8 data.

If your data is not in a supported format, you must convert it before you load it into a Neptune DB instance.

A tool for converting GraphML to the Neptune CSV format is available in the GraphML2CSV project on GitHub.

Compression Support

Neptune supports compression of single files in gzip or bzip2 format. The file name must end in the .gz or .bz2 extension and must contain a single text file encoded in UTF-8 format. Multiple files can be loaded, but each one must be contained in a separate .gz or .bz2 file (or uncompressed text file). Archive files (for example, .tar, .tar.gz, and .tgz) are not supported.

The following sections describe the formats in more detail.