Amazon Neptune
User Guide (API Version 2017-11-29)

Load Data Formats

The Amazon Neptune Load API currently requires specific formats for incoming data. The following formats are available, and are listed with their identifiers for the Neptune loader API in parentheses.

  • CSV format (csv) for property graph / Gremlin

  • N-Triples (ntriples) format for RDF / SPARQL

  • N-Quads (nquads) format for RDF / SPARQL

  • RDF/XML (rdfxml) format for RDF / SPARQL

  • Turtle (turtle) format for RDF / SPARQL
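
As an illustration, the identifiers in the list above could be checked on the client side before a load request is submitted. The following is a minimal sketch, not part of Neptune itself: the names LOADER_FORMATS and check_loader_format are hypothetical, and only the identifier strings come from the list above.

```python
# Loader format identifiers from the list above, mapped to the data model
# each one is used with. The dictionary name is a hypothetical helper.
LOADER_FORMATS = {
    "csv": "property graph / Gremlin",
    "ntriples": "RDF / SPARQL",
    "nquads": "RDF / SPARQL",
    "rdfxml": "RDF / SPARQL",
    "turtle": "RDF / SPARQL",
}


def check_loader_format(identifier: str) -> str:
    """Return the identifier unchanged if it is listed, else raise ValueError."""
    if identifier not in LOADER_FORMATS:
        supported = ", ".join(sorted(LOADER_FORMATS))
        raise ValueError(
            f"Unsupported format {identifier!r}; expected one of: {supported}"
        )
    return identifier
```

Calling check_loader_format("turtle") returns the identifier unchanged, while an unlisted value such as "graphml" raises ValueError, which signals that the data must be converted first.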

Important

All files must be encoded in UTF-8 format. If a file is not in UTF-8 format, Neptune tries to load it anyway as UTF-8 data.
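
Because Neptune attempts to load a file as UTF-8 regardless of its actual encoding, it can be useful to verify the encoding locally first. The following is a minimal sketch of such a check using only the Python standard library. The function name is_valid_utf8 and the chunk size are assumptions made for illustration, and the check says nothing about whether the file is valid in any of the load formats.

```python
import codecs


def is_valid_utf8(path, chunk_size=1 << 20):
    """Return True if the file at `path` decodes cleanly as UTF-8.

    The file is read in chunks so that large files need not fit in memory.
    An incremental decoder handles multi-byte characters that happen to be
    split across chunk boundaries.
    """
    decoder = codecs.getincrementaldecoder("utf-8")(errors="strict")
    try:
        with open(path, "rb") as handle:
            for chunk in iter(lambda: handle.read(chunk_size), b""):
                decoder.decode(chunk)
            # final=True makes a truncated trailing byte sequence an error.
            decoder.decode(b"", final=True)
    except UnicodeDecodeError:
        return False
    return True
```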

If your data is not in a supported format, you must convert it before you load it into a Neptune DB instance.

A tool for converting GraphML to the Neptune CSV format is available in the GraphML2CSV project on GitHub.

Compression Support

Neptune supports gzip compression of single files. The file name must end in the .gz extension, and the compressed file must contain a single text file encoded in UTF-8 format. Multiple files can be loaded, but each one must be contained in a separate .gz file (or be an uncompressed text file). Archive files (for example, .tar, .tar.gz, and .tgz) are not supported.
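
The compression rules above can be sketched with the Python standard library as follows. This is an illustrative helper only: the function name gzip_single_file and the constant UNSUPPORTED_SUFFIXES are hypothetical, and the sketch assumes the input is already a UTF-8 text file in one of the supported formats.

```python
import gzip
import shutil
from pathlib import Path

# Archive suffixes that the text above lists as unsupported.
UNSUPPORTED_SUFFIXES = (".tar", ".tar.gz", ".tgz")


def gzip_single_file(src):
    """Compress one file into its own .gz file and return the new path.

    Each input file gets a separate output named "<original name>.gz",
    so no output ever bundles more than one text file.
    """
    src = Path(src)
    if src.name.endswith(UNSUPPORTED_SUFFIXES):
        raise ValueError(f"Archive files are not supported: {src.name}")
    dest = src.with_name(src.name + ".gz")
    with open(src, "rb") as fin, gzip.open(dest, "wb") as fout:
        shutil.copyfileobj(fin, fout)
    return dest
```

To prepare several files, the helper would be called once per file, for example [gzip_single_file(p) for p in paths], rather than bundling them into one archive.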

The following sections describe the formats in more detail.