Supported Data Sources - Amazon QuickSight

Supported Data Sources

Amazon QuickSight supports a variety of data sources that you can use to provide data for analyses. The following data sources are supported.

Connecting to Relational Data

You can use any of the following relational data stores as data sources for Amazon QuickSight:

  • Amazon Athena

  • Amazon Aurora

  • Amazon Redshift

  • Amazon Redshift Spectrum

  • Amazon S3

  • Amazon S3 Analytics

  • Apache Spark 2.0 or later

  • AWS IoT Analytics

  • MariaDB 10.0 or later

  • Microsoft SQL Server 2012 or later

  • MySQL 5.1 or later

  • Oracle 12c (12.1.0.2) or later

  • PostgreSQL 9.3.1 or later

  • Presto 0.167 or later

  • Snowflake

  • Teradata 14.0 or later

Note

You can access additional data sources not listed here by linking or importing them through supported data sources.

Amazon Redshift clusters, Amazon Athena databases, and Amazon RDS instances must be in AWS. Other database instances must be in one of the following environments to be accessible from Amazon QuickSight:

  • Amazon EC2

  • Local (on-premises) databases

  • Data in a data center or some other internet-accessible environment

For more information, see Infrastructure Security in Amazon QuickSight.

Importing File Data

You can use files in Amazon S3 or on your local (on-premises) network as data sources. QuickSight supports files in the following formats:

  • CSV and TSV – Comma-delimited and tab-delimited text files

  • ELF and CLF – Extended and common log format files

  • JSON – Flat or semistructured data files

  • XLSX – Microsoft Excel files

QuickSight supports UTF-8 file encoding, but not UTF-8 (with BOM).

Files in Amazon S3 that have been compressed with zip, or gzip (www.gzip.org), can be imported as-is. If you used another compression program for files in Amazon S3, or if the files are on your local network, remove compression before importing them.

JSON Data

Amazon QuickSight natively supports JSON flat files and JSON semi-structured data files.

You can either upload a JSON file or connect to your Amazon S3 bucket that contains JSON data. Amazon QuickSight automatically performs schema and type inference on JSON files and embedded JSON objects. Then it flattens the JSON, so you can analyze and visualize application-generated data.

Basic support for JSON flat-file data includes the following:

  • Inferring the schema

  • Determining data types

  • Flattening the data

  • Parsing JSON (JSON embedded objects) from flat files

Support for JSON file structures (.json) includes the following:

  • JSON records with structures

  • JSON records with root elements as arrays

You can also use the parseJson function to extract values from JSON objects in a text file. For example, if your CSV file has a JSON object embedded in one of the fields, you can extract a value from a specified key-value pair (KVP). For more information on how to do this, see parseJson.

The following JSON features aren't supported:

  • Reading JSON with a structure containing a list of records

  • List attributes and list objects within a JSON record; these are skipped during import

  • Customizing upload or configuration settings

  • parseJSON functions for SQL and analyses

  • Error messaging for invalid JSON

  • Extracting a JSON object from a JSON structure

  • Reading delimited JSON records

You can use the parseJson function to parse flat files during data preparation. This function extracts elements from valid JSON structures and lists.

The following JSON values are supported:

  • JSON object

  • String (double quoted)

  • Number (integer and float)

  • Boolean

  • NULL

Software as a Service (SaaS) Data

QuickSight can connect to a variety of Software as a Service (SaaS) data sources either by connecting directly or by using Open Authorization (OAuth).

SaaS sources that support direct connection include the following:

  • Jira

  • ServiceNow

SaaS sources that use OAuth require that you authorize the connection on the SaaS website. For this to work, QuickSight must be able to access the SaaS data source over the network. These sources include the following:

  • Adobe Analytics

  • GitHub

  • Salesforce

    You can use reports or objects in the following editions of Salesforce as data sources for Amazon QuickSight:

    • Enterprise Edition

    • Unlimited Edition

    • Developer Edition

  • Twitter (Twitter Standard Search API returns data for 7 days only)

To connect to on premises data sources, you need to add your data sources and a QuickSight-specific network interface to Amazon Virtual Private Cloud (Amazon VPC). When configured properly, a VPC based on Amazon VPC resembles a traditional network that you operate in your own data center. It enables you to secure and isolate traffic between resources. You define and control the network elements to suit your requirements, while still getting the benefit of cloud networking and the scalable infrastructure of AWS.

For detailed information, see Infrastructure Security in Amazon QuickSight.