Apache Superset - Automated Data Analytics on AWS

Apache Superset

This extension allows you to deploy Apache Superset as data analytics and visualization platform for the Automated Data Analytics on AWS (ADA) solution. You can use Apache Superset to ingest data from ADA data products and help ADA users explore and visualize their data sets.

Note

Apache Superset is offered by an AWS Partner Solution Apache Superset On AWS, and is a standalone solution that runs alongside the ADA solution, serving as the visualization platform for ADA.

Connecting a data product in ADA

After you have you deployed Apache Superset in ADA, you can connect a data product in ADA to start visualizing data in Apache Superset.

To connect to a data product in ADA:

  1. From the ADA user profile, create an API key for the ADA user who will access ADA data using Apache Superset.

  2. Open Apache Superset and choose Settings→ Database Connections, and select +Database.

  3. From the Supported databases drop-down list, choose Other, and enter the following URI in the SQLALCHEMY URI field.

    awsathena+rest://api-key<Ada API Key>:1234@athena.<AWS Region>.amazonaws.com:443?catalog_name=<ADA Data Product Domain Identifier>&s3_staging_dir=s3%3A//

  4. Edit the Display Name from Other to ADA [Data Product Domain Name].

  5. Choose Test Connection to make sure Apache Superset can connect to the ADA data product.

  6. Once confirmed, choose Connect to create this database connection. You can now start using Apache Superset to analyze data from the ADA platform.

    Note

    Deploying Apache Superset requires user access to the AWS Management Console and an intermediate level of knowledge of AWS service administration. Make sure you have sufficient permissions and access rights before you start the deployment process.

Viewing the visualization in Apache Superset

Once you have deployed Apache Superset, and connected your data product to Apache Superset, you can view and use Apache Superset’s visualization platform for analyzing your data.

  1. From the AWS CloudFormation console, navigate to the superset stack.

  2. On the Outputs tab, under SupersetConsole, copy the URL and open it in a browser. This URL points to Apache Superset's web console. The default user name and password are below. You must change the admin password after the first login.

    • username: admin

    • password: admin

  3. After you sign in, open the Datasets tab to view your visualizations. For example, a bar graph view of your data could be represented like this.

    Apache Superset visualization.

    Apache Superset visualization