Solution to set up, operate, leverage scalable analytics capabilities, and manage a hosting environment for Apache Druid on AWS - Scalable Analytics Using Apache Druid on AWS

Solution to set up, operate, leverage scalable analytics capabilities, and manage a hosting environment for Apache Druid on AWS

Publication date: January 2024 (last update: September 2024)

The Scalable Analytics using Apache Druid on AWS solution allows you to efficiently deploy, operate, manage and customize a cost-effective, highly available, resilient, and fault tolerant hosting environment for Apache Druid analytics databases on AWS. We expect that customers will be familiar with Apache Druid before deploying and using this solution.

This implementation guide provides an overview of the Scalable Analytics using Apache Druid on AWS solution, its reference architecture and components, considerations for planning the deployment, configuration steps for deploying the solution to the Amazon Web Services (AWS) Cloud.

This guide is intended for solution architects, business decision makers, DevOps engineers, database services administrators, and cloud professionals who want to implement Apache Druid on AWS in their environment.

Use this navigation table to quickly find answers to these questions:

If you want to . . . Read . . .

Know the cost for running this solution across small, medium, or large usage profiles.

The estimated cost for running this solution in the US East (N. Virginia) Region for a medium usage profile is USD $2205.47 per month for AWS resources.

Cost
Understand the security considerations for this solution, and recommended security best practices across the solution features.

Security and Security best practices

Know how to configure the solution.

Describes the various options that you configure for your use case while deploying Apache Druid in your AWS account.

Configure the solution
Know which AWS Regions support this solution. Supported AWS Regions

Find out how to use CloudWatch to monitor the solution.

Provides information on all the Druid data logs in Amazon CloudWatch for monitoring purposes, including alarms, logs, and a dashboard for reporting purposes.

Monitoring the solution

Access the source code and optionally use the AWS Cloud Development Kit (AWS CDK) to deploy the solution.

GitHub repository