Choosing an AWS database service - Choosing an AWS database service

Choosing an AWS database service

Taking the first step

Purpose

Help determine which AWS database or databases are the best fit for your organization.

Last updated

May 13, 2024

Covered services

Introduction

AWS offers a growing number of database options (currently more than 15) to support diverse data models. These include relational, key-value, document, in-memory, graph, time series, wide-column, and ledger databases.

Choosing the right database or multiple databases requires you to make a series of decisions based on your organizational needs. This decision guide will help you ask the right questions, provide a clear path for implementation, and help you migrate from your existing database.

This 6½-minute video explains the basics of choosing an AWS database.

Understand

Databases are important backend systems for storing data for any type of application—whether it's a small mobile app or an enterprise application with internet-scale and real-time requirements.

This decision guide is designed to help you understand the range of choices that are available, establish the criteria for making your database choice, and provide you with detailed information on the unique properties of each database. Then you can learn more about the capabilities that each database offers.

Overview of AWS database services.

What kinds of applications do people build using AWS databases?

  • Internet-scale applications: These applications can handle millions of requests per second over hundreds of terabytes of data. They automatically scale vertically and horizontally to provide flexibility for your workloads.

  • Real-time applications: Real-time applications such as caching, session stores, gaming leaderboards, ride hailing, ad targeting, and real-time analytics need microsecond latency and high throughput to support millions of requests per second.

  • Enterprise applications: Enterprise applications manage core business processes (such as sales, billing, customer service, and human resources) and line-of-business processes (such as a reservation system at a hotel chain or a risk-management system at an insurance company). These applications need databases that are fast, scalable, secure, available, and reliable.

  • Generative AI applications: Your data is the key to moving from generic applications to generative AI applications that create differentiating value for your customers and their business. Often, this data is stored in operational databases that power your applications.

Note

This guide focuses on databases that are suitable for online transaction processing (OLTP) applications. If you need to store and analyze massive amounts of data quickly and efficiently (a requirement that is typically met by an OLAP application), AWS offers Amazon Redshift. Amazon Redshift is a fully managed, cloud-based data warehousing service that is designed to handle large-scale analytics workloads. 

There are two high-level categories of AWS OLTP databases—relational and non-relational.

  • The AWS relational database family includes eight popular engines for Amazon RDS and Amazon Aurora. The Amazon Aurora engines include Amazon Aurora with MySQL compatibility, and Amazon Aurora with PostgreSQL compatibility. The Amazon RDS engines include Db2, MySQL, MariaDB, PostgreSQL, Oracle, and SQL Server. AWS also offers deployment options such as Amazon RDS Custom and Amazon RDS on AWS Outposts.

  • The non-relational database options are designed for specific data models. These include key-value, document, caching, in-memory, graph, time series, wide-column, and ledger data models.

We explore all of these in detail in the Choose section of this guide.

Database migration

Before deciding which database service you want to use, you should consider your business objective, database selection, and how you're going to migrate your existing databases.

The best database migration strategy helps you to take full advantage of the AWS Cloud. This might involve migrating your applications to use purpose-built cloud databases. You might just want the benefit of using a fully managed version of your existing database, such as RDS for PostgreSQL or RDS for MySQL.

Alternatively, you might want to migrate from your commercially licensed databases, such as Oracle or SQL Server, to Amazon Aurora. Consider modernizing your applications and choosing the databases that best suit your applications' workflow requirements.

For example, if you choose to first transition your applications and then transform them, you might decide to re-platform. This process makes no changes to the application that you use, but lets you take advantage of a fully managed service in the cloud. When your databases are fully in the AWS Cloud, you can start working to modernize your application. This strategy can help you exit your current on-premises environment quickly, and then focus on modernization.

The following image shows how you can use AWS Database Migration Service to move data to Amazon Aurora. For resources to help with your migration strategy, see the Explore section.

Example that shows how you can use AWS Database Migration Service to move data to Amazon Aurora.

Consider

You're considering hosting a database on AWS. This might be to support a greenfield/pilot project as a first step in your cloud migration journey, or you might want to migrate an existing workload with as little disruption as possible. Or perhaps you might want to port your workload to managed AWS services, or even refactor it to be fully cloud focused.

Whatever your goal is, considering the right questions will make your database decision easier. Here's a summary of the key criteria to consider.

Business objective

The first major consideration when choosing your database is your business objective. What is the strategic direction that is driving your organization to change? As suggested in the 7 Rs of AWS, consider whether you want to rehost an existing workload, or refactor to a new platform so that you don't have to commit to commercial licenses.

Migration strategy

You can choose a rehosting strategy to deploy to the cloud faster, with fewer data migration problems. Install your database engine software on Amazon EC2, migrate your data, and manage your database in a similar way to how you manage on premises. While rehosting is a fast path to the cloud, you're still left with the operational tasks such as upgrades, patches, backups, capacity planning and management, maintaining performance, and availability targets.

Alternatively, you can choose a re-platform strategy where you migrate your on-premises relational database to a fully managed Amazon RDS instance.

You might consider that this an opportunity to refactor your workload to be cloud focused. For example, you could use Amazon Aurora or purpose-built NoSQL databases such as Amazon DynamoDB, Amazon Neptune, or Amazon DocumentDB (with MongoDB compatibility).

Finally, AWS offers serverless databases, which can scale to an application's demands with a pay-for-use pricing model and built-in high availability. With serverless databases, you can increase agility and optimize costs. In addition to removing the need to provision, patch, or manage servers, many AWS serverless databases provide maintenance options that reduce downtime.

AWS serverless offerings include Amazon Aurora Serverless, Amazon DynamoDB, Amazon ElastiCache, Amazon Keyspaces (for Apache Cassandra), Amazon Timestream, and Amazon Neptune Serverless.

Data considerations

The core of any database choice includes the characteristics of the data that you need to store, retrieve, analyze, and work with. This includes:

  • Your data model. For example, is it relational, structured, semi-structured, time series, or using a highly connected dataset?

  • Data access. How do you need to access your data?

  • The extent to which you need real-time data.

  • Whether there is a particular data record size you have in mind.

Operational considerations

Your primary operational considerations are where your data is going to be located and how it will be managed. The two key choices that you need to make are:

  • Will your database be self-hosted or fully managed?: The core question here is where is your team going to provide the most value to the business? If your database is self-hosted, you'll be responsible for the day-to-day maintenance, monitoring, and patching of the database.

    Choosing a fully managed AWS database simplifies your work by removing undifferentiated database management tasks. This option allows your team to focus on delivering value by improving schema design, query construction, and query optimization. Your team can also develop applications that align with your business objectives.

  • Do you need a serverless or provisioned database?: DynamoDB, Amazon Keyspaces, Timestream, ElastiCache, Neptune, and Aurora provide models for how to think about this choice. Amazon Aurora Serverless v2, for example, is suitable for highly variable workloads. For example, your database usage might be heavy for a short period of time, followed by long periods of light activity or no activity at all.

Resiliency considerations

Database resiliency is key for any business. To achieve resiliency, you have to pay attention to a number of key factors, including capabilities for backup and restore, replication, failover, and point-in-time recovery (PITR).

Performance considerations

Consider whether your database will need to support a high concurrency of transactions (10,000 or more), and whether it needs to be deployed in multiple geographic regions.

If your workload requires extremely high read performance with a response time measured in microseconds (rather than single-digit milliseconds), you might want to consider using in-memory caching solutions such as Amazon ElastiCache alongside your database, or a database that supports in-memory data access such as MemoryDB.

Security considerations

Security is a shared responsibility between AWS and you. The AWS shared responsibility model describes this as security of the cloud that AWS manages, and security in the cloud that the customer manages. Specific security considerations include data protection at all levels of your data, authentication, compliance, data security, storage of sensitive data, and support for auditing requirements.

Choose

Now that you know the criteria for evaluating your database options, you're ready to choose which AWS database services might be a good fit for your organization.

This table highlights the type of data that each database is optimized to handle. Use it to help determine the database that is the best fit for your use case.

Database type When would you use it? What is it optimized for? Related database engines or services
Relational

Use when you're migrating or modernizing an on-premises relational workload, or if your workload has less predictable query patterns.

Optimized for structured data that is stored in tables, rows, and columns. Relational databases support complex queries through joins.

Amazon Aurora

Amazon RDS

Key-value

Use for workloads such as session stores or shopping carts. Key-value databases can scale to large amounts of data and extremely high throughput of requests, while servicing millions of simultaneous users through distributed processing and storage.

Optimized for consistent single-digit millisecond performance at any scale (meaning any number of writes and reads).

Amazon DynamoDB

In-memory

Use Amazon ElastiCache when you need a caching layer to improve read performance. Use Amazon MemoryDB when you need full data persistence, but still need sub-millisecond read latencies.

Optimized to support microsecond reads and sub-millisecond writes. MemoryDB supports microsecond reads and single-digit millisecond writes. ElastiCache is an ephemeral cache, while MemoryDB is an in-memory database.

Amazon ElastiCache

Amazon MemoryDB

Document

Use when you want to store JSON-like documents with rich querying abilities across the fields of the documents.

Optimized for storing semi-structured data as documents with multilayered attributes.

Amazon DocumentDB (with MongoDB compatibility)

Wide-column

Use when you need to migrate your on-premises Cassandra workloads, or when you need to process data at high speeds for applications that require single-digit millisecond latency.

Optimized for workloads that require heavy reads/writes and high throughput, coupled with low latency and linear scalability.

Amazon Keyspaces (for Apache Cassandra)

Graph

Use when you have to model complex networks of objects, such as social networks, fraud detection, and recommendation engine use cases.

Optimized for traversing and evaluating large numbers of relationships, and identifying patterns with minimal latency.

Amazon Neptune

Time series

Use when you have a large amount of time series data, potentially from a number of sources, such as Internet of Things (IoT) data, application metrics, and asset tracking.

Optimized for storing and querying data that is associated with timestamps and trend lines.

Amazon Timestream

Ledger

Use when your organization has to communicate with other entities (businesses, customers) and you need a way to verify and trust each other—or when you need to retrieve the current state of data, and also need to prove how data mutated into the current state.

Optimized for maintaining a complete, immutable, and verifiable history of database changes.

Amazon Quantum Ledger Database (Amazon QLDB)

Use

This section helps you learn more about the database service or services that you've chosen, and how to get started with them.

The database you've chosen might not satisfy all of your requirements perfectly, so it's important to consider your needs and workload requirements carefully.

Prioritize based on the considerations covered in this guide, the requirements that you must meet to the highest standard, the requirements that you have some flexibility on, and the requirements that you might not need to meet. This will help you make effective trade-offs and lead to the best possible outcome for your unique circumstances.

Also consider that, usually, you can cover your application requirements with a mix of best-fit databases. By building a solution with multiple database types, you can use the strengths that each type provides.

For example, in an ecommerce use case, you might use Amazon DocumentDB (for product catalogs and user profiles) for the flexibility that is provided by semi-structured data—but also for the low, predictable latency provided by DynamoDB when your users are browsing your product catalog. You might also use Aurora for inventory and order processing, where a relational data model and transaction support are needed.

To help you learn more about each of the available AWS database services, we have provided a pathway to explore how each of the services work. The following section provides links to in-depth documentation, hands-on tutorials, and resources to help you get started.

Amazon Aurora
Aurora icon

Getting started with Amazon Aurora

This guide includes tutorials and covers more advanced Aurora concepts and procedures, such as the different kinds of endpoints and how to scale Aurora clusters up and down.

Explore the guide

Aurora icon

Create a highly available database

Learn how to configure an Amazon Aurora cluster to create a highly available database. This database consists of compute nodes that are replicated across multiple Availability Zones to provide increased read scalability and failover protection.

Get started with the tutorial

Aurora icon

Use Amazon Aurora global databases

We help you get started using Aurora global databases. This guide outlines the supported engines and AWS Region availability for Aurora global databases with Aurora MySQL and Aurora PostgreSQL.

Explore the guide

Amazon RDS
Amazon RDS icon

Getting started with Amazon RDS

We explain how to create and connect to a DB instance using Amazon RDS. You learn to create a DB instance that uses DB2, MariaDB, MySQL, Microsoft SQL Server, Oracle, or PostgreSQL.

Explore the guide

Amazon RDS icon

Create and connect to a PostgreSQL database

We show you how to create an environment to run your PostgreSQL database (we call this environment a DB instance), connect to the database, and delete the DB instance.

Get started with the tutorial

Amazon RDS icon

Create a web server and an Amazon RDS DB instance

Learn how to install an Apache web server with PHP and create a MySQL database. The web server runs on an Amazon EC2 instance using Amazon Linux. The MySQL database is a MySQL DB instance.

Get started using the tutorial

Amazon DocumentDB
Amazon DocumentDB icon

Getting started with Amazon DocumentDB

We help you get started using Amazon DocumentDB in just seven steps. This guide uses AWS Cloud9 to connect and query your cluster using the MongoDB shell directly from the AWS Management Console.

Explore the guide

Amazon DocumentDB icon

Setting up a document database with Amazon DocumentDB

This tutorial helps you to get started connecting to your Amazon DocumentDB cluster from your AWS Cloud9 environment with a MongoDB shell, and then run a few queries.

Get started with the tutorial

Amazon DocumentDB icon

Best practices for working with Amazon DocumentDB

Learn best practices for working with Amazon DocumentDB, along with the basic operational guidelines when working with it.

Explore the guide

Amazon DocumentDB icon

Migrate from MongoDB to Amazon DocumentDB  

Learn how to migrate an existing self-managed MongoDB database to a fully managed database on Amazon DocumentDB.

Get started with the tutorial

Amazon DocumentDB icon

Assessing MongoDB compatibility  

Use the Amazon DocumentDB compatibility tool to help you assess the compatibility of a MongoDB application by using the application's source code or MongoDB server profile logs.

Use the tool

Amazon DynamoDB
DynamoDB icon

What is Amazon DynamoDB?

This guide explains how you can use this fully managed NoSQL database service to offload the administrative burdens of operating and scaling a distributed database (including hardware provisioning, setup and configuration, replication, software patching, and cluster scaling).

Explore the guide

DynamoDB icon

Getting started with DynamoDB and the AWS SDKs

We help you get started with Amazon DynamoDB and the AWS SDKs. This guide includes hands-on tutorials that show you how to run code examples in DynamoDB.

Explore the guide

DynamoDB icon

Create and query a NoSQL table with Amazon DynamoDB

Use these hands-on tutorials to get started with Amazon DynamoDB and the AWS SDKs. You can run the code examples on either the downloadable version of DynamoDB or the DynamoDB web service.

Get started with the tutorials

DynamoDB icon

Create an Amazon DynamoDB table

We show you how to create a DynamoDB table and use the table to store and retrieve data. This tutorial uses an online bookstore application as a guiding example.

Get started with the tutorial

Amazon ElastiCache
ElastiCache icon

Documentation for Amazon ElastiCache  

Explore the full set of Amazon ElastiCache documentation, including user guides for ElastiCache (Redis OSS) and ElastiCache (Memcached), as well as specific AWS CLI and API references.

Explore the guide

ElastiCache icon

Getting started with Amazon ElastiCache (Redis OSS)  

Learn how to create, grant access to, connect to, and delete a Redis (cluster mode disabled) cluster using the Amazon ElastiCache console.

Explore the guide

ElastiCache icon

Build a fast session store for an online application  

Learn how to use Amazon ElastiCache (Redis OSS) as a distributed cache for session management. You will also learn the best practices for configuring your ElastiCache nodes and how to handle the sessions from your application.

Get started with the tutorial

ElastiCache icon

Setting up a Redis cluster for scalability and high availability

Learn how to create and configure a Redis cluster with ElastiCache (Redis OSS) version 7.0 with TLS encryption enabled. With cluster mode enabled, your Redis cluster gains enhanced scalability and high availability.

Get started with the tutorial

Amazon MemoryDB
MemoryDB icon

Getting started with Amazon MemoryDB

We guide you through the steps to create, grant access to, connect to, and delete a MemoryDB cluster using the MemoryDB console.

Use the guide

MemoryDB icon

Getting started using Amazon MemoryDB 

Learn how to simplify your architecture and use MemoryDB as a single, primary database instead of using a low-latency cache in front of a durable database.

Read the blog

MemoryDB icon

Integrating Amazon MemoryDB with Java-based AWS Lambda

We discuss some of the common use cases for the data store, Amazon MemoryDB, which is built to provide durability and faster reads and writes.

Read the blog

Amazon Keyspaces
Amazon Keyspaces icon

Getting started with Amazon Keyspaces (for Apache Cassandra)

This guide is for those who are new to Apache Cassandra and Amazon Keyspaces (for Apache Cassandra). It walks you through installing all the programs and drivers that you need to successfully use Amazon Keyspaces.

Explore the guide

Amazon Keyspaces icon

Beginner course on using Amazon Keyspaces (for Apache Cassandra)

Learn the benefits, typical use cases, and technical concepts of Amazon Keyspaces. You can try the service through the sample code provided or the interactive tool in the AWS Management Console.

Take the course (requires sign-in)

Amazon Neptune
Neptune icon

Getting started with Amazon Neptune

We help you get started using Amazon Neptune, a fully managed graph database service. This guide shows you how to create a Neptune database.

Explore the guide

Neptune icon

Build a fraud detection service using Amazon Neptune

We walk you through the steps to create a Neptune database, design your data model, and use the database in your application.

Explore the guide

Neptune icon

Build a recommendation engine with Amazon Neptune

We show you how to build a friend recommendation engine for a multiplayer game application using Amazon Neptune.

Explore the guide

Amazon Timestream
Timestream icon

Getting started with Amazon Timestream

We help you get started with Amazon Timestream. This guide provides instructions for setting up a fully functional sample application.

Explore the guide

Timestream icon

Best practices with Amazon Timestream

We explore best practices, including the practices that relate to data modeling, security, configuration, data ingestion, queries, client applications, and supported integrations.

Explore the guide

Timestream icon

Accessing Amazon Timestream using AWS SDKs 

Learn how to access Amazon Timestream using the AWS SDKs in the language of your choice: Java, Go, Python, Node.js, or .NET.

Explore the guide

Amazon QLDB
QLDB icon

Getting started with Amazon QLDB

In Amazon Quantum Ledger Database (Amazon QLDB), the journal is the core of the database. This guide provides a high-level overview of Amazon QLDB service components and how they interact.

Explore the guide

QLDB icon

Creating your first Amazon QLDB ledger

We guide you through the steps to create your first Amazon QLDB sample ledger and populate it with tables and sample data.

Get started with the tutorial

QLDB icon

Using an Amazon QLDB driver with an AWS SDK

Learn how to use the Amazon QLDB driver with an AWS SDK to create a QLDB ledger and populate it with sample data. The driver lets your application interact with QLDB using the transactional data API.

Get started with the tutorial

Explore

Architecture diagrams

Explore reference architecture diagrams to help you develop, scale, and test your databases on AWS.

Explore architecture diagrams

Whitepapers

Explore whitepapers to help you get started, learn best practices, and migrate your databases.

Explore whitepapers

AWS solutions

Explore vetted solutions and architectural guidance for common use cases for databases.

Explore solutions