Query best practices for Amazon Redshift

Ethan Stark, Amazon Web Services (AWS)

June 2024 (document history)

Overview

This guide provides recommendations and best practices for optimizing query and table performance in Amazon Redshift. You can use Amazon Redshift to query petabytes of structured and semi-structured data across your data warehouse and your data lake by using standard SQL. This guide also provides an overview of the core architecture components of an Amazon Redshift data warehouse. This knowledge—along with an understanding of query performance factors such as table properties, cluster configuration, and query structure—can help you design efficient and effective tables and queries for your Amazon Redshift data warehouse.

Intended audience

This guide is intended for data engineers, data architects, and data analysts who design or use tables and queries in Amazon Redshift.

Objectives

This guide can help you and your organization achieve the following objectives:

Design tables for optimal data storage and retrieval operations
Design queries for optimal performance and cost savings
Optimize the performance of Amazon Redshift Spectrum to query data directly from files on Amazon Simple Storage Service (Amazon S3)

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Architecture components