Amazon Redshift best practices for loading data

Topics

Take the loading data tutorial
Use a COPY command to load data
Use a single COPY command to load from multiple files
Loading data files
Compressing your data files
Verify data files before and after a load
Use a multi-row insert
Use a bulk insert
Load data in sort key order
Load data in sequential blocks
Use time-series tables
Schedule around maintenance windows

Loading very large datasets can take a long time and consume a lot of computing resources. How your data is loaded can also affect query performance. This section presents best practices for loading data efficiently using COPY commands, bulk inserts, and staging tables.
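
As a rough illustration of the first practices in the list above, the following Python sketch assembles one COPY statement that loads every compressed file sharing an Amazon S3 key prefix. The table name, bucket, prefix, and IAM role ARN are hypothetical placeholders, and the helper `build_copy_statement` is not part of any AWS library. It only builds the SQL text; running the statement would require a database connection, which is not shown here.

```python
def build_copy_statement(table, s3_prefix, iam_role_arn, gzip=True, delimiter="|"):
    """Build a single COPY statement covering every file under one S3 prefix.

    Values are interpolated directly into the SQL text, so pass only trusted,
    hard-coded values (this is an illustration, not an injection-safe builder).
    """
    options = [f"IAM_ROLE '{iam_role_arn}'", f"DELIMITER '{delimiter}'"]
    if gzip:
        # Tell COPY that the source files are gzip-compressed.
        options.append("GZIP")
    return f"COPY {table} FROM '{s3_prefix}' " + " ".join(options) + ";"


# Hypothetical names: the table, bucket, prefix, and role ARN are placeholders.
statement = build_copy_statement(
    "sales",
    "s3://example-bucket/sales/part-",
    "arn:aws:iam::123456789012:role/ExampleRedshiftRole",
)
print(statement)
```

Issuing one COPY against a shared prefix, rather than one COPY per file, is the pattern the topic "Use a single COPY command to load from multiple files" refers to.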
