Amazon Redshift Best Practices for Loading Data
Topics
- Take the Loading Data Tutorial
- Take the Tuning Table Design Tutorial
- Use a COPY Command to Load Data
- Use a Single COPY Command to Load from Multiple Files
- Split Your Load Data into Multiple Files
- Compress Your Data Files
- Use a Manifest File
- Verify Data Files Before and After a Load
- Use a Multi-Row Insert
- Use a Bulk Insert
- Load Data in Sort Key Order
- Load Data in Sequential Blocks
- Use Time-Series Tables
- Use a Staging Table to Perform a Merge (Upsert)
- Schedule Around Maintenance Windows
Loading very large datasets can take a long time and consume significant computing resources. How you load your data can also affect query performance. This section presents best practices for loading data efficiently using COPY commands, bulk inserts, and staging tables.
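As a rough illustration of how several of these practices fit together, the following Python sketch builds a manifest that lists multiple compressed data files and a single COPY statement that loads them all. It only constructs text and does not connect to a cluster. The bucket, key prefix, table name, IAM role ARN, pipe delimiter, and gzip compression are all hypothetical values chosen for the example, not values taken from this guide.

```python
import json


def build_manifest(bucket, keys):
    """Return a manifest dict that lists every data file to load.

    Marking each entry as mandatory asks COPY to fail if a listed
    file is missing, rather than silently skipping it.
    """
    return {
        "entries": [
            {"url": f"s3://{bucket}/{key}", "mandatory": True}
            for key in keys
        ]
    }


def build_copy_statement(table, manifest_url, iam_role):
    """Return one COPY statement that loads all files named in the manifest.

    Assumes gzip-compressed, pipe-delimited files (an assumption for
    this sketch). Inputs are trusted here; this is not safe for
    untrusted strings.
    """
    return (
        f"COPY {table}\n"
        f"FROM '{manifest_url}'\n"
        f"IAM_ROLE '{iam_role}'\n"
        "MANIFEST\n"
        "GZIP\n"
        "DELIMITER '|';"
    )


# Hypothetical inputs: four gzip-compressed parts of one load.
keys = [f"load/sales/part-{i:04d}.gz" for i in range(4)]
manifest = build_manifest("example-bucket", keys)
copy_sql = build_copy_statement(
    "sales",
    "s3://example-bucket/load/sales/manifest.json",
    "arn:aws:iam::123456789012:role/ExampleRedshiftRole",
)

print(json.dumps(manifest, indent=2))
print(copy_sql)
```

In practice the manifest would be uploaded to Amazon S3 at the URL named in the COPY statement, and the statement would then be run once against the cluster, instead of issuing one COPY per file.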