AWS Storage Optimization
AWS Whitepaper

Identify Your Data Storage Requirements

To optimize storage, the first step is to understand the performance profile for each of your workloads. You should conduct a performance analysis to measure input/output operations per second (IOPS), throughput, and other variables.

AWS storage services are optimized for different storage scenarios—there is no single data storage option that is ideal for all workloads. When evaluating your storage requirements, consider data storage options for each workload separately.

The following questions can help you segment data within each of your workloads and determine your storage requirements:

  • How often and how quickly do you need to access your data? AWS offers storage options and pricing tiers for frequently accessed, less frequently accessed, and infrequently accessed data.

  • Does your data store require high IOPS or throughput? AWS provides categories of storage that are optimized for performance and throughput. Understanding IOPS and throughput requirements will help you provision the right amount of storage and avoid overpaying.

  • How critical (durable) is your data? Critical or regulated data needs to be retained at almost any expense and tends to be stored for a long time.

  • How sensitive is your data? Highly sensitive data needs to be protected from accidental and malicious changes, not just data loss or corruption. Durability, cost, and security are equally important to consider.

  • How large is your data set? Knowing the total size of the data set helps in estimating storage capacity and cost.

  • How transient is your data? Transient data is short-lived and typically does not require high durability. (Note: Durability refers to average annual expected data loss.) Clickstream and Twitter data are good examples of transient data.

  • How much are you prepared to pay to store the data? Setting a budget for data storage will inform your decisions about storage options.