SUS03-BP05 Use software patterns and architectures that best support data access and storage patterns
Understand how data is used within your workload, consumed by your users, transferred, and stored. Use software patterns and architectures that best support data access and storage to minimize the compute, networking, and storage resources required to support the workload.
Common anti-patterns:
- You assume that all workloads have similar data storage and access patterns.
- You only use one tier of storage, assuming all workloads fit within that tier.
- You assume that data access patterns will stay consistent over time.
- Your architecture supports a potential high data access burst, which results in the resources remaining idle most of the time.
Benefits of establishing this best practice: Selecting and optimizing your architecture based on data access and storage patterns will help decrease development complexity and increase overall utilization. Understanding when to use global tables, data partitioning, and caching will help you decrease operational overhead and scale based on your workload needs.
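As an illustration of the caching benefit mentioned above, the following is a minimal sketch, not a recommended implementation. It assumes a read-heavy workload where the same keys are requested repeatedly. The `fetch_from_store` function and the `store_reads` counter are hypothetical stand-ins for a real database or object store read, and Python's built-in `functools.lru_cache` is used here only as a simple in-process cache.

```python
from functools import lru_cache

# Counts how often the simulated backing store is actually read.
store_reads = {"count": 0}


def fetch_from_store(key: str) -> str:
    """Hypothetical stand-in for a database or object store read."""
    store_reads["count"] += 1
    return f"value-for-{key}"


@lru_cache(maxsize=1024)
def get(key: str) -> str:
    """Serve repeated reads from memory; only cache misses reach the store."""
    return fetch_from_store(key)
```

With this sketch, requesting the keys `a, b, a, a, b, c` in order reaches the simulated store three times instead of six. An in-process cache like this one does not invalidate entries when the underlying data changes, so it only suits data that tolerates some staleness.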
Level of risk exposed if this best practice is not established: Medium
Implementation guidance
To improve long-term workload sustainability, use architecture patterns that support data access and storage characteristics for your workload. These patterns help you efficiently retrieve and process data. For example, you can use modern data architecture on AWS.
Implementation steps
- Understand data characteristics: Analyze your data characteristics and access patterns to identify the correct configuration for your cloud resources. Key characteristics to consider include:
  - Data type: structured, semi-structured, unstructured
  - Data growth: bounded, unbounded
  - Data durability: persistent, ephemeral, transient
  - Access patterns: reads or writes, update frequency, spiky or consistent
- Use optimal architecture patterns: Use architecture patterns that best support data access and storage patterns.
- Use purpose-built services: Use technologies that are fit-for-purpose.
  - Use technologies that work natively with compressed data.
  - Use purpose-built analytics services for data processing in your architecture. For details on AWS purpose-built analytics services, see AWS re:Invent 2022 - Building modern data architectures on AWS.
  - Use the database engine that best supports your dominant query pattern. Manage your database indexes for efficient querying. For further details, see AWS Databases and AWS re:Invent 2022 - Modernize apps with purpose-built databases.
- Minimize data transfer: Select network protocols that reduce the amount of network capacity consumed in your architecture.
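The first step above, understanding access patterns, can be sketched in code. The following is a rough illustration under stated assumptions, not a definitive method: it assumes you can export access events as `(timestamp_seconds, operation)` pairs, and the names `AccessProfile`, `profile_access`, and `describe`, along with the 0.8, 0.2, and 3.0 thresholds, are hypothetical choices made for this example only.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, Tuple

# Hypothetical event shape: (unix timestamp in seconds, "read" or "write").
AccessEvent = Tuple[int, str]


@dataclass
class AccessProfile:
    reads: int
    writes: int
    read_ratio: float    # reads divided by all operations
    peak_to_mean: float  # busiest bucket divided by the average bucket


def profile_access(events: Iterable[AccessEvent],
                   bucket_seconds: int = 60) -> AccessProfile:
    """Summarize the read/write mix and burstiness of a set of access events."""
    ops = Counter()
    buckets = Counter()
    for timestamp, operation in events:
        if operation not in ("read", "write"):
            raise ValueError(f"unexpected operation: {operation!r}")
        ops[operation] += 1
        buckets[timestamp // bucket_seconds] += 1

    total = ops["read"] + ops["write"]
    if total == 0:
        raise ValueError("no events to profile")

    # Idle buckets inside the observed window count toward the average.
    window = max(buckets) - min(buckets) + 1
    mean_per_bucket = total / window
    peak = max(buckets.values())
    return AccessProfile(
        reads=ops["read"],
        writes=ops["write"],
        read_ratio=ops["read"] / total,
        peak_to_mean=peak / mean_per_bucket,
    )


def describe(profile: AccessProfile, spiky_threshold: float = 3.0) -> str:
    """Label a profile using illustrative thresholds (assumptions, not AWS guidance)."""
    if profile.read_ratio >= 0.8:
        mix = "read-heavy"
    elif profile.read_ratio <= 0.2:
        mix = "write-heavy"
    else:
        mix = "mixed"
    shape = "spiky" if profile.peak_to_mean >= spiky_threshold else "consistent"
    return f"{mix}, {shape}"
```

A profile like this is only a starting point for choosing a storage tier, database engine, or caching approach. Because access patterns change over time, it is worth recomputing it periodically rather than treating one measurement as permanent.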
Resources
Related videos:
- AWS re:Invent 2022 - Building data mesh architectures on AWS
- AWS re:Invent 2023 - Deep dive into Amazon Aurora and its innovations
- AWS re:Invent 2023 - Improve Amazon EBS efficiency and be more cost-efficient
- AWS re:Invent 2023 - Optimizing storage price and performance with Amazon S3
- AWS re:Invent 2023 - Building and optimizing a data lake on Amazon S3
- AWS re:Invent 2023 - Advanced event-driven patterns with Amazon EventBridge
Related examples: