Why use AWS for Modern Data analytics?
Customers build databases, data warehouses, and data lake solutions
in isolation from each other, each having its own separate data
ingestion, storage, management, and governance layers. These
disjointed efforts to build separate data stores often end up
creating data silos, data integration complexities, excessive data
movement, and data consistency issues. These issues prevent
customers from getting deeper insights. To overcome these issues and
easily move data around, AWS introduced a
Modern
Data approach
AWS provides a broad platform of managed services to help you build, secure, and seamlessly scale end-to-end data analytics applications quickly by using a Modern Data approach. There is no hardware to procure, no infrastructure to maintain and scale—only what you need to collect, store, process, and analyze your data. AWS offers analytical solutions specifically designed to handle this growing amount of data and provide insight into your business.
AWS purpose-built analytics services
AWS gives you the broadest and deepest portfolio of purpose-built
analytics services, including
Amazon Athena
For example,
Amazon Redshift delivers up to three times better price performance than
other cloud data warehouses
Scalable data lakes
Tens of thousands of customers run their data lakes on AWS.
Setting up and managing data lakes today involves a lot of manual
and time-consuming tasks.
AWS Lake Formation
For your data lake storage,
Amazon S3
-
Unmatched 99.999999999% of durability and 99.99% availability
-
The best security, compliance, and audit capabilities with object level audit logging and access control
-
The most flexibility with five storage tiers
-
The lowest cost with pricing that starts at less than $1 per TB per month
Amazon S3 gives you robust capabilities to manage access, cost, replication, and data protection.
Performance and cost-effectiveness
AWS is committed to providing the best performance at the lowest
cost across all analytics services, and it is continually
innovating to improve the price-performance of our services. In
addition to industry-leading price performance for analytics
services, S3 intelligent tiering saves you up to 70% on storage
cost for data stored in your data lake.
Amazon EC2
With
Amazon Redshift RA3 instances
Seamless data movement
As the data in your data lakes and purpose-built data stores continues to grow, you need to be able to easily move a portion of that data from one data store to another. AWS enables you to combine, move, and replicate data across multiple data stores and your data lake.
For example, AWS Glue
Amazon Redshift and Amazon Athena both support federated queries, the ability to run queries across data stored in operational databases, data warehouses, and data lakes to provide insights across multiple data sources with no data movement and no need to set up and maintain complex extract, transform, and load (ETL) pipelines.
Centralized governance
One of the most important pieces of a modern analytics architecture is the ability for customers to authorize, manage, and audit access to data. This can be challenging, because managing security, access control, and audit trails across all of the data stores in your organization is complex, time-consuming, and error-prone. With capabilities like centralized access control and policies, and column-level filtering of data, no other analytics provider gives you the governance capability to manage access to all of your data across your data lake and your purpose-built data stores from a single place.
With capabilities like centralized access control and policies combined with column and row-level filtering, AWS Lake Formation gives you the fine-grained access control and governance to manage access to data across a data lake and purpose-built data stores from a single point of control.
AWS announced the preview of
row-level
security for AWS Lake Formation