This whitepaper is for historical reference only. Some content might be outdated and some links might not be available.
Analytics architecture
Analytics pipelines are designed to handle large volumes of incoming streams of data from heterogeneous sources such as databases, applications, and devices.
A typical analytics pipeline has the following stages:
-
Collect data
-
Store the data
-
Process the data
-
Analyze and visualize the data

Analytics pipeline
Data collection
At the data collection stage, consider that you probably have different types of data, such as transactional data, log data, streaming data, and Internet of Things (IoT) data. AWS provides solutions for data storage for each of these types of data.
Log data
Reliably capturing system-generated logs helps you troubleshoot issues, conduct
audits, and perform analytics using the information stored in the logs. Amazon S3
Streaming data
Web applications, mobile devices, and many software applications and services can
generate staggering amounts of streaming
data
IoT data
Devices and sensors around the world send messages continuously. Enterprises today
need to capture this data and derive intelligence from it. Using AWS IoT