Architecture details
This section describes the components and AWS services that make up this solution and how they work together.
AWS services in this solution
AWS service | Description |
---|---|
Amazon Comprehend | Core. Derives insights from the ingested data during the machine learning (ML) workflow. |
AWS Lambda | Core. Provides the logic for ingesting and processing data from social media and RSS feeds. |
Amazon QuickSight | Core. Provides a topic analysis dashboard that aggregates and contextualizes data from an ingestion source. |
Amazon Rekognition | Core. Analyzes images, detects entities in images (currently only JPEG images are supported), and extracts embedded text from images. |
AWS Step Functions | Core. Provides a workflow that orchestrates the AWS machine learning services used by this solution. |
Amazon Translate | Core. Translates data from other languages into English for the ML processing workflow. |
Amazon Athena | Supporting. Works with QuickSight as an analytics tool to query and analyze data. |
Amazon DynamoDB | Supporting. Provides high throughput with single-digit millisecond latency, and resiliency through auto scaling and on-demand capacity. |
Amazon EventBridge | Supporting. Monitors for data added to S3 buckets, which makes the application event-driven. |
AWS Glue | Supporting. Serverless data integration service that works with QuickSight and makes it easier to discover, prepare, move, and integrate data from multiple sources for analytics, ML, and application development. |
AWS Identity and Access Management | Supporting. Manages identities and controls access to AWS services and resources. |
Amazon Data Firehose | Supporting. Fully managed service that delivers real-time streaming data to Amazon S3 buckets. |
Amazon Kinesis Data Streams | Supporting. Buffers data for the Lambda functions, which makes the architecture resilient to data bursts and spikes. |
Amazon Simple Queue Service | Supporting. Provides queues and dead-letter queues (DLQs) for processing ingested data. |
Amazon S3 | Supporting. Stores raw data for ingestion and topic modeling. |
AWS Systems Manager | Supporting. Provides application-level resource monitoring and visualization of resource operations and cost data. |
Amazon Macie | Optional. Discovers sensitive data and protects it through redaction before the data enters the solution's workflow. |
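
To illustrate how Kinesis Data Streams and Lambda fit together, the following is a minimal sketch of a Lambda function that consumes a batch of buffered records. It is not taken from this solution's source code. The record layout (`Records[].kinesis.data`, base64-encoded) is the standard shape of a Kinesis-triggered Lambda event. The payload fields `id`, `text`, and `lang`, and the helper names `decode_kinesis_records` and `make_event`, are assumptions made for this example.

```python
import base64
import json


def decode_kinesis_records(event):
    """Return the JSON payloads carried by a Kinesis-triggered Lambda event.

    Kinesis delivers each record's data base64-encoded under
    event["Records"][i]["kinesis"]["data"].
    """
    payloads = []
    for record in event.get("Records", []):
        raw = base64.b64decode(record["kinesis"]["data"])
        payloads.append(json.loads(raw))
    return payloads


def lambda_handler(event, context):
    """Hypothetical consumer: decode the batch and count items needing translation."""
    items = decode_kinesis_records(event)
    # Assumption: each payload has a "lang" field. Items that are not in
    # English would be sent to Amazon Translate before ML processing.
    needs_translation = [item for item in items if item.get("lang", "en") != "en"]
    return {"received": len(items), "needs_translation": len(needs_translation)}


def make_event(payloads):
    """Build a minimal Kinesis-style event for local testing."""
    return {
        "Records": [
            {
                "kinesis": {
                    "data": base64.b64encode(
                        json.dumps(p).encode("utf-8")
                    ).decode("ascii")
                }
            }
            for p in payloads
        ]
    }
```

Because the stream delivers records in batches, the handler processes a list rather than a single item. Calling `lambda_handler(make_event([...]), None)` locally exercises the decoding path without any AWS resources.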