Architecture details - Discovering Hot Topics Using Machine Learning

Architecture details

This section describes the components and AWS services that make up this solution and the architecture details on how these components work together.

AWS services in this solution

AWS service Description
Amazon Comprehend Core. Derive and understand valuable insights from data sources during the machine learning (ML) workflow.
AWS Lambda Core. Provides logic for data ingestion and processing from social media and RSS feed.
Amazon QuickSight Core. Provides a topic analysis dashboard for aggregating and contextualizing data from an ingestion source.
Amazon Rekognition Core. This solution also uses Amazon Rekognition to analyze images, detect entities in images (currently only JPEG images are supported), and extract embedded text from images.
AWS Step Functions Core. Provides a workflow that orchestrates Amazon Machine Learning capabilities.
Amazon Translate Core. Translate data sources from different languages to English for ML processing workflow.
Amazon Athena Supporting. Works with QuickSight as an analytics tool to query and analyze data.
Amazon DynamoDB Supporting. Provides higher throughput with sub-millisecond latency and resiliency through automatic scaling and on-demand scaling.
Amazon EventBridge Supporting. Monitors for data added into S3 buckets to create an event-driven application.
AWS Glue Supporting. Works with QuickSight as a serverless data integration service that makes it easier to discover, prepare, move, and integrate data from multiple sources for analytics, ML, and application development.
AWS Identity and Access Management Supporting. Manages identity and provides access to different AWS services and resources.
Amazon Data Firehose Supporting. A fully managed service for delivering real-time streaming data to Amazon S3 buckets.
Amazon Kinesis Data Streams Supporting. Provides data buffering to Lambda functions that makes the architecture resilient to data bursts and spikes.
Amazon Simple Queue Service Supporting. Creates queues and DLQ's for processing data information and sources.
Amazon S3 Supporting. Provides storage for raw data for ingestion and topic modeling.
AWS Systems Manager Supporting. Provides application-level resource monitoring and visualization of resource operations and cost data.
Amazon Macie Optional. Discovers and protects sensitive data by redaction before processing the data into the solution's workflow.