Develop a fully automated chat-based assistant by using Amazon Bedrock agents and knowledge bases
Created by Jundong Qiao (AWS), Kara Yang (AWS), Kiowa Jackson (AWS), Noah Hamilton (AWS), Praveen Kumar Jeyarajan (AWS), and Shuai Cao (AWS)
Code repository: genai-bedrock-agent-chatbot | Environment: PoC or pilot | Technologies: Machine learning & AI; Serverless |
AWS services: Amazon Bedrock; AWS CDK; AWS Lambda |
Summary
Many organizations face challenges when creating a chat-based assistant that is capable of orchestrating diverse data sources to offer comprehensive answers. This pattern presents a solution for developing a chat-based assistant that is capable of answering queries from both documentation and databases, with a straightforward deployment.
Starting with Amazon Bedrock, this fully managed generative artificial intelligence (AI) service provides a wide array of advanced foundation models (FMs). This facilitates the efficient creation of generative AI applications with a strong focus on privacy and security. In the context of documentation retrieval, the Retrieval Augmented Generation (RAG) is a pivotal feature. It uses knowledge bases to augment FM prompts with contextually relevant information from external sources. An Amazon OpenSearch Serverless index serves as the vector database behind the knowledge bases for Amazon Bedrock. This integration is enhanced through careful prompt engineering to minimize inaccuracies and make sure that responses are anchored in factual documentation. For database queries, the FMs of Amazon Bedrock transform textual inquiries into structured SQL queries, incorporating specific parameters. This enables the precise retrieval of data from databases managed by AWS Glue databases. Amazon Athena is used for these queries.
For handling more intricate queries, achieving comprehensive answers demands information sourced from both documentation and databases. Agents for Amazon Bedrock is a generative AI feature that helps you build autonomous agents that can understand complex tasks and break them down into simpler tasks for orchestration. The combination of insights retrieved from the simplified tasks, facilitated by Amazon Bedrock autonomous agents, enhances the synthesis of information, leading to more thorough and exhaustive answers. This pattern demonstrates how to build a chat-based assistant by using Amazon Bedrock and the related generative AI services and features within an automated solution.
Prerequisites and limitations
Prerequisites
An active AWS account
Docker, installed
AWS Cloud Development Kit (AWS CDK), installed and bootstrapped to the
us-east-1
orus-west-2
AWS RegionsAWS CDK Toolkit version 2.114.1 or later, installed
AWS Command Line Interface (AWS CLI), installed and configured
Python version 3.11 or later, installed
In Amazon Bedrock, enable access to Claude 2, Claude 2.1, Claude Instant, and Titan Embeddings G1 – Text
Limitations
This solution is deployed to a single AWS account.
This solution can be deployed only in AWS Regions where Amazon Bedrock and Amazon OpenSearch Serverless are supported. For more information, see the documentation for Amazon Bedrock and Amazon OpenSearch Serverless.
Product versions
Llama-index version 0.10.6 or later
Sqlalchemy version 2.0.23 or later
Opensearch-py version 2.4.2 or later
Requests_aws4auth version 1.2.3 or later
AWS SDK for Python (Boto3) version 1.34.57 or later
Architecture
Target technology stack
The AWS Cloud Development Kit (AWS CDK) is an open source software development framework for defining cloud infrastructure in code and provisioning it through AWS CloudFormation. The AWS CDK stack used in this pattern deploys the following AWS resources:
AWS Key Management Service (AWS KMS)
Amazon Simple Storage Service (Amazon S3)
AWS Glue Data Catalog, for the AWS Glue database component
AWS Lambda
AWS Identity and Access Management (IAM)
Amazon OpenSearch Serverless
Amazon Elastic Container Registry (Amazon ECR)
Amazon Elastic Container Service (Amazon ECS)
AWS Fargate
Amazon Virtual Private Cloud (Amazon VPC)
Target architecture
The diagram shows a comprehensive AWS cloud-native setup within a single AWS Region, using multiple AWS services. The primary interface for the chat-based assistant is a StreamlitInvocation
Lambda function, which then interfaces with agents for Amazon Bedrock. This agent responds to user inquiries by either consulting the knowledge bases for Amazon Bedrock or by invoking an Agent executor
Lambda function. This function triggers a set of actions associated with the agent, following a predefined API schema. The knowledge bases for Amazon Bedrock use an OpenSearch Serverless index as their vector database foundation. Additionally, the Agent executor
function generates SQL queries that are executed against the AWS Glue database through Amazon Athena.
Tools
AWS services
Amazon Athena is an interactive query service that helps you analyze data directly in Amazon Simple Storage Service (Amazon S3) by using standard SQL.
Amazon Bedrock is a fully managed service that makes high-performing foundation models (FMs) from leading AI startups and Amazon available for your use through a unified API.
AWS Cloud Development Kit (AWS CDK) is a software development framework that helps you define and provision AWS Cloud infrastructure in code.
AWS Command Line Interface (AWS CLI) is an open source tool that helps you interact with AWS services through commands in your command-line shell.
Amazon Elastic Container Service (Amazon ECS) is a fast and scalable container management service that helps you run, stop, and manage containers on a cluster.
Elastic Load Balancing (ELB) distributes incoming application or network traffic across multiple targets. For example, you can distribute traffic across Amazon Elastic Compute Cloud (Amazon EC2) instances, containers, and IP addresses in one or more Availability Zones.
AWS Glue is a fully managed extract, transform, and load (ETL) service. It helps you reliably categorize, clean, enrich, and move data between data stores and data streams. This pattern uses an AWS Glue crawler and an AWS Glue Data Catalog table.
AWS Lambda is a compute service that helps you run code without needing to provision or manage servers. It runs your code only when needed and scales automatically, so you pay only for the compute time that you use.
Amazon OpenSearch Serverless is an on-demand serverless configuration for Amazon OpenSearch Service. In this pattern, an OpenSearch Serverless index serves as a vector database for the knowledge bases for Amazon Bedrock.
Amazon Simple Storage Service (Amazon S3) is a cloud-based object storage service that helps you store, protect, and retrieve any amount of data.
Other tools
Streamlit
is an open source Python framework to create data applications.
Code repository
The code for this pattern is available in the GitHub genai-bedrock-agent-chatbot
assets
folder – The static assets, such as the architecture diagram and the public dataset.code/lambdas/action-lambda
folder – The Python code for the Lambda function that acts as an action for the Amazon Bedrock agent.code/lambdas/create-index-lambda
folder – The Python code for the Lambda function that creates the OpenSearch Serverless index.code/lambdas/invoke-lambda
folder – The Python code for the Lambda function that invokes the Amazon Bedrock agent, which is called directly from the Streamlit application.code/lambdas/update-lambda
folder – The Python code for the Lambda function that updates or deletes resources after the AWS resources are deployed through the AWS CDK.code/layers/boto3_layer
folder – The AWS CDK stack that creates a Boto3 layer that is shared across all Lambda functions.code/layers/opensearch_layer
folder – The AWS CDK stack that creates an OpenSearch Serverless layer that installs all dependencies to create the index.code/streamlit-app
folder – The Python code that is run as the container image in Amazon ECScode/code_stack.py
– The AWS CDK construct Python files that create AWS resources.app.py
– The AWS CDK stack Python files that deploy AWS resources in the target AWS account.requirements.txt
– The list of all Python dependencies that must be installed for the AWS CDK.cdk.json
– The input file to provide the values that are required to create resources. Also, in thecontext/config
fields, you can customize the solution accordingly. For more information about customization, see the Additional information section.
Best practices
The code example provided here is for proof-of-concept (PoC) or pilot purposes only. If you want to take the code to production, be sure to use the following best practices:
Enable Amazon S3 access logging
Enable VPC Flow Logs
Set up monitoring and alerting for the Lambda functions. For more information, see Monitoring and troubleshooting Lambda functions. For best practices, see the Best practices for working with AWS Lambda functions.
Epics
Task | Description | Skills required |
---|---|---|
Export variables for the account and Region. | To provide AWS credentials for the AWS CDK by using environment variables, run the following commands.
| AWS DevOps, DevOps engineer |
Set up the AWS CLI named profile. | To set up the AWS CLI named profile for the account, follow the instructions in Configuration and credential file settings. | AWS DevOps, DevOps engineer |
Task | Description | Skills required |
---|---|---|
Clone the repo to your local workstation. | To clone the repository, run the following command in your terminal.
| DevOps engineer, AWS DevOps |
Set up the Python virtual environment. | To set up the Python virtual environment, run the following commands.
To set up the required dependencies, run the following command.
| DevOps engineer, AWS DevOps |
Set up the AWS CDK environment. | To convert the code to an AWS CloudFormation template, run the command | AWS DevOps, DevOps engineer |
Task | Description | Skills required |
---|---|---|
Deploy resources in the account. | To deploy resources in the AWS account by using the AWS CDK, do the following:
After successful deployment, you can access the chat-based assistant application by using the URL provided on the Outputs tab in the CloudFormation console. | DevOps engineer, AWS DevOps |
Task | Description | Skills required |
---|---|---|
Remove the AWS resources. | After you test the solution, to clean up the resources, run the command | AWS DevOps, DevOps engineer |
Related resources
AWS documentation
Amazon Bedrock resources:
AWS CDK resources:
Other AWS resources
Other resources
Additional information
Customize the chat-based assistant with your own data
To integrate your custom data for deploying the solution, follow these structured guidelines. These steps are designed to ensure a seamless and efficient integration process, enabling you to deploy the solution effectively with your bespoke data.
For knowledge base data integration
Data preparation
Locate the
assets/knowledgebase_data_source/
directory.Place your dataset within this folder.
Configuration adjustments
Open the
cdk.json
file.Navigate to the
context/configure/paths/knowledgebase_file_name
field, and then update it accordingly.Navigate to the
bedrock_instructions/knowledgebase_instruction
field, and then update it to accurately reflect the nuances and context of your new dataset.
For structural data integration
Data organization
Within the
assets/data_query_data_source/
directory, create a subdirectory, such astabular_data
.Put your structured dataset (acceptable formats include CSV, JSON, ORC, and Parquet) into this newly created subfolder.
If you are connecting to an existing database, update the function
create_sql_engine()
incode/lambda/action-lambda/build_query_engine.py
to connect to your database.
Configuration and code updates
In the
cdk.json
file, update thecontext/configure/paths/athena_table_data_prefix
field to align with the new data path.Revise
code/lambda/action-lambda/dynamic_examples.csv
by incorporating new text-to-SQL examples that correspond with your dataset.Revise
code/lambda/action-lambda/prompt_templates.py
to mirror the attributes of your structured dataset.In the
cdk.json
file, update thecontext/configure/bedrock_instructions/action_group_description
field to explain the purpose and functionality of theAction group
Lambda function.In the
assets/agent_api_schema/artifacts_schema.json
file, explain the new functionalities of yourAction group
Lambda function.
General update
In the cdk.json
file, in the context/configure/bedrock_instructions/agent_instruction
section, provide a comprehensive description of the Amazon Bedrock agent's intended functionality and design purpose, taking into account the newly integrated data.