Package software.amazon.awscdk.services.bedrock.alpha
Amazon Bedrock Construct Library
---
The APIs of higher level constructs in this module are experimental and under active development. They are subject to non-backward compatible changes or removal in any future version. These are not subject to the Semantic Versioning model and breaking changes will be announced in the release notes. This means that while you may use them, you may need to update your source code when upgrading to a newer version of this package.
| Language | Package |
| :------- | ------- |
| TypeScript | @aws-cdk/aws-bedrock-alpha |
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies and Amazon through a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.
This construct library facilitates the deployment of Bedrock Agents, enabling you to create sophisticated AI applications that can interact with your systems and data sources.
Agents
Amazon Bedrock Agents allow generative AI applications to automate complex, multistep tasks by integrating with your company's systems, APIs, and data sources. An agent uses the reasoning of foundation models (FMs), APIs, and data to break down user requests, gather relevant information, and complete tasks.
Create an Agent
Building an agent is straightforward and fast. The following example creates an Agent with a simple instruction and default prompts:
Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_HAIKU_V1_0)
        .instruction("You are a helpful and friendly agent that answers questions about literature.")
        .build();
You can also create an agent with a guardrail:
// Create a guardrail to filter inappropriate content
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
        .guardrailName("my-BedrockGuardrails")
        .description("Legal ethical guardrails.")
        .build();

guardrail.addContentFilter(ContentFilter.builder()
        .type(ContentFilterType.SEXUAL)
        .inputStrength(ContentFilterStrength.HIGH)
        .outputStrength(ContentFilterStrength.MEDIUM)
        .build());

// Create an agent with the guardrail
Agent agentWithGuardrail = Agent.Builder.create(this, "AgentWithGuardrail")
        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_HAIKU_V1_0)
        .instruction("You are a helpful and friendly agent that answers questions about literature.")
        .guardrail(guardrail)
        .build();
Agent Properties
The Bedrock Agent class supports the following properties.
| Name | Type | Required | Description |
|---|---|---|---|
| name | string | No | The name of the agent. Defaults to a name generated by CDK |
| instruction | string | Yes | The instruction used by the agent that determines how it will perform its task. Must have a minimum of 40 characters |
| foundationModel | IBedrockInvokable | Yes | The foundation model used for orchestration by the agent |
| existingRole | iam.IRole | No | The existing IAM Role for the agent to use. Must have a trust policy allowing Bedrock service to assume the role. Defaults to a new created role |
| shouldPrepareAgent | boolean | No | Specifies whether to automatically update the DRAFT version of the agent after making changes. Defaults to false |
| idleSessionTTL | Duration | No | How long sessions should be kept open for the agent. Session expires if no conversation occurs during this time. Defaults to 1 hour |
| kmsKey | kms.IKey | No | The KMS key of the agent if custom encryption is configured. Defaults to AWS managed key |
| description | string | No | A description of the agent. Defaults to no description |
| actionGroups | AgentActionGroup[] | No | The Action Groups associated with the agent |
| guardrail | IGuardrail | No | The guardrail that will be associated with the agent. Defaults to no guardrail |
| memory | Memory | No | The type and configuration of the memory to maintain context across multiple sessions and recall past interactions. Defaults to no memory |
| promptOverrideConfiguration | PromptOverrideConfiguration | No | Overrides some prompt templates in different parts of an agent sequence configuration |
| userInputEnabled | boolean | No | Select whether the agent can prompt additional information from the user when it lacks enough information. Defaults to false |
| codeInterpreterEnabled | boolean | No | Select whether the agent can generate, run, and troubleshoot code when trying to complete a task. Defaults to false |
| forceDelete | boolean | No | Whether to delete the resource even if it's in use. Defaults to true |
| agentCollaboration | AgentCollaboration | No | Configuration for agent collaboration settings, including type and collaborators. This property allows you to define how the agent collaborates with other agents and what collaborators it can work with. Defaults to no agent collaboration configuration |
| customOrchestrationExecutor | CustomOrchestrationExecutor | No | The Lambda function to use for custom orchestration. If provided, orchestrationType is set to CUSTOM_ORCHESTRATION. If not provided, orchestrationType defaults to DEFAULT. Defaults to default orchestration |
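The table above states that instruction must have at least 40 characters. A minimal sketch of a check you could run before synthesis follows. InstructionCheck is a hypothetical helper, not part of the construct library, and it assumes the limit counts Java char units.

```java
// Hypothetical helper for illustration; not part of the construct library.
final class InstructionCheck {
    // Minimum length taken from the Agent Properties table.
    static final int MIN_INSTRUCTION_LENGTH = 40;

    // Returns true when the instruction meets the documented minimum length.
    static boolean isLongEnough(String instruction) {
        return instruction != null && instruction.length() >= MIN_INSTRUCTION_LENGTH;
    }

    public static void main(String[] args) {
        String longInstruction =
                "You are a helpful and friendly agent that answers questions about literature.";
        String shortInstruction = "You are a helpful assistant.";
        System.out.println(isLongEnough(longInstruction));  // prints "true"
        System.out.println(isLongEnough(shortInstruction)); // prints "false"
    }
}
```

A short instruction such as "You are a helpful assistant." has 28 characters and would not pass this check.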
Action Groups
An action group defines functions your agent can call. The functions are Lambda functions. The action group uses an OpenAPI schema to tell the agent what your functions do and how to call them.
Action Group Properties
The AgentActionGroup class supports the following properties.
| Name | Type | Required | Description |
|---|---|---|---|
| name | string | No | The name of the action group. Defaults to a name generated in the format 'action_group_quick_start_UUID' |
| description | string | No | A description of the action group |
| apiSchema | ApiSchema | No | The OpenAPI schema that defines the functions in the action group |
| executor | ActionGroupExecutor | No | The Lambda function that executes the actions in the group |
| enabled | boolean | No | Whether the action group is enabled. Defaults to true |
| forceDelete | boolean | No | Whether to delete the resource even if it's in use. Defaults to false |
| functionSchema | FunctionSchema | No | Defines functions that each define parameters that the agent needs to invoke from the user |
| parentActionGroupSignature | ParentActionGroupSignature | No | The AWS Defined signature for enabling certain capabilities in your agent |
There are three ways to provide an API schema for your action group:
From a local asset file (requires binding to scope):
Function actionGroupFunction = Function.Builder.create(this, "ActionGroupFunction")
        .runtime(Runtime.PYTHON_3_12)
        .handler("index.handler")
        .code(Code.fromAsset(join(__dirname, "../lambda/action-group")))
        .build();

// When using ApiSchema.fromLocalAsset, you must bind the schema to a scope
AssetApiSchema schema = ApiSchema.fromLocalAsset(join(__dirname, "action-group.yaml"));
schema.bind(this);

AgentActionGroup actionGroup = AgentActionGroup.Builder.create()
        .name("query-library")
        .description("Use these functions to get information about the books in the library.")
        .executor(ActionGroupExecutor.fromLambda(actionGroupFunction))
        .enabled(true)
        .apiSchema(schema)
        .build();

Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_HAIKU_V1_0)
        .instruction("You are a helpful and friendly agent that answers questions about literature.")
        .build();

agent.addActionGroup(actionGroup);
From an inline OpenAPI schema:
// The YAML nesting is expressed through indentation inside the string
InlineApiSchema inlineSchema = ApiSchema.fromInline(
        "\nopenapi: 3.0.3\n"
        + "info:\n"
        + "  title: Library API\n"
        + "  version: 1.0.0\n"
        + "paths:\n"
        + "  /search:\n"
        + "    get:\n"
        + "      summary: Search for books\n"
        + "      operationId: searchBooks\n"
        + "      parameters:\n"
        + "        - name: query\n"
        + "          in: query\n"
        + "          required: true\n"
        + "          schema:\n"
        + "            type: string\n");

Function actionGroupFunction = Function.Builder.create(this, "ActionGroupFunction")
        .runtime(Runtime.PYTHON_3_12)
        .handler("index.handler")
        .code(Code.fromAsset(join(__dirname, "../lambda/action-group")))
        .build();

AgentActionGroup actionGroup = AgentActionGroup.Builder.create()
        .name("query-library")
        .description("Use these functions to get information about the books in the library.")
        .executor(ActionGroupExecutor.fromLambda(actionGroupFunction))
        .enabled(true)
        .apiSchema(inlineSchema)
        .build();

Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_HAIKU_V1_0)
        .instruction("You are a helpful and friendly agent that answers questions about literature.")
        .build();

agent.addActionGroup(actionGroup);
From an existing S3 file:
IBucket bucket = Bucket.fromBucketName(this, "ExistingBucket", "my-schema-bucket");
S3ApiSchema s3Schema = ApiSchema.fromS3File(bucket, "schemas/action-group.yaml");

Function actionGroupFunction = Function.Builder.create(this, "ActionGroupFunction")
        .runtime(Runtime.PYTHON_3_12)
        .handler("index.handler")
        .code(Code.fromAsset(join(__dirname, "../lambda/action-group")))
        .build();

AgentActionGroup actionGroup = AgentActionGroup.Builder.create()
        .name("query-library")
        .description("Use these functions to get information about the books in the library.")
        .executor(ActionGroupExecutor.fromLambda(actionGroupFunction))
        .enabled(true)
        .apiSchema(s3Schema)
        .build();

Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_HAIKU_V1_0)
        .instruction("You are a helpful and friendly agent that answers questions about literature.")
        .build();

agent.addActionGroup(actionGroup);
Using FunctionSchema with Action Groups
As an alternative to using OpenAPI schemas, you can define functions directly using the FunctionSchema class. This approach provides a more structured way to define the functions that your agent can call.
Function actionGroupFunction = Function.Builder.create(this, "ActionGroupFunction")
        .runtime(Runtime.PYTHON_3_12)
        .handler("index.handler")
        .code(Code.fromAsset(join(__dirname, "../lambda/action-group")))
        .build();

// Define a function schema with parameters
FunctionSchema functionSchema = FunctionSchema.Builder.create()
        .functions(List.of(
                FunctionProps.builder()
                        .name("searchBooks")
                        .description("Search for books in the library catalog")
                        .parameters(Map.of(
                                "query", FunctionParameterProps.builder()
                                        .type(ParameterType.STRING)
                                        .required(true)
                                        .description("The search query string")
                                        .build(),
                                "maxResults", FunctionParameterProps.builder()
                                        .type(ParameterType.INTEGER)
                                        .required(false)
                                        .description("Maximum number of results to return")
                                        .build(),
                                "includeOutOfPrint", FunctionParameterProps.builder()
                                        .type(ParameterType.BOOLEAN)
                                        .required(false)
                                        .description("Whether to include out-of-print books")
                                        .build()))
                        .requireConfirmation(RequireConfirmation.DISABLED)
                        .build(),
                FunctionProps.builder()
                        .name("getBookDetails")
                        .description("Get detailed information about a specific book")
                        .parameters(Map.of(
                                "bookId", FunctionParameterProps.builder()
                                        .type(ParameterType.STRING)
                                        .required(true)
                                        .description("The unique identifier of the book")
                                        .build()))
                        .requireConfirmation(RequireConfirmation.ENABLED)
                        .build()))
        .build();

// Create an action group using the function schema
AgentActionGroup actionGroup = AgentActionGroup.Builder.create()
        .name("library-functions")
        .description("Functions for interacting with the library catalog")
        .executor(ActionGroupExecutor.fromLambda(actionGroupFunction))
        .functionSchema(functionSchema)
        .enabled(true)
        .build();

Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_HAIKU_V1_0)
        .instruction("You are a helpful and friendly agent that answers questions about literature.")
        .actionGroups(List.of(actionGroup))
        .build();
The FunctionSchema approach offers several advantages:
- Type-safe definition of functions and parameters
- Built-in validation of parameter names, descriptions, and other properties
- Clear structure that maps directly to the AWS Bedrock API
- Support for parameter types including string, number, integer, boolean, array, and object
- Option to require user confirmation before executing specific functions
If you choose to load your schema file from S3, the construct grants your agent's execution role the permissions it needs to read the schema file from that bucket. As when performing the operation through the console, the agent execution role receives a permission like:
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "AmazonBedrockAgentS3PolicyProd",
      "Effect": "Allow",
      "Action": [
        "s3:GetObject"
      ],
      "Resource": [
        "arn:aws:s3:::<BUCKET_NAME>/<OBJECT_KEY>"
      ],
      "Condition": {
        "StringEquals": {
          "aws:ResourceAccount": "ACCOUNT_NUMBER"
        }
      }
    }
  ]
}
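To make the Resource entry in the policy above concrete, the following sketch assembles the object ARN from a bucket name and object key. SchemaArn is a hypothetical helper introduced only for illustration; the construct computes this for you, and the sketch assumes the standard aws partition.

```java
// Hypothetical helper for illustration; the construct library handles this itself.
final class SchemaArn {
    // Builds the S3 object ARN in the shape used by the policy's Resource entry.
    // Assumes the standard "aws" partition.
    static String forObject(String bucketName, String objectKey) {
        return "arn:aws:s3:::" + bucketName + "/" + objectKey;
    }

    public static void main(String[] args) {
        // prints "arn:aws:s3:::my-schema-bucket/schemas/action-group.yaml"
        System.out.println(forObject("my-schema-bucket", "schemas/action-group.yaml"));
    }
}
```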
// create a bucket containing the input schema
Bucket schemaBucket = Bucket.Builder.create(this, "SchemaBucket")
        .enforceSSL(true)
        .versioned(true)
        .publicReadAccess(false)
        .blockPublicAccess(BlockPublicAccess.BLOCK_ALL)
        .encryption(BucketEncryption.S3_MANAGED)
        .removalPolicy(RemovalPolicy.DESTROY)
        .autoDeleteObjects(true)
        .build();

// deploy the local schema file to S3
BucketDeployment deployment = BucketDeployment.Builder.create(this, "DeployWebsite")
        .sources(List.of(Source.asset(join(__dirname, "../inputschema"))))
        .destinationBucket(schemaBucket)
        .destinationKeyPrefix("inputschema")
        .build();

// create the agent
Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V1_0)
        .instruction("You are a helpful and friendly agent that answers questions about literature.")
        .userInputEnabled(true)
        .shouldPrepareAgent(true)
        .build();

// create a lambda function
Function actionGroupFunction = Function.Builder.create(this, "ActionGroupFunction")
        .runtime(Runtime.PYTHON_3_12)
        .handler("index.handler")
        .code(Code.fromAsset(join(__dirname, "../lambda/action-group")))
        .build();

// create an action group and read the schema file from S3
AgentActionGroup actionGroup = AgentActionGroup.Builder.create()
        .name("query-library")
        .description("Use these functions to get information about the books in the library.")
        .executor(ActionGroupExecutor.fromLambda(actionGroupFunction))
        .enabled(true)
        .apiSchema(ApiSchema.fromS3File(schemaBucket, "inputschema/action-group.yaml"))
        .build();

// add the action group to the agent
agent.addActionGroup(actionGroup);

// add dependency for the agent on the s3 deployment
agent.getNode().addDependency(deployment);
Prepare the Agent
The Agent construct takes an optional parameter, shouldPrepareAgent, to indicate that the agent should be prepared after any update to the agent or one of its action groups. This may increase the time to create and update those resources. By default, this value is false.
Prepare Agent Properties
| Name | Type | Required | Description |
|---|---|---|---|
| shouldPrepareAgent | boolean | No | Whether to automatically update the DRAFT version of the agent after making changes. Defaults to false |
Creating an agent alias will not prepare the agent, so if you create an alias using the AgentAlias resource then you should set shouldPrepareAgent to true.
Prompt Override Configuration
Bedrock Agents allows you to customize the prompts and LLM configuration for different steps in the agent sequence. The implementation provides type-safe configurations for each step type, ensuring correct usage at compile time.
Prompt Override Configuration Properties
| Name | Type | Required | Description |
|---|---|---|---|
| steps | PromptStepConfiguration[] | Yes | Array of step configurations for different parts of the agent sequence |
| parser | lambda.IFunction | No | Lambda function for custom parsing of agent responses |
Prompt Step Configuration Properties
Each step in the steps array supports the following properties:
| Name | Type | Required | Description |
|---|---|---|---|
| stepType | AgentStepType | Yes | The type of step being configured (PRE_PROCESSING, ORCHESTRATION, POST_PROCESSING, ROUTING_CLASSIFIER, MEMORY_SUMMARIZATION, KNOWLEDGE_BASE_RESPONSE_GENERATION) |
| stepEnabled | boolean | No | Whether this step is enabled. Defaults to true |
| customPromptTemplate | string | No | Custom prompt template to use for this step |
| inferenceConfig | InferenceConfiguration | No | Configuration for model inference parameters |
| foundationModel | BedrockFoundationModel | No | Alternative foundation model to use for this step (only valid for ROUTING_CLASSIFIER step) |
| useCustomParser | boolean | No | Whether to use a custom parser for this step. Requires parser to be provided in PromptOverrideConfiguration |
Inference Configuration Properties
When providing inferenceConfig, the following properties are supported:
| Name | Type | Required | Description |
|---|---|---|---|
| temperature | number | No | Controls randomness in the model's output (0.0-1.0) |
| topP | number | No | Controls diversity via nucleus sampling (0.0-1.0) |
| topK | number | No | Controls diversity by limiting the cumulative probability |
| maximumLength | number | No | Maximum length of generated text |
| stopSequences | string[] | No | Sequences where the model should stop generating |
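The table above gives a 0.0-1.0 range for temperature and topP. As a rough sketch, a range check could look like the following. InferenceRangeCheck is a hypothetical helper, not a library class, and it treats both bounds as inclusive, which is an assumption.

```java
// Hypothetical helper for illustration; not part of the construct library.
final class InferenceRangeCheck {
    // True when the value lies in [0.0, 1.0]. Bounds are assumed inclusive.
    // NaN fails both comparisons and is therefore rejected.
    static boolean inUnitRange(double value) {
        return value >= 0.0 && value <= 1.0;
    }

    // Checks the two parameters whose range is documented in the table.
    static boolean isValid(double temperature, double topP) {
        return inUnitRange(temperature) && inUnitRange(topP);
    }

    public static void main(String[] args) {
        System.out.println(isValid(0.0, 1.0)); // prints "true"
        System.out.println(isValid(1.5, 0.9)); // prints "false"
    }
}
```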
The following steps can be configured:
- PRE_PROCESSING: Prepares the user input for orchestration
- ORCHESTRATION: Main step that determines the agent's actions
- POST_PROCESSING: Refines the agent's response
- ROUTING_CLASSIFIER: Classifies and routes requests to appropriate collaborators
- MEMORY_SUMMARIZATION: Summarizes conversation history for memory retention
- KNOWLEDGE_BASE_RESPONSE_GENERATION: Generates responses using knowledge base content
Example with pre-processing configuration:
Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.AMAZON_NOVA_LITE_V1)
        .instruction("You are a helpful assistant.")
        .promptOverrideConfiguration(PromptOverrideConfiguration.fromSteps(List.of(
                PromptStepConfigBase.builder()
                        .stepType(AgentStepType.PRE_PROCESSING)
                        .stepEnabled(true)
                        .customPromptTemplate("Your custom prompt template here")
                        .inferenceConfig(InferenceConfiguration.builder()
                                .temperature(0)
                                .topP(1)
                                .topK(250)
                                .maximumLength(1)
                                .stopSequences(List.of("\n\nHuman:"))
                                .build())
                        .build())))
        .build();
Example with routing classifier and foundation model:
Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.AMAZON_NOVA_LITE_V1)
        .instruction("You are a helpful assistant.")
        .promptOverrideConfiguration(PromptOverrideConfiguration.fromSteps(List.of(
                (PromptRoutingClassifierConfigCustomParser) PromptRoutingClassifierConfigCustomParser.builder()
                        .stepType(AgentStepType.ROUTING_CLASSIFIER)
                        .stepEnabled(true)
                        .customPromptTemplate("Your routing template here")
                        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_V2)
                        .build())))
        .build();
Using a custom Lambda parser:
Function parserFunction = Function.Builder.create(this, "ParserFunction")
        .runtime(Runtime.PYTHON_3_10)
        .handler("index.handler")
        .code(Code.fromAsset("lambda"))
        .build();

Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.AMAZON_NOVA_LITE_V1)
        .instruction("You are a helpful assistant.")
        .promptOverrideConfiguration(PromptOverrideConfiguration.withCustomParser(CustomParserProps.builder()
                .parser(parserFunction)
                .preProcessingStep(PromptPreProcessingConfigCustomParser.builder()
                        .stepType(AgentStepType.PRE_PROCESSING)
                        .useCustomParser(true)
                        .build())
                .build()))
        .build();
Foundation models can only be specified for the ROUTING_CLASSIFIER step.
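The rule above can be pictured as a small predicate over the step types listed earlier. This is only a sketch: StepModelRule and its nested StepType enum are hypothetical stand-ins that mirror the documented step names, not the library's own AgentStepType.

```java
// Hypothetical model of the rule for illustration; not part of the construct library.
final class StepModelRule {
    // Mirrors the step names listed in this section.
    enum StepType {
        PRE_PROCESSING,
        ORCHESTRATION,
        POST_PROCESSING,
        ROUTING_CLASSIFIER,
        MEMORY_SUMMARIZATION,
        KNOWLEDGE_BASE_RESPONSE_GENERATION
    }

    // Only the routing classifier step may carry its own foundation model.
    static boolean allowsFoundationModel(StepType step) {
        return step == StepType.ROUTING_CLASSIFIER;
    }

    // A step configuration is acceptable when it either sets no model,
    // or sets one on a step that allows it.
    static boolean isValid(StepType step, boolean hasFoundationModel) {
        return !hasFoundationModel || allowsFoundationModel(step);
    }

    public static void main(String[] args) {
        System.out.println(isValid(StepType.ROUTING_CLASSIFIER, true)); // prints "true"
        System.out.println(isValid(StepType.PRE_PROCESSING, true));     // prints "false"
        System.out.println(isValid(StepType.PRE_PROCESSING, false));    // prints "true"
    }
}
```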
Memory Configuration
Agents can maintain context across multiple sessions and recall past interactions using memory. This feature is useful for creating a more coherent conversational experience.
Memory Configuration Properties
| Name | Type | Required | Description |
|---|---|---|---|
| maxRecentSessions | number | No | Maximum number of recent session summaries to retain |
| memoryDuration | Duration | No | How long to retain session summaries |
Example:
Agent agent = Agent.Builder.create(this, "MyAgent")
        .agentName("MyAgent")
        .instruction("Your instruction here")
        .foundationModel(BedrockFoundationModel.AMAZON_NOVA_LITE_V1)
        .memory(Memory.sessionSummary(SessionSummaryMemoryProps.builder()
                .maxRecentSessions(10) // Keep the last 10 session summaries
                .memoryDuration(Duration.days(20))
                .build()))
        .build();
Agent Collaboration
Agent Collaboration enables multiple Bedrock Agents to work together on complex tasks. This feature allows agents to specialize in different areas and collaborate to provide more comprehensive responses to user queries.
Agent Collaboration Properties
| Name | Type | Required | Description |
|---|---|---|---|
| type | AgentCollaboratorType | Yes | Type of collaboration (SUPERVISOR or PEER) |
| collaborators | AgentCollaborator[] | Yes | List of agent collaborators |
Agent Collaborator Properties
| Name | Type | Required | Description |
|---|---|---|---|
| agentAlias | AgentAlias | Yes | The agent alias to collaborate with |
| collaborationInstruction | string | Yes | Instructions for how to collaborate with this agent |
| collaboratorName | string | Yes | Name of the collaborator |
| relayConversationHistory | boolean | No | Whether to relay conversation history to the collaborator. Defaults to false |
Example:
// Create a specialized agent
Agent customerSupportAgent = Agent.Builder.create(this, "CustomerSupportAgent")
        .instruction("You specialize in answering customer support questions.")
        .foundationModel(BedrockFoundationModel.AMAZON_NOVA_LITE_V1)
        .build();

// Create an agent alias
AgentAlias customerSupportAlias = AgentAlias.Builder.create(this, "CustomerSupportAlias")
        .agent(customerSupportAgent)
        .agentAliasName("production")
        .build();

// Create a main agent that collaborates with the specialized agent
Agent mainAgent = Agent.Builder.create(this, "MainAgent")
        .instruction("You route specialized questions to other agents.")
        .foundationModel(BedrockFoundationModel.AMAZON_NOVA_LITE_V1)
        .agentCollaboration(Map.of(
                "type", AgentCollaboratorType.SUPERVISOR,
                "collaborators", List.of(
                        AgentCollaborator.Builder.create()
                                .agentAlias(customerSupportAlias)
                                .collaborationInstruction("Route customer support questions to this agent.")
                                .collaboratorName("CustomerSupport")
                                .relayConversationHistory(true)
                                .build())))
        .build();
Custom Orchestration
Custom Orchestration allows you to override the default agent orchestration flow with your own Lambda function. This enables more control over how the agent processes user inputs and invokes action groups.
When you provide a customOrchestrationExecutor, the agent's orchestrationType is automatically set to CUSTOM_ORCHESTRATION. If no customOrchestrationExecutor is provided, the orchestrationType defaults to DEFAULT, using Amazon Bedrock's built-in orchestration.
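The selection rule described above amounts to a single null check. The sketch below models it with a hypothetical OrchestrationSketch class and a stand-in enum; it does not show the library's internals.

```java
// Hypothetical model of the selection rule; not the library's implementation.
final class OrchestrationSketch {
    // Stand-in for the two orchestration types named in this section.
    enum OrchestrationType { DEFAULT, CUSTOM_ORCHESTRATION }

    // A provided executor switches the agent to custom orchestration;
    // otherwise the built-in orchestration is used.
    static OrchestrationType resolve(Object customOrchestrationExecutor) {
        return customOrchestrationExecutor != null
                ? OrchestrationType.CUSTOM_ORCHESTRATION
                : OrchestrationType.DEFAULT;
    }

    public static void main(String[] args) {
        System.out.println(resolve(null));         // prints "DEFAULT"
        System.out.println(resolve(new Object())); // prints "CUSTOM_ORCHESTRATION"
    }
}
```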
Custom Orchestration Properties
| Name | Type | Required | Description |
|---|---|---|---|
| function | lambda.IFunction | Yes | The Lambda function that implements the custom orchestration logic |
Example:
Function orchestrationFunction = Function.Builder.create(this, "OrchestrationFunction")
        .runtime(Runtime.PYTHON_3_10)
        .handler("index.handler")
        .code(Code.fromAsset("lambda/orchestration"))
        .build();

Agent agent = Agent.Builder.create(this, "CustomOrchestrationAgent")
        .instruction("You are a helpful assistant with custom orchestration logic.")
        .foundationModel(BedrockFoundationModel.AMAZON_NOVA_LITE_V1)
        .customOrchestrationExecutor(CustomOrchestrationExecutor.fromLambda(orchestrationFunction))
        .build();
Agent Alias
After you have sufficiently iterated on your working draft and are satisfied with the behavior of your agent, you can set it up for deployment and integration into your application by creating aliases.
To deploy your agent, you need to create an alias. During alias creation, Amazon Bedrock automatically creates a version of your agent. The alias points to this newly created version. You can point the alias to a previously created version if necessary. You then configure your application to make API calls to that alias.
By default, the Agent resource creates a test alias named 'AgentTestAlias' that points to the 'DRAFT' version. This test alias is accessible via the testAlias property of the agent. You can also create additional aliases for different environments using the AgentAlias construct.
Agent Alias Properties
| Name | Type | Required | Description |
|---|---|---|---|
| agent | Agent | Yes | The agent to create an alias for |
| agentAliasName | string | No | The name of the agent alias. Defaults to a name generated by CDK |
| description | string | No | A description of the agent alias. Defaults to no description |
| routingConfiguration | AgentAliasRoutingConfiguration | No | Configuration for routing traffic between agent versions |
| agentVersion | string | No | The version of the agent to use. If not specified, a new version is created |
When redeploying an agent with changes, you must ensure the agent version is updated to avoid deployment failures with "agent already exists" errors. The recommended way to handle this is to include the agent's lastUpdated property in the alias description. The property value changes whenever the agent is modified, so the alias is updated and a new version is created on each deployment.
Example:
Agent agent = Agent.Builder.create(this, "Agent")
        .foundationModel(BedrockFoundationModel.ANTHROPIC_CLAUDE_HAIKU_V1_0)
        .instruction("You are a helpful and friendly agent that answers questions about literature.")
        .build();

AgentAlias agentAlias = AgentAlias.Builder.create(this, "myAlias")
        .agentAliasName("production")
        .agent(agent)
        .description(String.format("Production version of my agent. Created at %s", agent.getLastUpdated()))
        .build();
Guardrails
Amazon Bedrock's Guardrails feature enables you to implement robust governance and control mechanisms for your generative AI applications, ensuring alignment with your specific use cases and responsible AI policies. Guardrails empowers you to create multiple tailored policy configurations, each designed to address the unique requirements and constraints of different use cases. These policy configurations can then be seamlessly applied across multiple foundation models (FMs) and Agents, ensuring a consistent user experience and standardizing safety, security, and privacy controls throughout your generative AI ecosystem.
With Guardrails, you can define and enforce granular, customizable policies to precisely govern the behavior of your generative AI applications. You can configure the following policies in a guardrail to avoid undesirable and harmful content and remove sensitive information for privacy protection.
- Content filters: Adjust filter strengths to block input prompts or model responses containing harmful content.
- Denied topics: Define a set of topics that are undesirable in the context of your application. These topics will be blocked if detected in user queries or model responses.
- Word filters: Configure filters to block undesirable words, phrases, and profanity. Such words can include offensive terms, competitor names, etc.
- Sensitive information filters: Block or mask sensitive information such as personally identifiable information (PII) or custom regex in user inputs and model responses.
You can create a Guardrail with a minimal configuration: blockedInputMessaging, blockedOutputsMessaging, and a default content filter policy.
Basic Guardrail Creation
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
        .guardrailName("my-BedrockGuardrails")
        .description("Legal ethical guardrails.")
        .build();
Guardrail Properties
| Property | Type | Required | Description |
|----------|------|----------|-------------|
| guardrailName | string | Yes | The name of the guardrail |
| description | string | No | The description of the guardrail |
| blockedInputMessaging | string | No | The message to return when the guardrail blocks a prompt. Default: "Sorry, your query violates our usage policy." |
| blockedOutputsMessaging | string | No | The message to return when the guardrail blocks a model response. Default: "Sorry, I am unable to answer your question because of our usage policy." |
| kmsKey | IKey | No | A custom KMS key to use for encrypting data. Default: Your data is encrypted by default with a key that AWS owns and manages for you. |
| crossRegionConfig | GuardrailCrossRegionConfigProperty | No | The cross-region configuration for the guardrail. This enables cross-region inference for enhanced language support and filtering capabilities. Default: No cross-region configuration |
| contentFilters | ContentFilter[] | No | The content filters to apply to the guardrail |
| contentFiltersTierConfig | TierConfig | No | The tier configuration to apply to content filters. Default: TierConfig.CLASSIC |
| deniedTopics | Topic[] | No | Up to 30 denied topics to block user inputs or model responses associated with the topic |
| topicsTierConfig | TierConfig | No | The tier configuration to apply to topic filters. Default: TierConfig.CLASSIC |
| wordFilters | string[] | No | The word filters to apply to the guardrail |
| managedWordListFilters | ManagedWordFilterType[] | No | The managed word filters to apply to the guardrail |
| piiFilters | PIIFilter[] | No | The PII filters to apply to the guardrail |
| regexFilters | RegexFilter[] | No | The regular expression (regex) filters to apply to the guardrail |
| contextualGroundingFilters | ContextualGroundingFilter[] | No | The contextual grounding filters to apply to the guardrail |
Filter Types
Content Filters
Content filters allow you to block input prompts or model responses containing harmful content. You can adjust the filter strength and configure separate actions for input and output.
Content Filter Configuration
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
        .guardrailName("my-BedrockGuardrails")
        // Configure tier for content filters (optional)
        .contentFiltersTierConfig(TierConfig.STANDARD)
        .build();

guardrail.addContentFilter(ContentFilter.builder()
        .type(ContentFilterType.SEXUAL)
        .inputStrength(ContentFilterStrength.HIGH)
        .outputStrength(ContentFilterStrength.MEDIUM)
        // props below are optional
        .inputAction(GuardrailAction.BLOCK)
        .inputEnabled(true)
        .outputAction(GuardrailAction.NONE)
        .outputEnabled(true)
        .inputModalities(List.of(ModalityType.TEXT, ModalityType.IMAGE))
        .outputModalities(List.of(ModalityType.TEXT))
        .build());
Available content filter types:
- SEXUAL: Describes input prompts and model responses that indicate sexual interest, activity, or arousal
- VIOLENCE: Describes input prompts and model responses that include glorification of or threats to inflict physical pain
- HATE: Describes input prompts and model responses that discriminate, criticize, insult, denounce, or dehumanize a person or group
- INSULTS: Describes input prompts and model responses that include demeaning, humiliating, mocking, insulting, or belittling language
- MISCONDUCT: Describes input prompts and model responses that seek or provide information about engaging in misconduct activity
- PROMPT_ATTACK: Enable to detect and block user inputs attempting to override system instructions
Available content filter strengths:
- NONE: No filtering
- LOW: Light filtering
- MEDIUM: Moderate filtering
- HIGH: Strict filtering
Available guardrail actions:
- BLOCK: Blocks the content from being processed
- ANONYMIZE: Masks the content with an identifier tag
- NONE: Takes no action
Warning: the ANONYMIZE action is not available in all configurations. Refer to the documentation of each filter to see which ones support it.
Available modality types:
- TEXT: Text modality for content filters
- IMAGE: Image modality for content filters
Tier Configuration
Guardrails support tier configurations that determine the level of language support and robustness for content and topic filters. You can configure separate tier settings for content filters and topic filters.
Tier Configuration Options
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    // Configure tier for content filters
    .contentFiltersTierConfig(TierConfig.STANDARD)
    // Configure tier for topic filters
    .topicsTierConfig(TierConfig.CLASSIC)
    .build();
Available tier configurations:
- CLASSIC: Provides established guardrails functionality supporting English, French, and Spanish languages
- STANDARD: Provides a more robust solution than the CLASSIC tier and has more comprehensive language support. This tier requires that your guardrail use cross-Region inference
Note: The STANDARD tier provides enhanced language support and more comprehensive filtering capabilities, but requires cross-Region inference to be enabled for your guardrail.
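The dependency between the STANDARD tier and cross-Region inference can be checked before synthesis. The following is a minimal sketch in plain Java, not part of the construct library: `SketchTier` and `TierCheckSketch` are hypothetical stand-ins for illustration, and the check only encodes the rule stated in the note above.

```java
import java.util.Optional;

// Hypothetical stand-in for the library's tier enum (illustration only).
enum SketchTier { CLASSIC, STANDARD }

final class TierCheckSketch {
    /**
     * Returns an error message when a STANDARD tier is requested without a
     * guardrail profile ARN, or an empty Optional when the combination is fine.
     */
    static Optional<String> validate(SketchTier contentTier, SketchTier topicsTier, String guardrailProfileArn) {
        boolean needsCrossRegion = contentTier == SketchTier.STANDARD || topicsTier == SketchTier.STANDARD;
        boolean hasProfile = guardrailProfileArn != null && !guardrailProfileArn.trim().isEmpty();
        if (needsCrossRegion && !hasProfile) {
            return Optional.of("STANDARD tier requires a cross-Region guardrail profile ARN");
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // STANDARD content tier without a profile ARN: reports a problem.
        System.out.println(validate(SketchTier.STANDARD, SketchTier.CLASSIC, null));
        // CLASSIC for both tiers: nothing to report.
        System.out.println(validate(SketchTier.CLASSIC, SketchTier.CLASSIC, null));
    }
}
```

A check like this could run in a unit test over your stack configuration so that a missing profile ARN is caught early rather than at deployment time.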
Cross-Region Configuration
You can configure a system-defined guardrail profile to use with your guardrail. Guardrail profiles define the destination AWS Regions where guardrail inference requests can be automatically routed. Using guardrail profiles helps maintain guardrail performance and reliability when demand increases.
Cross-Region Configuration Properties
| Property | Type | Required | Description |
|----------|------|----------|-------------|
| guardrailProfileArn | string | Yes | The ARN of the system-defined guardrail profile that defines the destination AWS Regions where guardrail inference requests can be automatically routed |
Cross-Region Configuration Example
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    .description("Guardrail with cross-region configuration for enhanced language support")
    .crossRegionConfig(GuardrailCrossRegionConfigProperty.builder()
        .guardrailProfileArn("arn:aws:bedrock:us-east-1:123456789012:guardrail-profile/my-profile")
        .build())
    // Use STANDARD tier for enhanced capabilities
    .contentFiltersTierConfig(TierConfig.STANDARD)
    .topicsTierConfig(TierConfig.STANDARD)
    .build();
Note: Cross-region configuration is required when using the STANDARD tier for content and topic filters. It helps maintain guardrail performance and reliability when demand increases by automatically routing inference requests to appropriate regions.
You must also grant the permissions required for cross-Region guardrail inference, as described at https://docs.aws.amazon.com/bedrock/latest/userguide/guardrail-profiles-permissions.html
Denied Topics
Denied topics allow you to define a set of topics that are undesirable in the context of your application. These topics will be blocked if detected in user queries or model responses. You can configure separate actions for input and output.
Denied Topic Configuration
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    // Configure tier for topic filters (optional)
    .topicsTierConfig(TierConfig.STANDARD)
    .build();

// Use a predefined topic
guardrail.addDeniedTopicFilter(Topic.FINANCIAL_ADVICE);

// Create a custom topic with input/output actions
guardrail.addDeniedTopicFilter(Topic.custom(CustomTopicProps.builder()
    .name("Legal_Advice")
    .definition("Offering guidance or suggestions on legal matters, legal actions, interpretation of laws, or legal rights and responsibilities.")
    .examples(List.of(
        "Can I sue someone for this?",
        "What are my legal rights in this situation?",
        "Is this action against the law?",
        "What should I do to file a legal complaint?",
        "Can you explain this law to me?"))
    // props below are optional
    .inputAction(GuardrailAction.BLOCK)
    .inputEnabled(true)
    .outputAction(GuardrailAction.NONE)
    .outputEnabled(true)
    .build()));
Word Filters
Word filters allow you to block specific words, phrases, or profanity in user inputs and model responses. You can configure separate actions for input and output.
Word Filter Configuration
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    .build();

// Add managed word list with input/output actions
guardrail.addManagedWordListFilter(ManagedWordFilter.builder()
    .type(ManagedWordFilterType.PROFANITY)
    .inputAction(GuardrailAction.BLOCK)
    .inputEnabled(true)
    .outputAction(GuardrailAction.NONE)
    .outputEnabled(true)
    .build());

// Add individual words
guardrail.addWordFilter(WordFilter.builder().text("drugs").build());
guardrail.addWordFilter(WordFilter.builder().text("competitor").build());

// Add words from a file
guardrail.addWordFilterFromFile("./scripts/wordsPolicy.csv");
PII Filters
PII filters allow you to detect and handle personally identifiable information in user inputs and model responses. You can configure separate actions for input and output.
The PII types are organized into enum-like classes for better type safety and transpilation compatibility:
- GeneralPIIType: General PII types like addresses, emails, names, phone numbers
- FinancePIIType: Financial information like credit card numbers, PINs, SWIFT codes
- InformationTechnologyPIIType: IT-related data like URLs, IP addresses, AWS keys
- USASpecificPIIType: US-specific identifiers like SSNs, passport numbers
- CanadaSpecificPIIType: Canada-specific identifiers like health numbers, SINs
- UKSpecificPIIType: UK-specific identifiers like NHS numbers, NI numbers
PII Filter Configuration
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    .build();

// Add PII filter for addresses with input/output actions
guardrail.addPIIFilter(PIIFilter.builder()
    .type(GeneralPIIType.ADDRESS)
    .action(GuardrailAction.BLOCK)
    // below props are optional
    .inputAction(GuardrailAction.BLOCK)
    .inputEnabled(true)
    .outputAction(GuardrailAction.ANONYMIZE)
    .outputEnabled(true)
    .build());

// Add PII filter for credit card numbers with input/output actions
guardrail.addPIIFilter(PIIFilter.builder()
    .type(FinancePIIType.CREDIT_DEBIT_CARD_NUMBER)
    .action(GuardrailAction.BLOCK)
    // below props are optional
    .inputAction(GuardrailAction.BLOCK)
    .inputEnabled(true)
    .outputAction(GuardrailAction.ANONYMIZE)
    .outputEnabled(true)
    .build());

// Add PII filter for email addresses
guardrail.addPIIFilter(PIIFilter.builder()
    .type(GeneralPIIType.EMAIL)
    .action(GuardrailAction.ANONYMIZE)
    .build());

// Add PII filter for US Social Security Numbers
guardrail.addPIIFilter(PIIFilter.builder()
    .type(USASpecificPIIType.US_SOCIAL_SECURITY_NUMBER)
    .action(GuardrailAction.BLOCK)
    .build());

// Add PII filter for IP addresses
guardrail.addPIIFilter(PIIFilter.builder()
    .type(InformationTechnologyPIIType.IP_ADDRESS)
    .action(GuardrailAction.ANONYMIZE)
    .build());
Available PII Types
GeneralPIIType:
- ADDRESS: Physical addresses
- AGE: Individual's age
- DRIVER_ID: Driver's license numbers
- EMAIL: Email addresses
- LICENSE_PLATE: Vehicle license plates
- NAME: Individual names
- PASSWORD: Passwords
- PHONE: Phone numbers
- USERNAME: User account names
- VEHICLE_IDENTIFICATION_NUMBER: Vehicle VINs
FinancePIIType:
- CREDIT_DEBIT_CARD_CVV: Card verification codes
- CREDIT_DEBIT_CARD_EXPIRY: Card expiration dates
- CREDIT_DEBIT_CARD_NUMBER: Credit/debit card numbers
- PIN: Personal identification numbers
- SWIFT_CODE: Bank SWIFT codes
- INTERNATIONAL_BANK_ACCOUNT_NUMBER: IBAN numbers
InformationTechnologyPIIType:
- URL: Web addresses
- IP_ADDRESS: IPv4 addresses
- MAC_ADDRESS: Network interface MAC addresses
- AWS_ACCESS_KEY: AWS access key IDs
- AWS_SECRET_KEY: AWS secret access keys
USASpecificPIIType:
- US_BANK_ACCOUNT_NUMBER: US bank account numbers
- US_BANK_ROUTING_NUMBER: US bank routing numbers
- US_INDIVIDUAL_TAX_IDENTIFICATION_NUMBER: US ITINs
- US_PASSPORT_NUMBER: US passport numbers
- US_SOCIAL_SECURITY_NUMBER: US Social Security Numbers
CanadaSpecificPIIType:
- CA_HEALTH_NUMBER: Canadian Health Service Numbers
- CA_SOCIAL_INSURANCE_NUMBER: Canadian Social Insurance Numbers
UKSpecificPIIType:
- UK_NATIONAL_HEALTH_SERVICE_NUMBER: UK NHS numbers
- UK_NATIONAL_INSURANCE_NUMBER: UK National Insurance numbers
- UK_UNIQUE_TAXPAYER_REFERENCE_NUMBER: UK UTR numbers
Regex Filters
Regex filters allow you to detect and handle custom patterns in user inputs and model responses. You can configure separate actions for input and output.
Regex Filter Configuration
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    .build();

// Add regex filter with input/output actions
guardrail.addRegexFilter(RegexFilter.builder()
    .name("TestRegexFilter")
    .pattern("test-pattern")
    .action(GuardrailAction.ANONYMIZE)
    // below props are optional
    .description("This is a test regex filter")
    .inputAction(GuardrailAction.BLOCK)
    .inputEnabled(true)
    .outputAction(GuardrailAction.ANONYMIZE)
    .outputEnabled(true)
    .build());
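To build intuition for how a regex filter's action affects content, the sketch below mimics BLOCK, ANONYMIZE, and NONE locally with `java.util.regex`. It is an illustration only and not the service's implementation: `RegexFilterSketch` is a hypothetical helper, and the `{name}` mask format is an assumption made for the example.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class RegexFilterSketch {
    enum Action { BLOCK, ANONYMIZE, NONE }

    /**
     * Applies a single regex filter to the input.
     * Returns an empty Optional when the content is blocked,
     * otherwise the (possibly masked) text.
     */
    static Optional<String> apply(String name, String pattern, Action action, String input) {
        Matcher matcher = Pattern.compile(pattern).matcher(input);
        if (action == Action.NONE || !matcher.find()) {
            return Optional.of(input); // nothing matched, or no action requested
        }
        if (action == Action.BLOCK) {
            return Optional.empty(); // the whole message is withheld
        }
        // ANONYMIZE: replace every match with a tag derived from the filter name (assumed format).
        String tag = Matcher.quoteReplacement("{" + name + "}");
        return Optional.of(matcher.replaceAll(tag));
    }

    public static void main(String[] args) {
        System.out.println(apply("TestRegexFilter", "test-pattern", Action.ANONYMIZE, "a test-pattern b"));
        System.out.println(apply("TestRegexFilter", "test-pattern", Action.BLOCK, "a test-pattern b"));
    }
}
```

Testing a pattern this way before adding it to a guardrail can help catch expressions that match more, or less, than intended.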
Contextual Grounding Filters
Contextual grounding filters allow you to ensure that model responses are factually correct and relevant to the user's query. You can configure the action and enable/disable the filter.
Contextual Grounding Filter Configuration
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    .build();

// Add contextual grounding filter with action and enabled flag
guardrail.addContextualGroundingFilter(ContextualGroundingFilter.builder()
    .type(ContextualGroundingFilterType.GROUNDING)
    .threshold(0.8)
    // the properties below are optional
    .action(GuardrailAction.BLOCK)
    .enabled(true)
    .build());
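The threshold in the example above can be read as a minimum acceptable score. The sketch below shows that reading in plain Java, under the assumption that a response whose grounding score falls below the threshold is the one acted upon. `GroundingThresholdSketch` is a hypothetical helper and does not call Amazon Bedrock.

```java
final class GroundingThresholdSketch {
    enum Action { BLOCK, NONE }

    /**
     * Returns true when a response with the given score would be withheld.
     * Assumption: scores below the threshold trigger the configured action.
     */
    static boolean shouldBlock(double score, double threshold, Action action, boolean enabled) {
        if (!enabled || action == Action.NONE) {
            return false; // filter disabled or configured to take no action
        }
        return score < threshold;
    }

    public static void main(String[] args) {
        System.out.println(shouldBlock(0.75, 0.8, Action.BLOCK, true));
        System.out.println(shouldBlock(0.90, 0.8, Action.BLOCK, true));
    }
}
```

Under this reading, raising the threshold makes the filter stricter, because more responses fall below it.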
Guardrail Methods
| Method | Description |
|--------|-------------|
| addContentFilter() | Adds a content filter to the guardrail |
| addDeniedTopicFilter() | Adds a denied topic filter to the guardrail |
| addWordFilter() | Adds a word filter to the guardrail |
| addManagedWordListFilter() | Adds a managed word list filter to the guardrail |
| addWordFilterFromFile() | Adds word filters from a file to the guardrail |
| addPIIFilter() | Adds a PII filter to the guardrail |
| addRegexFilter() | Adds a regex filter to the guardrail |
| addContextualGroundingFilter() | Adds a contextual grounding filter to the guardrail |
| createVersion() | Creates a new version of the guardrail |
Guardrail Permissions
Guardrails provide methods to grant permissions to other resources that need to interact with the guardrail.
Permission Methods
| Method | Description | Parameters |
|--------|-------------|------------|
| grant(grantee, ...actions) | Grants the given principal identity permissions to perform actions on this guardrail | grantee: The principal to grant permissions to; actions: The actions to grant (e.g., bedrock:GetGuardrail, bedrock:ListGuardrails) |
| grantApply(grantee) | Grants the given identity permissions to apply the guardrail | grantee: The principal to grant permissions to |
Permission Examples
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    .build();

Function lambdaFunction = Function.Builder.create(this, "testLambda")
    .runtime(Runtime.PYTHON_3_12)
    .handler("index.handler")
    // Path to the Lambda source directory; adjust it to your project layout
    .code(Code.fromAsset("../lambda/my-code"))
    .build();

// Grant specific permissions to a Lambda function
guardrail.grant(lambdaFunction, "bedrock:GetGuardrail", "bedrock:ListGuardrails");

// Grant permissions to apply the guardrail
guardrail.grantApply(lambdaFunction);
Guardrail Metrics
Amazon Bedrock provides metrics for your guardrails, allowing you to monitor their effectiveness and usage. These metrics are available in CloudWatch and can be used to create dashboards and alarms.
Metrics Examples
import software.amazon.awscdk.services.cloudwatch.*;

Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    .build();

// Get a specific metric for this guardrail
Metric invocationsMetric = guardrail.metricInvocations(MetricOptions.builder()
    .statistic("Sum")
    .period(Duration.minutes(5))
    .build());

// Create a CloudWatch alarm for high invocation latency
Alarm.Builder.create(this, "HighLatencyAlarm")
    .metric(guardrail.metricInvocationLatency())
    .threshold(1000) // 1 second
    .evaluationPeriods(3)
    .build();

// Get metrics for all guardrails
Metric allInvocationsMetric = Guardrail.metricAllInvocations();
Importing Guardrails
You can import existing guardrails using the fromGuardrailAttributes or fromCfnGuardrail methods.
Import Configuration
Stack stack;

Key cmk = Key.Builder.create(this, "cmk").build();

// Import an existing guardrail by ARN
IGuardrail importedGuardrail = Guardrail.fromGuardrailAttributes(stack, "TestGuardrail", GuardrailAttributes.builder()
    .guardrailArn("arn:aws:bedrock:us-east-1:123456789012:guardrail/oygh3o8g7rtl")
    .guardrailVersion("1") // optional
    .kmsKey(cmk)
    .build());

import software.amazon.awscdk.services.bedrock.*;

// Import a guardrail created through the L1 CDK CfnGuardrail construct
CfnGuardrail l1guardrail = CfnGuardrail.Builder.create(this, "MyCfnGuardrail")
    .blockedInputMessaging("blockedInputMessaging")
    .blockedOutputsMessaging("blockedOutputsMessaging")
    .name("namemycfnguardrails")
    .wordPolicyConfig(WordPolicyConfigProperty.builder()
        .wordsConfig(List.of(WordConfigProperty.builder()
            .text("drugs")
            .build()))
        .build())
    .build();

IGuardrail importedGuardrail = Guardrail.fromCfnGuardrail(l1guardrail);
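When importing by ARN, it can help to confirm that the string really is a guardrail ARN and to see which Region, account, and guardrail ID it carries. The following is a small sketch in plain Java that assumes the `arn:partition:bedrock:region:account:guardrail/id` layout used in the example above. `GuardrailArnSketch` is a hypothetical helper, not part of the construct library.

```java
final class GuardrailArnSketch {
    final String partition;
    final String region;
    final String accountId;
    final String guardrailId;

    private GuardrailArnSketch(String partition, String region, String accountId, String guardrailId) {
        this.partition = partition;
        this.region = region;
        this.accountId = accountId;
        this.guardrailId = guardrailId;
    }

    /** Splits a guardrail ARN into its components, or throws if the shape is unexpected. */
    static GuardrailArnSketch parse(String arn) {
        String[] parts = arn.split(":", 6);
        boolean valid = parts.length == 6
                && parts[0].equals("arn")
                && parts[2].equals("bedrock")
                && parts[5].startsWith("guardrail/");
        if (!valid) {
            throw new IllegalArgumentException("Not a Bedrock guardrail ARN: " + arn);
        }
        return new GuardrailArnSketch(parts[1], parts[3], parts[4], parts[5].substring("guardrail/".length()));
    }

    public static void main(String[] args) {
        GuardrailArnSketch arn = parse("arn:aws:bedrock:us-east-1:123456789012:guardrail/oygh3o8g7rtl");
        System.out.println(arn.region + " " + arn.accountId + " " + arn.guardrailId);
    }
}
```

Note that a guardrail profile ARN (resource type `guardrail-profile/`) is deliberately rejected by this check, since it identifies a different kind of resource.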
Guardrail Versioning
Guardrails support versioning, allowing you to track changes and maintain multiple versions of your guardrail configurations.
Version Configuration
Guardrail guardrail = Guardrail.Builder.create(this, "bedrockGuardrails")
    .guardrailName("my-BedrockGuardrails")
    .build();

// Create a new version of the guardrail
guardrail.createVersion("testversion");
Prompts
Amazon Bedrock provides the ability to create and save prompts using Prompt management so that you can save time by applying the same prompt to different workflows. You can include variables in the prompt so that you can adjust the prompt for different use cases.
The Prompt resource allows you to create a new prompt.
Prompt Variants
Prompt variants in the context of Amazon Bedrock refer to alternative configurations of a prompt, including its message or the model and inference configurations used. Prompt variants are the building blocks of prompts - you must create at least one prompt variant to create a prompt. Prompt variants allow you to create different versions of a prompt, test them, and save the variant that works best for your use case.
There are three types of prompt variants:
- Basic Text Prompt: Simple text-based prompts for straightforward interactions
- Chat variant: Conversational prompts that support system messages, user/assistant message history, and tools
- Agent variant: Prompts designed to work with Bedrock Agents
Basic Text Prompt
Text prompts are the simplest form of prompts, consisting of plain text instructions with optional variables. They are ideal for straightforward tasks like summarization, content generation, or question answering where you need a direct text-based interaction with the model.
Key cmk = Key.Builder.create(this, "cmk").build();
BedrockFoundationModel claudeModel = BedrockFoundationModel.ANTHROPIC_CLAUDE_SONNET_V1_0;

IPromptVariant variant1 = PromptVariant.text(TextPromptVariantProps.builder()
    .variantName("variant1")
    .model(claudeModel)
    .promptVariables(List.of("topic"))
    .promptText("This is my first text prompt. Please summarize our conversation on: {{topic}}.")
    .inferenceConfiguration(PromptInferenceConfiguration.text(PromptInferenceConfigurationProps.builder()
        .temperature(1)
        .topP(0.999)
        .maxTokens(2000)
        .build()))
    .build());

Prompt prompt1 = Prompt.Builder.create(this, "prompt1")
    .promptName("prompt1")
    .description("my first prompt")
    .defaultVariant(variant1)
    .variants(List.of(variant1))
    .kmsKey(cmk)
    .build();
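The `{{topic}}` placeholder in the prompt text is filled in with a value when the prompt is used. To preview locally what a rendered prompt would look like, a sketch along these lines may help. It assumes placeholders have the form `{{name}}` with word characters only. `PromptTemplateSketch` is a hypothetical helper and does not reflect how Amazon Bedrock performs the substitution internally.

```java
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class PromptTemplateSketch {
    // Matches placeholders such as {{topic}}; the variable name is captured in group 1.
    private static final Pattern VARIABLE = Pattern.compile("\\{\\{(\\w+)\\}\\}");

    /** Replaces each {{name}} placeholder with its value, failing on unknown variables. */
    static String render(String template, Map<String, String> values) {
        Matcher matcher = VARIABLE.matcher(template);
        StringBuilder out = new StringBuilder();
        while (matcher.find()) {
            String name = matcher.group(1);
            if (!values.containsKey(name)) {
                throw new IllegalArgumentException("Missing value for variable: " + name);
            }
            matcher.appendReplacement(out, Matcher.quoteReplacement(values.get(name)));
        }
        matcher.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(render(
                "Please summarize our conversation on: {{topic}}.",
                Map.of("topic", "poetry")));
    }
}
```

Failing fast on a missing variable mirrors the idea that every name listed in `promptVariables` should have a value at invocation time.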
Chat Prompt
Use this template type when the model supports the Converse API or the Anthropic Claude Messages API. This allows you to include a System prompt and previous User messages and Assistant messages for context.
Key cmk = Key.Builder.create(this, "cmk").build();

IPromptVariant variantChat = PromptVariant.chat(ChatPromptVariantProps.builder()
    .variantName("variant1")
    .model(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V1_0)
    .messages(List.of(
        ChatMessage.user("From now on, you speak Japanese!"),
        ChatMessage.assistant("Konnichiwa!"),
        ChatMessage.user("From now on, you speak {{language}}!")))
    .system("You are a helpful assistant that only speaks the language you're told.")
    .promptVariables(List.of("language"))
    .toolConfiguration(ToolConfiguration.builder()
        .toolChoice(ToolChoice.AUTO)
        .tools(List.of(Tool.function(FunctionToolProps.builder()
            .name("top_song")
            .description("Get the most popular song played on a radio station.")
            .inputSchema(Map.of(
                "type", "object",
                "properties", Map.of(
                    "sign", Map.of(
                        "type", "string",
                        "description", "The call sign for the radio station for which you want the most popular song. Example call signs are WZPZ and WKR.")),
                "required", List.of("sign")))
            .build())))
        .build())
    .build());

Prompt.Builder.create(this, "prompt1")
    .promptName("prompt-chat")
    .description("my first chat prompt")
    .defaultVariant(variantChat)
    .variants(List.of(variantChat))
    .kmsKey(cmk)
    .build();
Agent Prompt
Agent prompts are designed to work with Bedrock Agents, allowing you to create prompts that can be used by agents to perform specific tasks. Agent prompts use text prompts as their foundation and can reference agent aliases and include custom instructions for how the agent should behave.
Key cmk = Key.Builder.create(this, "cmk").build();

// Assuming you have an existing agent and alias
IAgent agent = Agent.fromAgentAttributes(this, "ImportedAgent", AgentAttributes.builder()
    .agentArn("arn:aws:bedrock:region:account:agent/agent-id")
    .roleArn("arn:aws:iam::account:role/agent-role")
    .build());

IAgentAlias agentAlias = AgentAlias.fromAttributes(this, "ImportedAlias", AgentAliasAttributes.builder()
    .aliasId("alias-id")
    .aliasName("my-alias")
    .agentVersion("1")
    .agent(agent)
    .build());

IPromptVariant agentVariant = PromptVariant.agent(AgentPromptVariantProps.builder()
    .variantName("agent-variant")
    .model(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V1_0)
    .agentAlias(agentAlias)
    .promptText("Use the agent to help with: {{task}}. Please be thorough and provide detailed explanations.")
    .promptVariables(List.of("task"))
    .build());

Prompt.Builder.create(this, "agentPrompt")
    .promptName("agent-prompt")
    .description("Prompt for agent interactions")
    .defaultVariant(agentVariant)
    .variants(List.of(agentVariant))
    .kmsKey(cmk)
    .build();
Prompt Properties
| Property | Type | Required | Description |
|----------|------|----------|-------------|
| promptName | string | Yes | The name of the prompt |
| description | string | No | A description of the prompt |
| defaultVariant | PromptVariant | Yes | The default variant to use for the prompt |
| variants | PromptVariant[] | No | Additional variants for the prompt |
| kmsKey | kms.IKey | No | The KMS key to use for encrypting the prompt. Defaults to AWS managed key |
| tags | Record<string, string> | No | Tags to apply to the prompt |
Prompt Version
A prompt version is a snapshot of a prompt at a specific point in time that you create when you are satisfied with a set of configurations. Versions allow you to deploy your prompt, switch easily between different configurations, and update your application with the most appropriate version for your use case.
You can create a Prompt version by using the PromptVersion class or by calling the .createVersion(..) method on a Prompt object. The .createVersion(..) method is recommended: it uses a hash-based mechanism to update the version whenever a relevant configuration property changes.
Key cmk = Key.Builder.create(this, "cmk").build();
BedrockFoundationModel claudeModel = BedrockFoundationModel.ANTHROPIC_CLAUDE_SONNET_V1_0;

IPromptVariant variant1 = PromptVariant.text(TextPromptVariantProps.builder()
    .variantName("variant1")
    .model(claudeModel)
    .promptVariables(List.of("topic"))
    .promptText("This is my first text prompt. Please summarize our conversation on: {{topic}}.")
    .inferenceConfiguration(PromptInferenceConfiguration.text(PromptInferenceConfigurationProps.builder()
        .temperature(1)
        .topP(0.999)
        .maxTokens(2000)
        .build()))
    .build());

Prompt prompt1 = Prompt.Builder.create(this, "prompt1")
    .promptName("prompt1")
    .description("my first prompt")
    .defaultVariant(variant1)
    .variants(List.of(variant1))
    .kmsKey(cmk)
    .build();

PromptVersion promptVersion = PromptVersion.Builder.create(this, "MyPromptVersion")
    .prompt(prompt1)
    .description("my first version")
    .build();

// or alternatively:
// var promptVersion = prompt1.createVersion("my first version");
String versionString = promptVersion.getVersion();
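The idea behind a hash-based versioning mechanism is that a fingerprint of the configuration changes whenever any tracked property changes, which in turn signals that a new version is needed. The sketch below illustrates that idea with SHA-256 over a sorted set of properties. It is not the library's actual algorithm: `VersionHashSketch`, the property names, and the canonical form are all assumptions made for the example.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Map;
import java.util.TreeMap;

final class VersionHashSketch {
    /** Returns a hex SHA-256 fingerprint that is stable across property ordering. */
    static String fingerprint(Map<String, String> properties) {
        // Sort by key so the same properties always produce the same canonical text.
        StringBuilder canonical = new StringBuilder();
        for (Map.Entry<String, String> entry : new TreeMap<>(properties).entrySet()) {
            canonical.append(entry.getKey()).append('=').append(entry.getValue()).append('\n');
        }
        try {
            byte[] digest = MessageDigest.getInstance("SHA-256")
                    .digest(canonical.toString().getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required to be available on every Java platform.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String before = fingerprint(Map.of("promptName", "prompt1", "temperature", "1"));
        String after = fingerprint(Map.of("promptName", "prompt1", "temperature", "0.5"));
        System.out.println(before.equals(after)); // a changed property yields a different fingerprint
    }
}
```

Because the fingerprint depends only on the property values, redeploying an unchanged prompt would leave it untouched, while any edit would produce a new one.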
Import Methods
You can use the fromPromptAttributes method to import an existing Bedrock Prompt into your CDK application.
// Import an existing prompt by ARN
IPrompt importedPrompt = Prompt.fromPromptAttributes(this, "ImportedPrompt", PromptAttributes.builder()
    .promptArn("arn:aws:bedrock:region:account:prompt/prompt-id")
    .kmsKey(Key.fromKeyArn(this, "ImportedKey", "arn:aws:kms:region:account:key/key-id")) // optional
    .promptVersion("1")
    .build());
Inference Profiles
Amazon Bedrock Inference Profiles provide a way to manage and optimize inference configurations for your foundation models. They allow you to define reusable configurations that can be applied across different prompts and agents.
Using Inference Profiles
Inference profiles can be used with prompts and agents to maintain consistent inference configurations across your application.
With Agents
// Create a cross-region inference profile
CrossRegionInferenceProfile crossRegionProfile = CrossRegionInferenceProfile.fromConfig(CrossRegionInferenceProfileProps.builder()
    .geoRegion(CrossRegionInferenceProfileRegion.US)
    .model(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V1_0)
    .build());

// Use the cross-region profile with an agent
Agent agent = Agent.Builder.create(this, "Agent")
    .foundationModel(crossRegionProfile)
    .instruction("You are a helpful and friendly agent that answers questions about agriculture.")
    .build();
With Prompts
// Create a prompt router for intelligent model selection
PromptRouter promptRouter = PromptRouter.fromDefaultId(DefaultPromptRouterIdentifier.ANTHROPIC_CLAUDE_V1, "us-east-1");

// Use the prompt router with a prompt variant
IPromptVariant variant = PromptVariant.text(TextPromptVariantProps.builder()
    .variantName("variant1")
    .promptText("What is the capital of France?")
    .model(promptRouter)
    .build());

Prompt.Builder.create(this, "Prompt")
    .promptName("prompt-router-test")
    .variants(List.of(variant))
    .build();
Types of Inference Profiles
Amazon Bedrock offers two types of inference profiles:
Application Inference Profiles
Application inference profiles are user-defined profiles that help you track costs and model usage. They can be created for a single region or for multiple regions using a cross-region inference profile.
Single Region Application Profile
// Create an application inference profile for one Region
ApplicationInferenceProfile appProfile = ApplicationInferenceProfile.Builder.create(this, "MyApplicationProfile")
    .applicationInferenceProfileName("claude-3-sonnet-v1")
    .modelSource(BedrockFoundationModel.ANTHROPIC_CLAUDE_SONNET_V1_0)
    .description("Application profile for cost tracking")
    .tags(Map.of(
        "Environment", "Production"))
    .build();
Multi-Region Application Profile
// Create a cross-region inference profile
CrossRegionInferenceProfile crossRegionProfile = CrossRegionInferenceProfile.fromConfig(CrossRegionInferenceProfileProps.builder()
    .geoRegion(CrossRegionInferenceProfileRegion.US)
    .model(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V2_0)
    .build());

// Create an application inference profile across regions
ApplicationInferenceProfile appProfile = ApplicationInferenceProfile.Builder.create(this, "MyMultiRegionProfile")
    .applicationInferenceProfileName("claude-35-sonnet-v2-multi-region")
    .modelSource(crossRegionProfile)
    .description("Multi-region application profile for cost tracking")
    .build();
System Defined Inference Profiles
Cross-region inference enables you to seamlessly manage unplanned traffic bursts by utilizing compute across different AWS Regions. With cross-region inference, you can distribute traffic across multiple AWS Regions, enabling higher throughput and enhanced resilience during periods of peak demand.
Before using a CrossRegionInferenceProfile, ensure that you have access to the models and regions defined in the inference profiles. For instance, if you use the system defined inference profile "us.anthropic.claude-3-5-sonnet-20241022-v2:0", inference requests will be routed to US East (Virginia) us-east-1, US East (Ohio) us-east-2 and US West (Oregon) us-west-2. Thus, you need to have model access enabled in those regions for the model anthropic.claude-3-5-sonnet-20241022-v2:0.
System Defined Profile Configuration
CrossRegionInferenceProfile crossRegionProfile = CrossRegionInferenceProfile.fromConfig(CrossRegionInferenceProfileProps.builder()
    .geoRegion(CrossRegionInferenceProfileRegion.US)
    .model(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V2_0)
    .build());
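The system-defined profile identifier quoted earlier, "us.anthropic.claude-3-5-sonnet-20241022-v2:0", is the geographic prefix joined to the model ID with a dot. The sketch below makes that relationship explicit in plain Java. `InferenceProfileIdSketch` is a hypothetical helper that only manipulates strings and is inferred from that single example, not from the library's source.

```java
final class InferenceProfileIdSketch {
    /** Builds a profile identifier such as "us.<modelId>" from a geo prefix and a model ID. */
    static String profileId(String geoPrefix, String modelId) {
        return geoPrefix + "." + modelId;
    }

    /** Recovers the underlying model ID by dropping everything up to the first dot. */
    static String modelIdOf(String profileId) {
        int firstDot = profileId.indexOf('.');
        if (firstDot < 0) {
            throw new IllegalArgumentException("Not a prefixed profile identifier: " + profileId);
        }
        return profileId.substring(firstDot + 1);
    }

    public static void main(String[] args) {
        String id = profileId("us", "anthropic.claude-3-5-sonnet-20241022-v2:0");
        System.out.println(id);
        System.out.println(modelIdOf(id));
    }
}
```

Recovering the model ID this way can be handy when checking that model access is enabled for that model in each destination Region.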
Prompt Routers
Amazon Bedrock intelligent prompt routing provides a single serverless endpoint for efficiently routing requests between different foundation models within the same model family, which can help you optimize for response quality and cost. Intelligent prompt routing predicts the performance of each model for each request and dynamically routes each request to the model that it predicts is most likely to give the desired response at the lowest cost.
Default and Custom Prompt Routers
// Use a default prompt router
IPromptVariant variant = PromptVariant.text(TextPromptVariantProps.builder()
    .variantName("variant1")
    .promptText("What is the capital of France?")
    .model(PromptRouter.fromDefaultId(DefaultPromptRouterIdentifier.ANTHROPIC_CLAUDE_V1, "us-east-1"))
    .build());

Prompt.Builder.create(this, "Prompt")
    .promptName("prompt-router-test")
    .variants(List.of(variant))
    .build();
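The routing decision described in this section, picking the model most likely to give the desired response at the lowest cost, can be pictured with a toy selection rule. The sketch below is purely illustrative and is not how Amazon Bedrock implements routing: `PromptRoutingSketch`, the `Candidate` fields, the quality scores, and the tolerance parameter are all assumptions for the example.

```java
import java.util.Comparator;
import java.util.List;

final class PromptRoutingSketch {
    /** A model with a predicted quality for the current request and a relative cost. */
    static final class Candidate {
        final String modelId;
        final double predictedQuality;
        final double costPerRequest;

        Candidate(String modelId, double predictedQuality, double costPerRequest) {
            this.modelId = modelId;
            this.predictedQuality = predictedQuality;
            this.costPerRequest = costPerRequest;
        }
    }

    /**
     * Picks the cheapest candidate whose predicted quality is within
     * `tolerance` of the best predicted quality.
     */
    static Candidate route(List<Candidate> candidates, double tolerance) {
        if (candidates.isEmpty()) {
            throw new IllegalArgumentException("At least one candidate is required");
        }
        double best = candidates.stream().mapToDouble(c -> c.predictedQuality).max().getAsDouble();
        return candidates.stream()
                .filter(c -> c.predictedQuality >= best - tolerance)
                .min(Comparator.comparingDouble(c -> c.costPerRequest))
                .get();
    }

    public static void main(String[] args) {
        List<Candidate> models = List.of(
                new Candidate("small-model", 0.80, 1.0),
                new Candidate("large-model", 0.85, 5.0));
        System.out.println(route(models, 0.10).modelId);
        System.out.println(route(models, 0.01).modelId);
    }
}
```

With a generous tolerance the cheaper model wins because its predicted quality is close enough, while a tight tolerance sends the request to the stronger model.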
Inference Profile Permissions
Use the grantProfileUsage method to grant appropriate permissions to resources that need to use the inference profile.
Granting Profile Usage Permissions
// Create an application inference profile
ApplicationInferenceProfile profile = ApplicationInferenceProfile.Builder.create(this, "MyProfile")
    .applicationInferenceProfileName("my-profile")
    .modelSource(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V1_0)
    .build();

// Create a Lambda function
Function lambdaFunction = Function.Builder.create(this, "MyFunction")
    .runtime(Runtime.PYTHON_3_11)
    .handler("index.handler")
    .code(Code.fromInline("def handler(event, context): return \"Hello\""))
    .build();

// Grant the Lambda function permission to use the inference profile
profile.grantProfileUsage(lambdaFunction);

// Use a system defined inference profile
CrossRegionInferenceProfile crossRegionProfile = CrossRegionInferenceProfile.fromConfig(CrossRegionInferenceProfileProps.builder()
    .geoRegion(CrossRegionInferenceProfileRegion.US)
    .model(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V1_0)
    .build());

// Grant permissions to use the cross-region inference profile
crossRegionProfile.grantProfileUsage(lambdaFunction);
The grantProfileUsage method adds the necessary IAM permissions to the resource, allowing it to use the inference profile. This includes permissions to call the bedrock:GetInferenceProfile and bedrock:ListInferenceProfiles actions on the inference profile resource.
Inference Profiles Import Methods
You can import existing application inference profiles using the following methods:
// Import an inference profile through attributes
IInferenceProfile importedProfile = ApplicationInferenceProfile.fromApplicationInferenceProfileAttributes(this, "ImportedProfile", ApplicationInferenceProfileAttributes.builder()
    .inferenceProfileArn("arn:aws:bedrock:us-east-1:123456789012:application-inference-profile/my-profile-id")
    .inferenceProfileIdentifier("my-profile-id")
    .build());
You can also import an application inference profile from an existing L1 CloudFormation construct:
// Create or reference an existing L1 CfnApplicationInferenceProfile
CfnApplicationInferenceProfile cfnProfile = CfnApplicationInferenceProfile.Builder.create(this, "CfnProfile")
    .inferenceProfileName("my-cfn-profile")
    .modelSource(InferenceProfileModelSourceProperty.builder()
        .copyFrom(BedrockFoundationModel.ANTHROPIC_CLAUDE_3_5_SONNET_V1_0.getInvokableArn())
        .build())
    .description("Profile created via L1 construct")
    .build();

// Import the L1 construct as an L2 ApplicationInferenceProfile
IInferenceProfile importedFromCfn = ApplicationInferenceProfile.fromCfnApplicationInferenceProfile(cfnProfile);

// Grant permissions to use the imported profile
Function lambdaFunction = Function.Builder.create(this, "MyFunction")
    .runtime(Runtime.PYTHON_3_11)
    .handler("index.handler")
    .code(Code.fromInline("def handler(event, context): return \"Hello\""))
    .build();

importedFromCfn.grantProfileUsage(lambdaFunction);
-
ClassDescription(experimental) Defines how fulfillment of the action group is handled after the necessary information has been elicited from the user.(experimental) Class to create (or import) an Agent with CDK.(experimental) A fluent builder for
Agent
.(experimental) **************************************************************************** DEF - Action Group Class ***************************************************************************.(experimental) A fluent builder forAgentActionGroup
.(experimental) **************************************************************************** PROPS - Action Group Class ***************************************************************************.A builder forAgentActionGroupProps
An implementation forAgentActionGroupProps
(experimental) Class to create an Agent Alias with CDK.(experimental) A fluent builder forAgentAlias
.(experimental) Attributes needed to create an import.A builder forAgentAliasAttributes
An implementation forAgentAliasAttributes
(experimental) Abstract base class for an Agent.(experimental) Properties for creating a CDK-Managed Agent Alias.A builder forAgentAliasProps
An implementation forAgentAliasProps
(experimental) Attributes for specifying an imported Bedrock Agent.A builder forAgentAttributes
An implementation forAgentAttributes
(experimental) Abstract base class for an Agent.(experimental) Class to manage agent collaboration configuration.(experimental) A fluent builder forAgentCollaboration
.(experimental) Configuration for agent collaboration settings.A builder forAgentCollaborationConfig
An implementation forAgentCollaborationConfig
(experimental) **************************************************************************** Agent Collaborator Class ***************************************************************************.(experimental) A fluent builder forAgentCollaborator
.(experimental) **************************************************************************** PROPS - Agent Collaborator Class ***************************************************************************.A builder forAgentCollaboratorProps
An implementation forAgentCollaboratorProps
(experimental) Enum for collaborator's relay conversation history types.(experimental) Properties for creating an agent GenAI resource configuration.A builder forAgentGenAiResourceProps
An implementation forAgentGenAiResourceProps
(experimental) Properties for creating an agent prompt variant.A builder forAgentPromptVariantProps
An implementation forAgentPromptVariantProps
(experimental) Properties for creating a CDK managed Bedrock Agent.A builder forAgentProps
An implementation forAgentProps
(experimental) The step in the agent sequence that this prompt configuration applies to.
(experimental) Represents the concept of an API Schema for a Bedrock Agent Action Group.
(experimental) Class to create an Application Inference Profile with CDK.
(experimental) A fluent builder for ApplicationInferenceProfile.
(experimental) Attributes for specifying an imported Application Inference Profile.
A builder for ApplicationInferenceProfileAttributes
An implementation for ApplicationInferenceProfileAttributes
(experimental) Properties for creating an Application Inference Profile.
A builder for ApplicationInferenceProfileProps
An implementation for ApplicationInferenceProfileProps
(experimental) API Schema from a local asset.
(experimental) Bedrock models.
(experimental) A fluent builder for BedrockFoundationModel.
(experimental) Properties for configuring a Bedrock Foundation Model.
A builder for BedrockFoundationModelProps
An implementation for BedrockFoundationModelProps
(experimental) Types of PII specific to Canada.
(experimental) Represents a message in a chat conversation.
(experimental) The role of a message in a chat conversation.
(experimental) Properties for creating a chat prompt variant.
A builder for ChatPromptVariantProps
An implementation for ChatPromptVariantProps
(experimental) Properties for creating a chat template configuration.
A builder for ChatTemplateConfigurationProps
An implementation for ChatTemplateConfigurationProps
(experimental) Common properties for all prompt variants.
A builder for CommonPromptVariantProps
An implementation for CommonPromptVariantProps
(experimental) Interface to declare a content filter.
A builder for ContentFilter
An implementation for ContentFilter
(experimental) The strength of the content filter.
(experimental) The type of harmful category usable in a content filter.
(experimental) Interface to define a Contextual Grounding Filter.
A builder for ContextualGroundingFilter
An implementation for ContextualGroundingFilter
(experimental) The type of contextual grounding filter.
(experimental) Cross-region inference enables you to seamlessly manage unplanned traffic bursts by utilizing compute across different AWS Regions.
(experimental) Properties for creating a Cross-Region Inference Profile.
A builder for CrossRegionInferenceProfileProps
An implementation for CrossRegionInferenceProfileProps
(experimental) Geographic regions supported for cross-region inference profiles.
(experimental) The type of custom control for the action group executor.
(experimental) Contains details about the Lambda function containing the orchestration logic carried out upon invoking the custom orchestration.
(experimental) Properties for configuring a custom Lambda parser for prompt overrides.
A builder for CustomParserProps
An implementation for CustomParserProps
(experimental) Interface for creating a custom Topic.
A builder for CustomTopicProps
An implementation for CustomTopicProps
(experimental) Represents identifiers for default prompt routers in Bedrock.
(experimental) Types of PII in the domain of Finance.
(experimental) Represents a function in a function schema.
(experimental) A fluent builder for Function.
(experimental) Represents a function parameter in a function schema.
(experimental) A fluent builder for FunctionParameter.
(experimental) Properties for a function parameter.
A builder for FunctionParameterProps
An implementation for FunctionParameterProps
(experimental) Properties for a function in a function schema.
A builder for FunctionProps
An implementation for FunctionProps
(experimental) Represents a function schema for a Bedrock Agent Action Group.
(experimental) A fluent builder for FunctionSchema.
(experimental) Properties for a function schema.
A builder for FunctionSchemaProps
An implementation for FunctionSchemaProps
(experimental) Properties for creating a function tool.
A builder for FunctionToolProps
An implementation for FunctionToolProps
(experimental) Types of PII that are general, and not domain-specific.
(experimental) Class to create a Guardrail with CDK.
(experimental) A fluent builder for Guardrail.
(experimental) Guardrail action when a sensitive entity is detected.
(experimental) Attributes for an imported Guardrail construct.
A builder for GuardrailAttributes
An implementation for GuardrailAttributes
(experimental) Abstract base class for a Guardrail.
(experimental) GuardrailCrossRegionConfigProperty.
A builder for GuardrailCrossRegionConfigProperty
An implementation for GuardrailCrossRegionConfigProperty
(experimental) Properties for creating a Guardrail.
A builder for GuardrailProps
An implementation for GuardrailProps
(experimental) Represents an Agent, either created with CDK or imported.
Internal default implementation for IAgent.
A proxy class which represents a concrete JavaScript instance of this type.
(experimental) Represents an Agent Alias, either created with CDK or imported.
Internal default implementation for IAgentAlias.
A proxy class which represents a concrete JavaScript instance of this type.
(experimental) Represents an Amazon Bedrock abstraction on which you can run the Invoke API.
Internal default implementation for IBedrockInvokable.
A proxy class which represents a concrete JavaScript instance of this type.
(experimental) Represents a Guardrail, either created with CDK or imported.
Internal default implementation for IGuardrail.
A proxy class which represents a concrete JavaScript instance of this type.
(experimental) Represents an Inference Profile, either created with CDK or imported.
Internal default implementation for IInferenceProfile.
A proxy class which represents a concrete JavaScript instance of this type.
(experimental) LLM inference configuration.
A builder for InferenceConfiguration
An implementation for InferenceConfiguration
(experimental) Abstract base class for an Inference Profile.
(experimental) These are the values used by the API when using aws bedrock get-inference-profile --inference-profile-identifier XXXXXXX.
(experimental) Types of PII in the domain of IT (Information Technology).
(experimental) Class to define an API Schema from an inline string.
(experimental) Represents a Prompt, either created with CDK or imported.
Internal default implementation for IPrompt.
A proxy class which represents a concrete JavaScript instance of this type.
(experimental) Represents a Prompt Router, which provides intelligent routing between different models.
Internal default implementation for IPromptRouter.
A proxy class which represents a concrete JavaScript instance of this type.
(experimental) Interface representing a prompt variant configuration.
Internal default implementation for IPromptVariant.
A proxy class which represents a concrete JavaScript instance of this type.
(experimental) Interface for managed word list filters.
A builder for ManagedWordFilter
An implementation for ManagedWordFilter
(experimental) Managed word list filter types supported by Amazon Bedrock.
(experimental) Memory class for managing Bedrock Agent memory configurations.
(experimental) A fluent builder for Memory.
(experimental) The type of modality that can be used in content filters.
(experimental) Enum for orchestration types available for agents.
(experimental) Enum for parameter types in function schemas.
(experimental) AWS Defined signatures for enabling certain capabilities in your agent.
(experimental) Interface to define a PII Filter.
A builder for PIIFilter
An implementation for PIIFilter
(experimental) Abstract base class for all PII types.
(experimental) Class to create (or import) a Prompt with CDK.
(experimental) A fluent builder for Prompt.
(experimental) Attributes for specifying an imported Bedrock Prompt.
A builder for PromptAttributes
An implementation for PromptAttributes
(experimental) Abstract base class for a Prompt.
(experimental) Abstract base class for prompt GenAI resource configurations.
(experimental) Abstract base class for prompt inference configurations.
(experimental) Properties for creating a prompt inference configuration.
A builder for PromptInferenceConfigurationProps
An implementation for PromptInferenceConfigurationProps
(experimental) Configuration for the knowledge base response generation step.
A builder for PromptKnowledgeBaseResponseGenerationConfigCustomParser
An implementation for PromptKnowledgeBaseResponseGenerationConfigCustomParser
(experimental) Configuration for the memory summarization step.
A builder for PromptMemorySummarizationConfigCustomParser
An implementation for PromptMemorySummarizationConfigCustomParser
(experimental) Configuration for the orchestration step.
A builder for PromptOrchestrationConfigCustomParser
An implementation for PromptOrchestrationConfigCustomParser
(experimental) Configuration for overriding prompt templates and behaviors in different parts of an agent's sequence.
(experimental) Configuration for the post-processing step.
A builder for PromptPostProcessingConfigCustomParser
An implementation for PromptPostProcessingConfigCustomParser
(experimental) Configuration for the pre-processing step.
A builder for PromptPreProcessingConfigCustomParser
An implementation for PromptPreProcessingConfigCustomParser
(experimental) Properties for creating a CDK managed Bedrock Prompt.
A builder for PromptProps
An implementation for PromptProps
(experimental) Amazon Bedrock intelligent prompt routing provides a single serverless endpoint for efficiently routing requests between different foundational models within the same model family.
(experimental) A fluent builder for PromptRouter.
(experimental) Properties for configuring a Prompt Router.
A builder for PromptRouterProps
An implementation for PromptRouterProps
(experimental) Configuration for the routing classifier step.
A builder for PromptRoutingClassifierConfigCustomParser
An implementation for PromptRoutingClassifierConfigCustomParser
(experimental) Base configuration interface for all prompt step types.
A builder for PromptStepConfigBase
An implementation for PromptStepConfigBase
(experimental) Abstract base class for prompt template configurations.
(experimental) The type of prompt template.
(experimental) Factory class for creating prompt variants.
(experimental) Class to create a Prompt Version with CDK.
(experimental) A fluent builder for PromptVersion.
(experimental) Properties for creating a CDK managed Bedrock Prompt Version.
A builder for PromptVersionProps
An implementation for PromptVersionProps
(experimental) A regular expression (regex) filter for sensitive information.
A builder for RegexFilter
An implementation for RegexFilter
(experimental) Enum for require confirmation state in function schemas.
(experimental) Class to define an API Schema from an S3 object.
(experimental) Properties for SessionSummaryConfiguration.
A builder for SessionSummaryMemoryProps
An implementation for SessionSummaryMemoryProps
(experimental) Properties for creating a text prompt variant.
A builder for TextPromptVariantProps
An implementation for TextPromptVariantProps
(experimental) Properties for creating a text template configuration.
A builder for TextTemplateConfigurationProps
An implementation for TextTemplateConfigurationProps
(experimental) Tier configuration.
(experimental) Abstract base class for tools that can be used by the model.
(experimental) Defines how the model should choose which tool to use.
(experimental) Configuration for tools available to the model.
A builder for ToolConfiguration
An implementation for ToolConfiguration
(experimental) Represents predefined topics that can be used to filter content.
(experimental) Types of PII specific to the United Kingdom (UK).
(experimental) Types of PII specific to the USA.
(experimental) The data type for the vectors when using a model to convert text into vector embeddings.
(experimental) Interface to define a Word Filter.
A builder for WordFilter
An implementation for WordFilter