Document enrichment in Amazon Q Business - Amazon Q Business

Document enrichment in Amazon Q Business

The Amazon Q Business document enrichment feature helps you control both what documents and document attributes are ingested into your index and also how they're ingested. Using document enrichment, you can create, modify, or delete document attributes and document content when you ingest them into your Amazon Q Business index.

Document enrichment offers two kinds of methods that you can use for your solution:

  • Configure basic operations – Use basic operations to add, update, or delete document attributes from your data. For example, you can scrub personally identifiable information (PII) by choosing to delete any document attributes related to PII.

  • Configure Lambda functions – Use a preconfigured Lambda function to perform more customized, advanced document attribute manipulation logic to your data. For example, your enterprise data might be stored as scanned images. In that case, you can use a Lambda function to run Optical Character recognition (OCR) on the scanned documents to extract text from them. Then, each scanned document is treated as a text document during ingestion. Finally, during chat, Amazon Q Business will factor the textual data extracted from the scanned documents when it generates responses.

When you implement your solution, you can choose to use both document enrichment methods together. That is, you can use basic operations to do a first parse of your data and then use a Lambda function for more complex operations. For example, you could first use a basic function to remove all PII information from your documents using document attributes. Then, use a Lambda function to extract text from scanned documents.

Document enrichment is supported both on the AWS Management Console and by Amazon Q Business API actions. If you use the console, you can only enrich documents connected to your application environment using an Amazon Q Business data source.

Note

Document enrichment is only supported in an Amazon Q Business application environment if you use an Amazon Q Business native retriever. If you use an Amazon Kendra retriever, you should configure document enrichment in Amazon Kendra.