Document attributes - Amazon Kendra

Document attributes

A document has attributes associated with it. Attributes of a document are the properties of a document or what is contained within the structure of a document. For example, each of your documents might contain title, body text, and author. You can also add your own custom attributes of your documents. Custom attributes are attributes that you specify for your own needs. For example, if your index searches tax documents, you might specify a custom attribute for the type of tax document such as W-2, 1099, and so on.

Before you can use a document attribute in a query, it must be mapped to a database field. For example, the title attribute can be mapped to the field _document_title. For more information, see Index fields.

You can use document attributes to filter responses and to make faceted search suggestions. For example, you can filter a response to only return a specific version of a document, or you can filter searches to only return 1099 tax documents that match the search term. For more information, see Filtering queries.

You can also use document attributes to manually tune the query response. For example, you can choose to increase the importance of the title field to increase the weight that Amazon Kendra assigns to the field when determining which documents to return in the response. For more information, see Tuning search relevance.

Before you can add an attribute, you must create an index field to map the attribute to. You create index fields using the console or by using the UpdateIndex operation.

If you are adding a document directly to an index, you specify the attributes in the Document input parameter to the BatchPutDocument operation. You specify the custom attribute values in a DocumentAttribute object array. If you are using a data source, the method that you use to add the document attributes depends on the data source. For more information, see Creating custom document attributes.