Skip to content

/AWS1/CL_BDACHUNKINGCONF

Details about how to chunk the documents in the data source. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried.

CONSTRUCTOR

IMPORTING

Required arguments:

IV_CHUNKINGSTRATEGY TYPE /AWS1/BDACHUNKINGSTRATEGY /AWS1/BDACHUNKINGSTRATEGY

Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for NONE, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.

  • FIXED_SIZE – Amazon Bedrock splits your source data into chunks of the approximate size that you set in the fixedSizeChunkingConfiguration.

  • HIERARCHICAL – Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

  • SEMANTIC – Split documents into chunks based on groups of similar content derived with natural language processing.

  • NONE – Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.

Optional arguments:

IO_FIXEDSIZECHUNKINGCONF TYPE REF TO /AWS1/CL_BDAFIXEDSIZECHUNKIN00 /AWS1/CL_BDAFIXEDSIZECHUNKIN00

Configurations for when you choose fixed-size chunking. If you set the chunkingStrategy as NONE, exclude this field.

IO_HIERARCHICALCHUNKINGCONF TYPE REF TO /AWS1/CL_BDAHIERARCHICALCHUN00 /AWS1/CL_BDAHIERARCHICALCHUN00

Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

IO_SEMANTICCHUNKINGCONF TYPE REF TO /AWS1/CL_BDASEMANTICCHUNKING00 /AWS1/CL_BDASEMANTICCHUNKING00

Settings for semantic document chunking for a data source. Semantic chunking splits a document into into smaller documents based on groups of similar content derived from the text with natural language processing.


Queryable Attributes

chunkingStrategy

Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for NONE, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.

  • FIXED_SIZE – Amazon Bedrock splits your source data into chunks of the approximate size that you set in the fixedSizeChunkingConfiguration.

  • HIERARCHICAL – Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

  • SEMANTIC – Split documents into chunks based on groups of similar content derived with natural language processing.

  • NONE – Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.

Accessible with the following methods

Method Description
GET_CHUNKINGSTRATEGY() Getter for CHUNKINGSTRATEGY, with configurable default
ASK_CHUNKINGSTRATEGY() Getter for CHUNKINGSTRATEGY w/ exceptions if field has no va
HAS_CHUNKINGSTRATEGY() Determine if CHUNKINGSTRATEGY has a value

fixedSizeChunkingConfiguration

Configurations for when you choose fixed-size chunking. If you set the chunkingStrategy as NONE, exclude this field.

Accessible with the following methods

Method Description
GET_FIXEDSIZECHUNKINGCONF() Getter for FIXEDSIZECHUNKINGCONF

hierarchicalChunkingConfiguration

Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

Accessible with the following methods

Method Description
GET_HIERARCHICALCHUNKINGCONF() Getter for HIERARCHICALCHUNKINGCONF

semanticChunkingConfiguration

Settings for semantic document chunking for a data source. Semantic chunking splits a document into into smaller documents based on groups of similar content derived from the text with natural language processing.

Accessible with the following methods

Method Description
GET_SEMANTICCHUNKINGCONF() Getter for SEMANTICCHUNKINGCONF