DatasetEntityRecognizerDocuments - Amazon Comprehend API Reference

DatasetEntityRecognizerDocuments

Describes the documents submitted with a dataset for an entity recognizer model.

Contents

S3Uri

Specifies the Amazon S3 location where the documents for the dataset are located.

Type: String

Length Constraints: Maximum length of 1024.

Pattern: s3://[a-z0-9][\.\-a-z0-9]{1,61}[a-z0-9](/.*)?

Required: Yes

InputFormat

Specifies how the text in an input file should be processed. This is optional, and the default is ONE_DOC_PER_LINE. ONE_DOC_PER_FILE - Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers. ONE_DOC_PER_LINE - Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.

Type: String

Valid Values: ONE_DOC_PER_FILE | ONE_DOC_PER_LINE

Required: No

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: