Amazon Comprehend
Developer Guide

Running an Asynchronous Custom Classification Job

Once you have created a custom document classifier, you can use it to categorize a group of documents.

To create a custom classification job (asynchronous)

  1. Sign in to the AWS Management Console and open the Amazon Comprehend console.

  2. From the left menu, choose Customization and then choose Custom classification.

  3. Give the classification job a name. The name must be unique in the region and account.

  4. From the Analysis type drop-down list, choose Custom classification.

  5. From the Select classifier drop-down list, select the document classifier to use.

  6. (Optional) If you choose to encrypt the data in the storage volume while your classification job is processed, choose Job encryption and then choose whether to use a KMS key associated with the current account, or one from another account.

    • If you are using a key associated with the current account, for KMS key ID, choose the key ID.

    • If you are using a key associated with a different account, for KMS key ARN, enter the ARN for the key ID.

    Note

    For more information on creating and using KMS keys and the associated encryption, see Key Management Service (KMS).

  7. Under S3 data location, search for or enter the location of the Amazon S3 bucket that contains the training documents you want to use to train your classifier. The bucket must be in the same Region as the API that you are calling. Additionally, the total size of the training documents must be less than 5 GB, and you can provide up to 250 classification labels.

  8. Under Input format, choose the format of the documents to be classified: either one document per file, or one document per line in a file.

  9. Under Input labels S3 location, search for or enter the location of the Amazon S3 bucket that contains the input labels. If your data is in the one document per file format, choose the S3 location of the label file.

  10. (Optional) If you choose to encrypt the output result from your job, choose Encryption and then choose whether to use a KMS key associated with the current account, or one from another account.

    • If you are using a key associated with the current account, for KMS key ID, choose the key alias or ID.

    • If you are using a key associated with a different account, for KMS key ID, enter the ARN for the key alias or ID.

  11. Choose Create to create the document classification job.
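
The console steps above have a rough programmatic equivalent in the StartDocumentClassificationJob API operation. The following is a minimal sketch using the AWS SDK for Python (Boto3), not a definitive implementation. The helper names build_classification_job_request and start_classification_job are hypothetical, and every ARN, bucket, and Region shown in the commented usage is a placeholder. The API also requires an IAM data access role ARN, which the console procedure above does not cover.

```python
from typing import Any, Dict, Optional


def build_classification_job_request(
    job_name: str,
    classifier_arn: str,
    input_s3_uri: str,
    output_s3_uri: str,
    data_access_role_arn: str,
    one_doc_per_line: bool = True,
    volume_kms_key_id: Optional[str] = None,
    output_kms_key_id: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble keyword arguments for start_document_classification_job.

    Hypothetical helper: it only builds a dictionary and makes no AWS calls.
    """
    request: Dict[str, Any] = {
        "JobName": job_name,                      # step 3: job name
        "DocumentClassifierArn": classifier_arn,  # step 5: classifier to use
        "InputDataConfig": {
            "S3Uri": input_s3_uri,                # step 7: S3 data location
            # step 8: one document per line, or one document per file
            "InputFormat": "ONE_DOC_PER_LINE" if one_doc_per_line else "ONE_DOC_PER_FILE",
        },
        "OutputDataConfig": {"S3Uri": output_s3_uri},
        # Required by the API; not part of the console steps above.
        "DataAccessRoleArn": data_access_role_arn,
    }
    if volume_kms_key_id:
        # step 6 (optional): encrypt the storage volume used by the job
        request["VolumeKmsKeyId"] = volume_kms_key_id
    if output_kms_key_id:
        # step 10 (optional): encrypt the job output
        request["OutputDataConfig"]["KmsKeyId"] = output_kms_key_id
    return request


def start_classification_job(request: Dict[str, Any], region_name: str) -> str:
    """Submit the job and return its ID. Needs boto3 and AWS credentials."""
    import boto3  # imported here so the builder above works without boto3

    client = boto3.client("comprehend", region_name=region_name)
    response = client.start_document_classification_job(**request)
    return response["JobId"]


# Example usage (all values are placeholders; uncomment to submit a real job):
# request = build_classification_job_request(
#     job_name="my-classification-job",
#     classifier_arn="arn:aws:comprehend:us-west-2:111122223333:document-classifier/example",
#     input_s3_uri="s3://amzn-s3-demo-bucket/input/",
#     output_s3_uri="s3://amzn-s3-demo-bucket/output/",
#     data_access_role_arn="arn:aws:iam::111122223333:role/example-role",
# )
# job_id = start_classification_job(request, region_name="us-west-2")
```

Keeping the request builder separate from the API call makes it possible to inspect or log the request before submitting it. The Region passed to the client should match the Region of the S3 bucket, as noted in step 7.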