@Generated(value="com.amazonaws:aws-java-sdk-code-generator") public class AugmentedManifestsListItem extends Object implements Serializable, Cloneable, StructuredPojo
An augmented manifest file that provides training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
Constructor and Description |
---|
AugmentedManifestsListItem() |
Modifier and Type | Method and Description |
---|---|
AugmentedManifestsListItem |
clone() |
boolean |
equals(Object obj) |
String |
getAnnotationDataS3Uri()
The S3 prefix to the annotation files that are referred in the augmented manifest file.
|
List<String> |
getAttributeNames()
The JSON attribute that contains the annotations for your training documents.
|
String |
getDocumentType()
The type of augmented manifest.
|
String |
getS3Uri()
The Amazon S3 location of the augmented manifest file.
|
String |
getSourceDocumentsS3Uri()
The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.
|
String |
getSplit()
The purpose of the data you've provided in the augmented manifest.
|
int |
hashCode() |
void |
marshall(ProtocolMarshaller protocolMarshaller)
Marshalls this structured data using the given
ProtocolMarshaller . |
void |
setAnnotationDataS3Uri(String annotationDataS3Uri)
The S3 prefix to the annotation files that are referred in the augmented manifest file.
|
void |
setAttributeNames(Collection<String> attributeNames)
The JSON attribute that contains the annotations for your training documents.
|
void |
setDocumentType(String documentType)
The type of augmented manifest.
|
void |
setS3Uri(String s3Uri)
The Amazon S3 location of the augmented manifest file.
|
void |
setSourceDocumentsS3Uri(String sourceDocumentsS3Uri)
The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.
|
void |
setSplit(String split)
The purpose of the data you've provided in the augmented manifest.
|
String |
toString()
Returns a string representation of this object.
|
AugmentedManifestsListItem |
withAnnotationDataS3Uri(String annotationDataS3Uri)
The S3 prefix to the annotation files that are referred in the augmented manifest file.
|
AugmentedManifestsListItem |
withAttributeNames(Collection<String> attributeNames)
The JSON attribute that contains the annotations for your training documents.
|
AugmentedManifestsListItem |
withAttributeNames(String... attributeNames)
The JSON attribute that contains the annotations for your training documents.
|
AugmentedManifestsListItem |
withDocumentType(AugmentedManifestsDocumentTypeFormat documentType)
The type of augmented manifest.
|
AugmentedManifestsListItem |
withDocumentType(String documentType)
The type of augmented manifest.
|
AugmentedManifestsListItem |
withS3Uri(String s3Uri)
The Amazon S3 location of the augmented manifest file.
|
AugmentedManifestsListItem |
withSourceDocumentsS3Uri(String sourceDocumentsS3Uri)
The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.
|
AugmentedManifestsListItem |
withSplit(Split split)
The purpose of the data you've provided in the augmented manifest.
|
AugmentedManifestsListItem |
withSplit(String split)
The purpose of the data you've provided in the augmented manifest.
|
public void setS3Uri(String s3Uri)
The Amazon S3 location of the augmented manifest file.
s3Uri
- The Amazon S3 location of the augmented manifest file.public String getS3Uri()
The Amazon S3 location of the augmented manifest file.
public AugmentedManifestsListItem withS3Uri(String s3Uri)
The Amazon S3 location of the augmented manifest file.
s3Uri
- The Amazon S3 location of the augmented manifest file.public void setSplit(String split)
The purpose of the data you've provided in the augmented manifest. You can either train or test this data. If you don't specify, the default is train.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
split
- The purpose of the data you've provided in the augmented manifest. You can either train or test this data.
If you don't specify, the default is train.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
Split
public String getSplit()
The purpose of the data you've provided in the augmented manifest. You can either train or test this data. If you don't specify, the default is train.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
Split
public AugmentedManifestsListItem withSplit(String split)
The purpose of the data you've provided in the augmented manifest. You can either train or test this data. If you don't specify, the default is train.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
split
- The purpose of the data you've provided in the augmented manifest. You can either train or test this data.
If you don't specify, the default is train.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
Split
public AugmentedManifestsListItem withSplit(Split split)
The purpose of the data you've provided in the augmented manifest. You can either train or test this data. If you don't specify, the default is train.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
split
- The purpose of the data you've provided in the augmented manifest. You can either train or test this data.
If you don't specify, the default is train.
TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.
TEST - all of the documents in the manifest will be used for testing.
Split
public List<String> getAttributeNames()
The JSON attribute that contains the annotations for your training documents. The number of attribute names that you specify depends on whether your augmented manifest file is the output of a single labeling job or a chained labeling job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
public void setAttributeNames(Collection<String> attributeNames)
The JSON attribute that contains the annotations for your training documents. The number of attribute names that you specify depends on whether your augmented manifest file is the output of a single labeling job or a chained labeling job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
attributeNames
- The JSON attribute that contains the annotations for your training documents. The number of attribute
names that you specify depends on whether your augmented manifest file is the output of a single labeling
job or a chained labeling job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
public AugmentedManifestsListItem withAttributeNames(String... attributeNames)
The JSON attribute that contains the annotations for your training documents. The number of attribute names that you specify depends on whether your augmented manifest file is the output of a single labeling job or a chained labeling job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
NOTE: This method appends the values to the existing list (if any). Use
setAttributeNames(java.util.Collection)
or withAttributeNames(java.util.Collection)
if you want
to override the existing values.
attributeNames
- The JSON attribute that contains the annotations for your training documents. The number of attribute
names that you specify depends on whether your augmented manifest file is the output of a single labeling
job or a chained labeling job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
public AugmentedManifestsListItem withAttributeNames(Collection<String> attributeNames)
The JSON attribute that contains the annotations for your training documents. The number of attribute names that you specify depends on whether your augmented manifest file is the output of a single labeling job or a chained labeling job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
attributeNames
- The JSON attribute that contains the annotations for your training documents. The number of attribute
names that you specify depends on whether your augmented manifest file is the output of a single labeling
job or a chained labeling job.
If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.
If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.
public void setAnnotationDataS3Uri(String annotationDataS3Uri)
The S3 prefix to the annotation files that are referred in the augmented manifest file.
annotationDataS3Uri
- The S3 prefix to the annotation files that are referred in the augmented manifest file.public String getAnnotationDataS3Uri()
The S3 prefix to the annotation files that are referred in the augmented manifest file.
public AugmentedManifestsListItem withAnnotationDataS3Uri(String annotationDataS3Uri)
The S3 prefix to the annotation files that are referred in the augmented manifest file.
annotationDataS3Uri
- The S3 prefix to the annotation files that are referred in the augmented manifest file.public void setSourceDocumentsS3Uri(String sourceDocumentsS3Uri)
The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.
sourceDocumentsS3Uri
- The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.public String getSourceDocumentsS3Uri()
The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.
public AugmentedManifestsListItem withSourceDocumentsS3Uri(String sourceDocumentsS3Uri)
The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.
sourceDocumentsS3Uri
- The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.public void setDocumentType(String documentType)
The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If you don't specify, the default is PlainTextDocument.
PLAIN_TEXT_DOCUMENT
A document type that represents any unicode text that is encoded in UTF-8.
SEMI_STRUCTURED_DOCUMENT
A document type with positional and structural context, like a PDF. For
training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support PDFs, DOCX and
TXT.
documentType
- The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If you don't specify, the
default is PlainTextDocument.
PLAIN_TEXT_DOCUMENT
A document type that represents any unicode text that is encoded in
UTF-8.
SEMI_STRUCTURED_DOCUMENT
A document type with positional and structural context, like a PDF.
For training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support
PDFs, DOCX and TXT.
AugmentedManifestsDocumentTypeFormat
public String getDocumentType()
The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If you don't specify, the default is PlainTextDocument.
PLAIN_TEXT_DOCUMENT
A document type that represents any unicode text that is encoded in UTF-8.
SEMI_STRUCTURED_DOCUMENT
A document type with positional and structural context, like a PDF. For
training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support PDFs, DOCX and
TXT.
PLAIN_TEXT_DOCUMENT
A document type that represents any unicode text that is encoded in
UTF-8.
SEMI_STRUCTURED_DOCUMENT
A document type with positional and structural context, like a PDF.
For training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support
PDFs, DOCX and TXT.
AugmentedManifestsDocumentTypeFormat
public AugmentedManifestsListItem withDocumentType(String documentType)
The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If you don't specify, the default is PlainTextDocument.
PLAIN_TEXT_DOCUMENT
A document type that represents any unicode text that is encoded in UTF-8.
SEMI_STRUCTURED_DOCUMENT
A document type with positional and structural context, like a PDF. For
training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support PDFs, DOCX and
TXT.
documentType
- The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If you don't specify, the
default is PlainTextDocument.
PLAIN_TEXT_DOCUMENT
A document type that represents any unicode text that is encoded in
UTF-8.
SEMI_STRUCTURED_DOCUMENT
A document type with positional and structural context, like a PDF.
For training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support
PDFs, DOCX and TXT.
AugmentedManifestsDocumentTypeFormat
public AugmentedManifestsListItem withDocumentType(AugmentedManifestsDocumentTypeFormat documentType)
The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If you don't specify, the default is PlainTextDocument.
PLAIN_TEXT_DOCUMENT
A document type that represents any unicode text that is encoded in UTF-8.
SEMI_STRUCTURED_DOCUMENT
A document type with positional and structural context, like a PDF. For
training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support PDFs, DOCX and
TXT.
documentType
- The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If you don't specify, the
default is PlainTextDocument.
PLAIN_TEXT_DOCUMENT
A document type that represents any unicode text that is encoded in
UTF-8.
SEMI_STRUCTURED_DOCUMENT
A document type with positional and structural context, like a PDF.
For training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support
PDFs, DOCX and TXT.
AugmentedManifestsDocumentTypeFormat
public String toString()
toString
in class Object
Object.toString()
public AugmentedManifestsListItem clone()
public void marshall(ProtocolMarshaller protocolMarshaller)
StructuredPojo
ProtocolMarshaller
.marshall
in interface StructuredPojo
protocolMarshaller
- Implementation of ProtocolMarshaller
used to marshall this object's data.