了解清单摘要 - Rekognition

本文属于机器翻译版本。若本译文内容与英语原文存在差异,则一律以英文原文为准。

了解清单摘要

清单摘要包含以下信息。

如果没有终止性清单文件错误,则会在训练期间创建清单摘要。要获取清单摘要文件 (manifest_summary.json) 的位置,请参阅获取验证结果

注意

清单摘要中不会报告服务错误清单文件错误。有关更多信息,请参阅 终止性错误

有关特定清单内容错误的信息,请参阅终止性清单内容错误

清单摘要文件格式

清单文件包含 2 个部分:statisticserrors

statistics

statistics 包含有关训练和测试数据集中错误的信息。

  • training:训练数据集中的统计信息和发现的错误。

  • testing:测试数据集中的统计信息和发现的错误。

errors 数组中的对象包含清单内容错误的错误代码和错误消息。

error_line_indices 数组包含训练或测试清单中存在错误的每个 JSON 行的行号。有关更多信息,请参阅 修复训练错误

错误

跨越训练和测试数据集的错误。例如,当没有足够的可用标签与训练和测试数据集重叠时,就会发生 ERROR_INSUFFICIENT_USABLE_LABEL_OVERLAP

{ "statistics": { "training": { "use_case": String, # Possible values are IMAGE_LEVEL_LABELS, OBJECT_LOCALIZATION and NOT_DETERMINED "total_json_lines": Number, # Total number json lines (images) in the training manifest. "valid_json_lines": Number, # Total number of JSON Lines (images) that can be used for training. "invalid_json_lines": Number, # Total number of invalid JSON Lines. They are not used for training. "ignored_json_lines": Number, # JSON Lines that have a valid schema but have no annotations. The aren't used for training and aren't counted as invalid. "error_json_line_indices": List[int], # Contains a list of line numbers for JSON line errors in the training dataset. "errors": [ { "code": String, # Error code for a training manifest content error. "message": String # Description for a training manifest content error. } ] }, "testing": { "use_case": String, # Possible values are IMAGE_LEVEL_LABELS, OBJECT_LOCALIZATION and NOT_DETERMINED "total_json_lines": Number, # Total number json lines (images) in the manifest. "valid_json_lines": Number, # Total number of JSON Lines (images) that can be used for testing. "invalid_json_lines": Number, # Total number of invalid JSON Lines. They are not used for testing. "ignored_json_lines": Number, # JSON Lines that have a valid schema but have no annotations. They aren't used for testing and aren't counted as invalid. "error_json_line_indices": List[int], # contains a list of error record line numbers in testing dataset. "errors": [ { "code": String, # # Error code for a testing manifest content error. "message": String # Description for a testing manifest content error. } ] } }, "errors": [ { "code": String, # # Error code for errors that span the training and testing datasets. "message": String # Description of the error. } ] }

示例清单摘要

以下示例是显示终止性清单内容错误 (ERROR_TOO_MANY_INVALID_ROWS_IN_MANIFEST) 的一部分清单摘要。error_json_line_indices 数组包含相应训练或测试验证清单中非终止性 JSON 行错误的行号。

{ "errors": [], "statistics": { "training": { "use_case": "NOT_DETERMINED", "total_json_lines": 301, "valid_json_lines": 146, "invalid_json_lines": 155, "ignored_json_lines": 0, "errors": [ { "code": "ERROR_TOO_MANY_INVALID_ROWS_IN_MANIFEST", "message": "The manifest file contains too many invalid rows." } ], "error_json_line_indices": [ 15, 16, 17, 22, 23, 24, . . . . 300 ] }, "testing": { "use_case": "NOT_DETERMINED", "total_json_lines": 15, "valid_json_lines": 13, "invalid_json_lines": 2, "ignored_json_lines": 0, "errors": [], "error_json_line_indices": [ 13, 15 ] } } }