Evaluate data quality in the data catalog - Amazon SageMaker Unified Studio

Use the Data quality tab on any catalog table to define rules, run them on demand or on a schedule, and track quality scores over time without building a pipeline. This lets you monitor the quality of your data at rest directly from the data catalog.

Rules are written in DQDL (Data Quality Definition Language), a domain-specific language with 31 built-in rule types. For the full list of rule types and their syntax, see DQDL rule types in the AWS Glue documentation.
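
To give a sense of what a ruleset looks like, here is a minimal sketch in Python that assembles DQDL text you could paste into a rule editor. The helper `build_ruleset`, the imaginary orders table, and its column names (`order_id`, `price`, `email`) are assumptions made for illustration and are not part of any AWS API. The rule types shown (`IsComplete`, `IsUnique`, `ColumnValues`, `Completeness`, `RowCount`) are a small sample of the built-in types, and the thresholds are arbitrary.

```python
def build_ruleset(rules):
    """Join individual DQDL rule strings into one 'Rules = [ ... ]' block.

    Hypothetical helper for illustration only; it does no DQDL validation.
    """
    if not rules:
        raise ValueError("a DQDL ruleset needs at least one rule")
    body = ",\n".join(f"    {rule}" for rule in rules)
    return f"Rules = [\n{body}\n]"


# Assumed columns of an imaginary "orders" table.
rules = [
    'IsComplete "order_id"',        # no NULLs in order_id
    'IsUnique "order_id"',          # no duplicate order_id values
    'ColumnValues "price" > 0',     # every price is positive
    'Completeness "email" > 0.95',  # more than 95% of emails are populated
    'RowCount > 100',               # table has more than 100 rows
]

ruleset = build_ruleset(rules)
print(ruleset)
```

The printed text is a plain DQDL ruleset: a comma-separated list of rules wrapped in `Rules = [ ... ]`. Check each rule against the DQDL reference before relying on it, since some rule types accept only certain column types or expression forms.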