Recipe step and function reference - AWS Glue DataBrew

Recipe step and function reference

In this reference, you can find descriptions of the recipe steps and functions that you can use programmatically, either from the AWS CLI or by using one of the AWS SDKs. In DataBrew, a recipe step is an action that transforms your raw data into a form that is ready to be consumed by your data pipeline. A DataBrew function is a special kind of recipe step that performs a computation based on parameters.

The recipe steps and functions are organized by category in the console UI, as shown in the following diagram.

Categories for transformations in the UI include the following:

  • Basic column recipe steps

    • Filter

    • Column

  • Data cleaning recipe steps

    • Format

    • Clean

    • Extract

  • Data quality recipe steps

    • Missing

    • Invalid

    • Duplicates

    • Outliers

  • Column structure recipe steps

    • Split

    • Merge

    • Create

  • Column formatting recipe steps

    • Decimal precision

    • Thousands separator

    • Abbreviate numbers

  • Data structure recipe steps

    • Nest-Unnest

    • Pivot

    • Group

    • Join

    • Union

  • Data science recipe steps

    • Text

    • Scale

    • Mapping

    • Encode

  • Functions

    • Mathematical functions

    • Aggregate functions

    • Text functions

    • Date and time functions

    • Window functions

    • Web functions

    • Other functions

For more information about how these recipe steps and functions are used in a recipe (including the use of condition expressions) see Defining a recipe structure.

The following sections describe the recipe steps and functions, organized by what they do.