Program AWS Glue ETL scripts in PySpark

You can find Python code examples and utilities for AWS Glue in the AWS Glue samples repository on the GitHub website.

Using Python with AWS Glue

AWS Glue supports an extension of the PySpark Python dialect for scripting extract, transform, and load (ETL) jobs. This section describes how to use Python in ETL scripts and with the AWS Glue API.
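As a rough illustration of what such a script can look like, the sketch below reads a table from the Data Catalog, renames and casts a few columns, and writes the result to Amazon S3 as Parquet. The database name, table name, column names, and S3 path are hypothetical placeholders, not values from this guide, and the split into run_etl and main is only one possible layout. The awsglue imports sit inside main so that the module can still be loaded outside a Glue job environment.

```python
import sys

# Hypothetical column mappings: (source name, source type, target name, target type).
MAPPINGS = [
    ("id", "long", "order_id", "long"),
    ("amount", "string", "amount", "double"),
]


def run_etl(glue_context, database, table_name, output_path):
    """Read a catalog table, apply the column mappings, and write Parquet to S3."""
    # Load the source table from the AWS Glue Data Catalog as a DynamicFrame.
    source = glue_context.create_dynamic_frame.from_catalog(
        database=database, table_name=table_name
    )
    # Rename and cast columns according to MAPPINGS.
    mapped = source.apply_mapping(MAPPINGS)
    # Write the mapped frame to the given S3 path in Parquet format.
    glue_context.write_dynamic_frame.from_options(
        frame=mapped,
        connection_type="s3",
        connection_options={"path": output_path},
        format="parquet",
    )
    return mapped


def main():
    # Imported here because these modules are only available in a Glue job environment.
    from awsglue.context import GlueContext
    from awsglue.job import Job
    from awsglue.utils import getResolvedOptions
    from pyspark.context import SparkContext

    args = getResolvedOptions(sys.argv, ["JOB_NAME"])
    glue_context = GlueContext(SparkContext.getOrCreate())
    job = Job(glue_context)
    job.init(args["JOB_NAME"], args)

    # All three values below are assumed placeholders.
    run_etl(glue_context, "example_db", "example_orders", "s3://example-bucket/output/")

    job.commit()
```

A script deployed as a Glue job would end with a call to main(). Keeping the transformation steps in run_etl, separate from job setup, makes it possible to exercise them with a stand-in context object during local development.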

AWS Glue PySpark extensions

AWS Glue has created the following extensions to the PySpark Python dialect.

AWS Glue PySpark transforms

AWS Glue has created the following transform classes to use in PySpark ETL operations.