AWS Glue versions
You can configure the AWS Glue version parameter when you add or update a job. The AWS Glue version determines the versions of Apache Spark and Python that AWS Glue supports. The Python version indicates the version that's supported for jobs of type Spark. The following table lists the available AWS Glue versions, the corresponding Spark and Python versions, and other changes in functionality.
AWS Glue versions
AWS Glue version | Supported runtime environment versions | Supported Java version | Changes in functionality |
---|---|---|---|
AWS Glue 4.0 |
Spark environment versions
|
Java 8 |
AWS Glue 4.0 is the latest version of AWS Glue. There are several optimizations and upgrades built into this AWS Glue release, such as:
LimitationsThe following are limitations with AWS Glue 4.0:
For more information about migrating to AWS Glue version 4.0, see Migrating AWS Glue for Spark jobs to AWS Glue version 4.0. |
Ray environment versions
|
N/A |
Build and run distributed Python applications with AWS Glue for Ray.
Limitations on Ray jobs in AWS Glue 4.0
|
|
AWS Glue 3.0 |
|
Java 8 |
In addition to the Spark engine upgrade to 3.0, there are optimizations and upgrades built into this AWS Glue release, such as:
LimitationsThe following are limitations with AWS Glue 3.0:
For more information about migrating to AWS Glue version 3.0, see Migrating AWS Glue for Spark jobs to AWS Glue version 3.0. |
AWS Glue 2.0 (deprecated, end of support) |
|
N/A |
In addition to the features provided in AWS Glue version 1.0, AWS Glue version 2.0 also provides:
NoteAWS Glue version 2.0 differs from AWS Glue version 1.0 for some dependencies and versions due to underlying architectural changes. Validate your AWS Glue jobs before migrating across major AWS Glue version releases. For more information about AWS Glue version 2.0 features and limitations, see Running Spark ETL jobs with reduced startup times. |
AWS Glue 1.0 (deprecated, end of support) |
|
N/A |
You can maintain job bookmarks for Parquet and ORC formats in AWS Glue ETL jobs (using AWS Glue version 1.0). Previously, you were only able to bookmark common Amazon S3 source formats such as JSON, CSV, Apache Avro, and XML in AWS Glue ETL jobs. When setting format options for ETL inputs and outputs, you can specify to use Apache Avro reader/writer format 1.8 to support Avro logical type reading and writing (using AWS Glue version 1.0). Previously, only the version 1.7 Avro reader/writer format was supported. The DynamoDB connection type supports a writer option (using AWS Glue version 1.0). LimitationsThe following are limitations with AWS Glue 1.0:
|
AWS Glue 0.9 (deprecated, end of support) |
|
N/A |
Jobs that were created without specifying an AWS Glue version default to AWS Glue 0.9. LimitationsThe following are limitations with AWS Glue 0.9:
|
Note
The following Glue versions support these versions of PythonShell:
PythonShell v3.6 is supported in Glue version 1.0.
PythonShell v3.9 is supported in Glue version 3.0.
Additionally, dev endpoints are supported only in Glue version 1.0, and 0.9.