Serverless ETL on AWS GlueFAQ - AWS Prescriptive Guidance

Serverless ETL on AWS GlueFAQ

This section provides answers to commonly raised questions about serverless ETL on AWS Glue.

When should I use AWS Glue Python shell instead of AWS Glue with Spark?

Use AWS Glue Python shell when you do not need too much of a compute power to run light ETL workloads. Use AWS Glue with Spark when you must scale either horizontally, vertically, or both.

What is the difference between AWS Glue version 1.0 and AWS Glue version 2.0?

The major improvement of version 2.0 is the reduced startup time for Spark-related jobs. More feature-related improvements are mentioned in the AWS documentation.