Amazon EMR Studio - Amazon EMR

Amazon EMR Studio

With Amazon EMR 5.32.0 and 6.2.0 and later, you can use Amazon EMR Studio to perform data analysis and data engineering tasks in a web-based integrated development environment (IDE), using fully managed Jupyter notebooks that run on Amazon EMR clusters. Use AWS Single Sign-On (AWS SSO) to sign in to EMR Studio directly through a secure URL using your corporate credentials.

You can accomplish the following tasks with Amazon EMR Studio:

  • Log in to Amazon EMR Studio using AWS Single Sign-On (AWS SSO) with your enterprise identity provider.

  • Create Workspaces with one or more notebooks. Set up your Workspace with default options that apply to all notebooks, or customize your Workspace and cluster configurations.

  • Access compute capacity and distributed kernels by attaching a Workspace to an EMR cluster to run Jupyter Notebook jobs.

  • Launch Amazon EMR clusters on demand.

  • Connect notebooks to Amazon EMR on EKS clusters to submit work as job runs.

  • Explore and save example notebooks to a Workspace. For more information about example notebooks, see the EMR Studio Notebook examples GitHub repository.

  • Analyze data using Python, PySpark, Spark Scala, Spark R, or SparkSQL, and install custom kernels and libraries.

  • Collaborate with your team by linking Git repositories to share notebook code.

  • Run parameterized notebooks as part of scheduled workflows using an orchestration tool such as Apache Airflow or Amazon Managed Workflows for Apache Airflow. For more information, see Orchestrating analytics jobs on EMR Notebooks using MWAA in the AWS Big Data Blog.

  • Track and debug jobs using the Spark History Server, Tez UI, or YARN timeline server.

You can create and use an EMR Studio at no cost. Applicable charges for Amazon S3 storage and for Amazon EMR clusters apply when you use EMR Studio. For more information about pricing options and details, see Amazon EMR pricing.

For information about setting up one or more EMR Studios, see Set up and manage an Amazon EMR Studio. To learn more about working in an EMR Studio, see Work in an Amazon EMR Studio.

EMR Studio workshop

The Amazon EMR developer experience workshop helps you build a foundational knowledge of Amazon EMR Studio through a series of lab activities. In the workshop, you set up a Studio, create and attach an Amazon EMR cluster to a Workspace in the Studio, and analyze data using a sample notebook.