Amazon Releases Data IDE, Meet EMR Studio
Written by Kay Ewbank   
Friday, 30 April 2021

Amazon AWS has released an IDE for data scientists and analysts who want to develop apps in R, Python, Scala and PySpark. EMR Studio was previewed at AWS re:Invent 2020 and is now generally available.

EMR Studio provides fully managed Jupyter notebooks, and tools like Spark UI and YARN Timeline Service to simplify debugging. EMR Studio uses AWS Single Sign-On and allows you to log in directly with your corporate credentials without signing in to the AWS Management Console.


EMR Studio provides fully managed Jupyter notebooks, along with tools including Spark UI and YARN Timeline Service. It is designed to make it easy to move from prototyping to production. Developers can trigger pipelines from code repositories, and run Notebooks as pipelines using orchestration tools like Apache Airflow or Amazon Managed Workflows for Apache Airflow, or attach notebooks to a bigger cluster using a single click.


For debugging, developers can work within EMR Studio, and can use native application interfaces such as Spark UI and YARN timeline service directly from EMR Studio.

EMR Studio lets developers run notebook code on Amazon EMR running on Amazon Elastic Compute Cloud (Amazon EC2) or Amazon EMR on Amazon Elastic Kubernetes Service (Amazon EKS). EMR Studio kernels and applications run on EMR clusters, so support distributed data processing with the performance-optimized Apache Spark runtime in Amazon EMR. It can also be used to run parameterized notebooks as part of scheduled workflows using orchestration services like Apache Airflow or Amazon Managed Workflows for Apache Airflow (Amazon MWAA).

The updated version adds  the ability to use the Amazon EMR console and AWS CloudFormation to create and configure a new EMR Studio for teams, support for Microsoft Active Directory (AD) as an identity provider, a new quick start notebook experience, the ability to launch the live Apache Spark UI directly from an EMR Studio notebook, and support for private Git repositories.


More Information

EMR Studio Webpage

Related Articles

Amazon AWS Invests In Rust 

AWS Greengrass Now Supports C Executables

AWS Amplify Opens Cloud Services For JavaScript

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.


Couchbase Adds AI To Capella

Couchbase has announced that its Capella managed service NoSQL database will from now on include "generative AI capabilities" 

Deno 1.36 Improves Security

Deno 1.36 has been released with improvements to security and to testing and benchmarking APIs. The team says there are also more robust Node.js and npm package support, and language server improvemen [ ... ]

More News

Summer SALE Kindle 9.99 Paperback $10 off!!




or email your comment to:

Last Updated ( Friday, 30 April 2021 )