Amazon Releases Data IDE, Meet EMR Studio
Written by Kay Ewbank   
Friday, 30 April 2021

Amazon AWS has released an IDE for data scientists and analysts who want to develop apps in R, Python, Scala and PySpark. EMR Studio was previewed at AWS re:Invent 2020 and is now generally available.

EMR Studio provides fully managed Jupyter notebooks, and tools like Spark UI and YARN Timeline Service to simplify debugging. EMR Studio uses AWS Single Sign-On and allows you to log in directly with your corporate credentials without signing in to the AWS Management Console.

aws

EMR Studio provides fully managed Jupyter notebooks, along with tools including Spark UI and YARN Timeline Service. It is designed to make it easy to move from prototyping to production. Developers can trigger pipelines from code repositories, and run Notebooks as pipelines using orchestration tools like Apache Airflow or Amazon Managed Workflows for Apache Airflow, or attach notebooks to a bigger cluster using a single click.

emrstudio

For debugging, developers can work within EMR Studio, and can use native application interfaces such as Spark UI and YARN timeline service directly from EMR Studio.

EMR Studio lets developers run notebook code on Amazon EMR running on Amazon Elastic Compute Cloud (Amazon EC2) or Amazon EMR on Amazon Elastic Kubernetes Service (Amazon EKS). EMR Studio kernels and applications run on EMR clusters, so support distributed data processing with the performance-optimized Apache Spark runtime in Amazon EMR. It can also be used to run parameterized notebooks as part of scheduled workflows using orchestration services like Apache Airflow or Amazon Managed Workflows for Apache Airflow (Amazon MWAA).

The updated version adds  the ability to use the Amazon EMR console and AWS CloudFormation to create and configure a new EMR Studio for teams, support for Microsoft Active Directory (AD) as an identity provider, a new quick start notebook experience, the ability to launch the live Apache Spark UI directly from an EMR Studio notebook, and support for private Git repositories.

aws

More Information

EMR Studio Webpage

Related Articles

Amazon AWS Invests In Rust 

AWS Greengrass Now Supports C Executables

AWS Amplify Opens Cloud Services For JavaScript

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


GitHub Introduces New Features At Universe 2022
10/11/2022

GitHub has announced a number of improvements for developers, including Copilot for Business, voice based interaction, and the availability of Codespaces for all users. 



Data, Data, Everywhere ...
08/11/2022

... and not enough people with the skills to cope with it all. The whole raft of Udacity's data-related Nanodegrees and courses are restarting November 9th and Udacity is currently offering a 75% disc [ ... ]


More News

picobook

 



 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Friday, 30 April 2021 )