Apache Hop 2.0 Released
Written by Kay Ewbank   
Monday, 13 June 2022

Apache Hop 2.0 has been released. The Hop orchestration platform is an open source data integration platform in which everything is treated as metadata, meaning it can work with most data platforms. Hop 2.0 has been upgraded to Java 11 and adds a number of transform plugins.

Hop's aim is to let developers focus on the problem they're trying to solve without technology getting in the way. The team says simple tasks should be easy and complex tasks need to be possible.


Hop lets users work visually, using metadata to describe how data should be processed and how workflows and pipelines should be constructed.

Hop began life back in the early 2000s as Kettle, an Extract-Transform-Load (ETL) product that became Pentaho Data Integration (PDI). Hop is a fork of PDI that entered the Apache Incubator in 2020.

Hop is made up of a small but powerful engine, with all functionality added through plugins: the default Hop installation comes with about 400 plugins, and you can remove plugins or add third-party ones. Hop developers create workflows and pipelines in a visual development environment called Hop Gui. Both workflows and pipelines can run on the native Hop engine, locally or remotely. Pipelines can also run on Apache Spark, Apache Flink and Google Dataflow through the Apache Beam runtime configurations.
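To make the idea of a small engine driven by metadata and plugins more concrete, here is a minimal toy sketch in Java. It is not Hop's API: the class `MetadataPipelineSketch`, the `StepMeta` type and the three plugin names are all hypothetical and invented for illustration. The pipeline is nothing more than a list of descriptions, and the engine simply looks up each named plugin and applies it.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BiFunction;
import java.util.stream.Collectors;

// Toy illustration only: none of these names come from the Hop codebase.
class MetadataPipelineSketch {

    // A step is pure metadata: the name of a plugin plus one option for it.
    static final class StepMeta {
        final String plugin;
        final String option;

        StepMeta(String plugin, String option) {
            this.plugin = plugin;
            this.option = option;
        }
    }

    // Hypothetical plugin registry: plugin name -> function from (rows, option) to rows.
    static final Map<String, BiFunction<List<String>, String, List<String>>> PLUGINS =
            new LinkedHashMap<>();

    static {
        PLUGINS.put("uppercase", (rows, opt) ->
                rows.stream().map(String::toUpperCase).collect(Collectors.toList()));
        PLUGINS.put("prefix", (rows, opt) ->
                rows.stream().map(r -> opt + r).collect(Collectors.toList()));
        PLUGINS.put("filter-contains", (rows, opt) ->
                rows.stream().filter(r -> r.contains(opt)).collect(Collectors.toList()));
    }

    // The "engine" knows nothing about individual transforms; it only
    // interprets the metadata and delegates each step to a registered plugin.
    static List<String> run(List<StepMeta> pipeline, List<String> input) {
        List<String> rows = new ArrayList<>(input);
        for (StepMeta step : pipeline) {
            BiFunction<List<String>, String, List<String>> plugin = PLUGINS.get(step.plugin);
            if (plugin == null) {
                throw new IllegalArgumentException("Unknown plugin: " + step.plugin);
            }
            rows = plugin.apply(rows, step.option);
        }
        return rows;
    }

    public static void main(String[] args) {
        // The pipeline definition is data, so it could equally be loaded from a file.
        List<StepMeta> pipeline = List.of(
                new StepMeta("filter-contains", "a"),
                new StepMeta("uppercase", ""),
                new StepMeta("prefix", "row:"));
        System.out.println(run(pipeline, List.of("apple", "kiwi", "banana")));
    }
}
```

Because the pipeline is only a description, the same definition could in principle be handed to a different interpreter, which is the general idea behind running one pipeline on several engines.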

Hop runs in a Java environment and can be used on premises, in the cloud, or in containers, under Windows, Linux, and macOS.

Much of the work in the new release has gone into the upgrade from Java 8 to Java 11. In terms of what's new for users, the main improvement is the addition of several new transform plugins: Apache Avro File Output, Apache Doris Bulk Loader, a Drools Rules Accumulator and a Drools Rules Executor. There's also a new Formula plugin that lets you apply Excel-like formulas and functions to fields in a pipeline.


More Information

Apache Hop Website

Related Articles

Apache Flink ML 2.0 Released

Apache Ignite Changes SQL Engine  

.NET For Apache Spark Updated
