Apache HAWQ Moves To Top Level
Written by Kay Ewbank   
Tuesday, 11 September 2018

Apache HAWQ has moved to be a top-level project. HAWQ is described as an advanced enterprise SQL-on-Hadoop query engine and analytic database that  combines the key technological advantages of MPP database with the scalability and convenience of Apache Hadoop.

HAWQ reads data from and writes data to HDFS natively, is fast and scalable, and provides users with a complete, standards compliant SQL interface. HAWQ operates natively in Apache Hadoop.

hawq

The developers say that HAWQ's parallel processing means it combines high performance throughput and low latency (potentially near real time) query responses that can scale to petabyte-sized datasets. The developers used the expertise in massively parallel processing (MPP) gained through the creation of the Pivotal Greenplum enterprise database and open source PostgreSQL.

HAWQ started life as Pivotal HAWQ before being transferred to an Apache project. Pivotal HAWQ itself owed much to technology that Pivotal acquired when it bought Greenplum and its MPP Shared-Nothing data warehouse product.  Pivotal developed HAWQ as a SQL query database that combined the merits of Pivotal Greenplum with Hadoop distributed storage. Pivotal contributed the Pivotal HAWQ core to the Apache in September 2015. The original team of developers continued work on the Apache HAWQ product to make the product better at running on public cloud, private cloud, or shared physical cluster environments.

The SQL support means HAWQ works with SQL-based applications and BI/data visualization tools. You can use it to execute complex queries and joins, including roll-ups and nested queries. HAWQ now works with the rest of the Apache Hadoop ecosystem products, so you can integrate and manage with Apache YARN. provision with Apache Ambari. and interface with Apache HCatalog. It also supports Apache Parquet, Apache HBase, and works with the Apache MADlib machine learning libraries for AI-based analytics.

hawq

More Information

HAWQ website

Related Articles

Apache MADlib Adds HITS Implementation

Apache Phoenix Now HBase 2.0 Compatible

Apache Phoenix Improves HBase Support

HBase 1.4 With New Shaded Client

HBase Adds MultiWAL Support 

Hadoop 3 Adds HDFS Erasure Coding

Hadoop 2.9 Adds Resource Estimator

Hadoop Adds In-Memory Caching

Hadoop SQL Query Engine Launched

 

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


Call For Code 2024 Focuses On Generative AI
01/03/2024

This year's Call for Code challenge has been launched with a theme of the use of generative AI technology for solutions that aim to improve equitable access to resources and opportunities for historic [ ... ]



Five Tips for Managing Hybrid Development Teams
08/03/2024

Managing hybrid development teams can be challenging, but  can also be a rewarding endeavor. Here are some tips to follow to ensure success. 


More News

raspberry pi books

 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Tuesday, 11 September 2018 )