First Hybrid Open-Source RDBMS Powered By Hadoop and Spark

Written by Nikos Vaggalis

Friday, 22 July 2016

Splice Machine is a novel attempt to merge the best parts of the traditional relational database management systems and their NoSQL counterparts with distributed and in-memory computing based on Hadoop and Spark.

Traditional RDBMSs struggle when faced with massive amounts of data, which they typically handle by scaling up, an expensive approach. Another side effect of the sheer volume of data accumulating from the likes of social media and mobile devices is that OLTP and OLAP queries take heavy performance hits, which in turn hamper real-time analysis and instant decision making.

NoSQL systems might have the answer to handling that kind of volume cheaply and effectively, by scaling out horizontally on commodity hardware rather than scaling up. But in comparison to the traditional RDBMS solution, NoSQL falls short in offering support for ANSI SQL and transactional processing.

Then there are in-memory databases which, despite offering the best value for money, also fall short when memory fills up under the load of terabytes of data. At that point they typically drop the queries they cannot handle, providing neither fail-safety nor consistency.

The answer to these problems lies in integrating a number of state-of-the-art technologies in a single package with a "hyper"-hybrid approach.

That attempt is Splice Machine, which comprises:

HBase and Hadoop

They solve the scaling problem of working under the load of petabytes, at the same time providing failover guarantees.

Apache Spark
An in-memory analytics engine that, unlike the in-memory databases described above, employs spill-to-disk technology: as soon as memory fills up, it offloads the surplus data to disk so that queries do not fail.

Apache Derby
It contributes the RDBMS component: lightweight and ANSI SQL compatible, with support for joins, constraints, indexes, sub-queries, stored procedures and the rest.
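The spill-to-disk idea mentioned under Apache Spark can be illustrated with a toy sketch. This is not Spark's implementation, nor anything taken from Splice Machine. The class name `SpillableBuffer` and its `max_rows_in_memory` parameter are hypothetical and exist only to show the principle: keep rows in memory up to a limit, write the overflow to a temporary file, and read everything back in order when the result is needed.

```python
import os
import pickle
import tempfile


class SpillableBuffer:
    """Toy buffer (hypothetical, for illustration only).

    Rows stay in memory until a row-count limit is reached, then the
    whole in-memory batch is appended to a temporary file on disk.
    """

    def __init__(self, max_rows_in_memory):
        self.max_rows_in_memory = max_rows_in_memory
        self.in_memory = []
        self.spill_file = None   # created lazily on first spill
        self.spilled_rows = 0

    def append(self, row):
        self.in_memory.append(row)
        if len(self.in_memory) >= self.max_rows_in_memory:
            self._spill()

    def _spill(self):
        if self.spill_file is None:
            self.spill_file = tempfile.TemporaryFile()
        # Always write at the end, even if a reader moved the file position.
        self.spill_file.seek(0, os.SEEK_END)
        for row in self.in_memory:
            pickle.dump(row, self.spill_file)
        self.spilled_rows += len(self.in_memory)
        self.in_memory = []

    def __iter__(self):
        # Yield spilled rows first (oldest), then the rows still in memory.
        if self.spill_file is not None:
            self.spill_file.seek(0)
            for _ in range(self.spilled_rows):
                yield pickle.load(self.spill_file)
        yield from self.in_memory


buf = SpillableBuffer(max_rows_in_memory=3)
for i in range(10):
    buf.append(i)

# All ten rows are still retrievable although at most three were held in memory.
total = sum(buf)
```

A real engine tracks memory in bytes rather than row counts and spills sorted or partitioned runs, but the trade-off is the same: slower disk access in exchange for queries that complete instead of failing.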

Their interoperation, based on sophisticated procedures, enables a true RDBMS with MVCC and ACID on Hadoop, capable of real-time, on-the-fly analysis and updates on massive, distributed data volumes, creating a new breed of database technology.
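For readers unfamiliar with MVCC (multi-version concurrency control), the following is a minimal sketch of the general idea under snapshot isolation. It is a generic textbook-style illustration, not Splice Machine's actual design, which the article does not describe. The names `MVCCStore` and `Transaction` are hypothetical. Each write creates a new timestamped version, each transaction reads only versions committed before it started, and a commit is rejected if another transaction has committed a newer version of the same key in the meantime.

```python
import itertools


class MVCCStore:
    """Toy multi-version key-value store (hypothetical, illustration only)."""

    def __init__(self):
        self._clock = itertools.count(1)   # logical timestamps
        self._versions = {}                # key -> [(commit_ts, value), ...]

    def begin(self):
        return Transaction(self, next(self._clock))


class Transaction:
    def __init__(self, store, start_ts):
        self.store = store
        self.start_ts = start_ts
        self.writes = {}                   # uncommitted, private to this txn

    def read(self, key):
        if key in self.writes:
            return self.writes[key]
        # Newest version committed before this transaction started.
        for commit_ts, value in reversed(self.store._versions.get(key, [])):
            if commit_ts < self.start_ts:
                return value
        return None

    def write(self, key, value):
        self.writes[key] = value

    def commit(self):
        # First committer wins: abort on a write-write conflict.
        for key in self.writes:
            versions = self.store._versions.get(key, [])
            if versions and versions[-1][0] > self.start_ts:
                raise RuntimeError(f"write conflict on {key!r}")
        commit_ts = next(self.store._clock)
        for key, value in self.writes.items():
            self.store._versions.setdefault(key, []).append((commit_ts, value))


store = MVCCStore()

t1 = store.begin()
t1.write("x", 10)
t1.commit()

t2 = store.begin()
t3 = store.begin()
t2.write("x", 20)
t2.commit()

# t3 started before t2 committed, so it still sees the old value.
snapshot_value = t3.read("x")
```

Readers never block writers in this scheme, which is what makes it possible to run long analytical reads alongside short transactional updates.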

Don't forget the open source advantage. Very recently Splice Machine went open source, following a trend already set by other DBMSs like Actian's Ingres, in an attempt to expand its adoption and client base, form a community around it and make the process of getting hold of it and testing it much easier.

Splice Machine is available in both the Community and Enterprise editions, with the Community one being free and with unrestricted SQL, while the Enterprise one includes devops features, such as backup, encryption, and security, plus future features like Oracle PL/SQL wire-compatibility (Fall 2016) and active-active replication (in 2017). Official support comes with the Enterprise edition only.

Developers will be able to test and deploy the Community Edition at scale, thanks to an Amazon Web Services cloud-based sandbox launched earlier this week. This makes it possible to put the new open-source 2.0 Community Edition through tests ranging from small to enterprise scale, by spinning up clusters in minutes.

If this is not already enough to persuade you to try out Splice Machine, then the following benefits, when compared to traditional RDBMSs such as Oracle, IBM DB2, or MySQL, might do so:

• 10-20x faster – leverages HBase, the distributed NoSQL DB, as well as in-memory cluster computing from Spark

• A quarter the cost – scales out on commodity hardware using Hadoop

• ANSI SQL – leverages existing SQL-based analysts, reports, and applications without rewrites

• Distributed Transactions – ensures reliable updates across multiple rows and tables, based on advanced research by Google

• Flexible – provides excellent performance for simultaneous OLTP and OLAP workloads

• Elastic – increases or decreases scale in just a few minutes

More Information

Splice Machine

Actian Ingres 11 Technical Preview


Last Updated ( Friday, 22 July 2016 )
