Hadoop gets to 1.0
Written by Kay Ewbank   
Thursday, 05 January 2012

The Apache Software Foundation has officially announced the release of Hadoop 1.0. The release adds some features in the areas of security and support for the Hadoop HBase database, but the most important aspect of the release is that  Hadoop is now mature enough to warrant the 1.0 marker.

 

Hadoop may have been making the headlines for months, if not years, in the field of big data, but it has only now reached the status of a 1.0 release.

The Apache Software Foundation has now officially announced the release of Hadoop 1.0. The release adds some features in the areas of security and support for the Hadoop HBase database, but the most important aspect of the release is that Apache Software Foundation thinks Hadoop is now mature enough to warrant the 1.0 marker.

Despite its earlier 0.x status, Hadoop, which started life at Yahoo! and has been under development for the last six years, is already in use at high profile sites including Yahoo, Facebook and LinkedIn.

 

hadoop

 

The security improvement comes in the form of support for Kerberos strong authentication so you can encrypt and protect your data. Another significant feature in the 1.0 release is Webhdfs, an HTTP web interface to the Hadoop Distributed File System (HDFS). This will let you use HTTP rather than having to go via a Java or C client when you want to interact with HDFS.

The inclusion of support for HBase is interesting because it shows how Hadoop is moving more towards the type of apps that require real time web apps. Hadoop was designed to replicate Google MapReduce, the software that was used to build Google’s web index, and because of this is great for data analysis where the end results of the analysis are used by other apps - to create a web index that can then be used by a search engine, say. It wasn’t designed to provide instant responses to queries. HBase, by contrast, is a distributed database that works with HDFS and is more suited to real-time applications.

Apache Software Foundation suggests you use HBase when you need random, realtime read/write access to your Big Data, saying that the goal of HBase is to host very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. The column-oriented store is modeled after Google's Bigtable.

Despite the increased support for HBase in Hadoop 1.0, the companies behind Hadoop are hedging their bets as to whether it will be a winner or not. For example, while Yahoo! uses HBase for some of its services, it is also working on other alternatives including MapReduce Online and S4, both of which provide an online window onto data sets that have been first mapped then reduced so the data set is small enough to be workable.

hadoop

More Information

Hadoop 1.0 release notes

Related News:

Hadoop for Windows

 

raspberry pi books

 

Comments




or email your comment to: comments@i-programmer.info

 

To be informed about new articles on I Programmer, subscribe to the RSS feed, follow us on Google+, Twitter, Linkedin or Facebook or sign up for our weekly newsletter.

Banner


Microsoft Introduces .NET Smart Components
01/04/2024

Microsoft has provided a set of .NET Smart Components, described as a set of genuinely useful AI-powered UI components that you can quickly and easily add to .NET apps. The components are prebuilt end [ ... ]



Redis Changes License, Rival Fork Launched
03/04/2024

The developers of Redis have announced that they are changing the licensing model for the database. From now on, all future versions of Redis will be released with source-available licenses rather tha [ ... ]


More News

Last Updated ( Thursday, 05 January 2012 )