Apache Rya Becomes Top Level Project
Written by Kay Ewbank   
Tuesday, 08 October 2019

Apache Rya has become an Apache top level project. Rya is a  cloud-based big data triple store (subject-predicate-object) database used to process queries in milliseconds.

Apache Rya was originally developed at the Laboratory for Telecommunication Sciences and became an Apache project in 2015. The name stands for RDF y(and) Accumulo.



RDF (Resource Description Framework) was designed with the  goal of developing metadata for the Internet. RDF is a family ofW3C specifications used as a way to describe and model information. It is based on the idea of making statements about resources in the form <subject,predicate, object>expressions called triples. To specify the title of the main US Naval Academy web page, you could write the triple <USNA Home, :titleOf , http://www.usna.edu/homepage.php>. RDF triple data format is useful for datasets that are diverse, such as connections between users on social media, financial data and transactions, or medical data.

Rya provides a scalable solution to store and query such data. It can store billions of linked information sets and return answers to most computer-based questions in under a second. Rya is built on top of Apache Accumulo to support SPARQL queries for RDF data. A MongoDB back-end is also implemented. Rya uses novel storage methods, indexing schemes, and query processing techniques that scale to billions of triples across multiple nodes. 

Accumulo is an open-source, distributed, column-oriented store modeled after Google’s Bigtable. It provides random, realtime, read/write access to large datasets on clusters of commodity hardware. Accumulo uses Apache Hadoop Distributed File System, and provides automatic load balancing and partitioning, data compression, and fine grained security labels.

Announcing the move to a top level project, the Apache Foundation said Rya is recognized as one of the most advanced database projects in the United States Department of the Navy, powering a new generation of drones, advanced tactical communications through manned-unmanned teaming, and supporting autonomous swarms of smaller robots, among numerous other applications. In addition, Apache Rya is being used for artificial intelligence projects involving semi-autonomous content production operations.

More Information

Rya Website

Related Articles

Apache Fluo Improves Spark Integration

Amazon Neptune Graph DB Reaches GA

Ebay Open Sources Beam

SPARQL Moves Closer



To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on, Twitter, Facebook or Linkedin.


$30,000 For Rule 30

Stephen Wolfram has just announced a prize for three proofs concerning Rule 30. What is Rule 30 and why is it so interesting?

Udacity Scholarships In Data, AI and Cloud Computing

Applications are now open for the initial phase of the Bertelsmann Technology Scholarship program. This will result in 15,000 scholarship recipients participating in Challenge Courses focused on  [ ... ]

More News





or email your comment to: comments@i-programmer.info

Last Updated ( Tuesday, 08 October 2019 )