Apache Doris Reaches Top-Level Status
Written by Kay Ewbank   
Thursday, 30 June 2022

Apache Doris has achieved Top-Level Project status at Apache. The open source real-time analytical database is based on a massively parallel processing (MPP) architecture and provides interactive SQL data warehousing for reporting and analysis.

The project was originally developed as the Palo Project within Baidu's advertising report business. It was made open source and then donated by Baidu to the Apache Software Foundation for incubation in July 2018, at which point it was renamed Doris.


Doris provides highly concurrent, low-latency point queries as well as high-throughput queries for ad-hoc analysis. It also provides batch data loading and real-time mini-batch data loading. Doris provides high availability, reliability, fault tolerance, and scalability. Its main advantages are its simplicity (of developing, deploying and using) and its ability to meet many data serving requirements in a single system.

The Doris team say it offers excellent performance because it is equipped with an efficient column storage engine, which not only reduces the amount of data scanned, but also achieves an ultra-high data compression ratio. Using partition and bucket pruning, Doris can support ultra-high concurrency for online services, and a single node can support up to thousands of queries per second (QPS).
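To give a feel for why partition and bucket pruning helps point queries, here is a minimal toy sketch in Python. It is not Doris code: the ToyTable class, the monthly partitioning scheme, the CRC32 hash and the bucket count of 4 are all assumptions made for illustration. The idea it shows is that a query which names both the partition column and the bucket column only has to scan one small slice of the table.

```python
from collections import defaultdict
from datetime import date
from zlib import crc32

NUM_BUCKETS = 4  # hypothetical bucket count, chosen only for this sketch


def partition_key(day: date) -> str:
    """Range-partition rows by month, e.g. '2022-06' (assumed scheme)."""
    return f"{day.year:04d}-{day.month:02d}"


def bucket_of(user_id: int) -> int:
    """Hash-distribute rows into a fixed number of buckets (assumed hash)."""
    return crc32(str(user_id).encode()) % NUM_BUCKETS


class ToyTable:
    """Toy table that stores rows in (partition, bucket) slices."""

    def __init__(self):
        # Maps (partition, bucket) to the list of rows stored in that slice.
        self.slices = defaultdict(list)

    def insert(self, day: date, user_id: int, clicks: int) -> None:
        key = (partition_key(day), bucket_of(user_id))
        self.slices[key].append((day, user_id, clicks))

    def point_query(self, day: date, user_id: int):
        """Return (matching rows, rows scanned), touching a single slice."""
        key = (partition_key(day), bucket_of(user_id))
        candidates = self.slices.get(key, [])
        hits = [r for r in candidates if r[0] == day and r[1] == user_id]
        return hits, len(candidates)

    def total_rows(self) -> int:
        return sum(len(rows) for rows in self.slices.values())


table = ToyTable()
for month in (4, 5, 6):
    for d in range(1, 29):
        for user_id in range(100):
            table.insert(date(2022, month, d), user_id, clicks=user_id + d)

hits, scanned = table.point_query(date(2022, 6, 15), user_id=42)
print(f"matched {len(hits)} row(s), scanned {scanned} of {table.total_rows()}")
```

In this toy model the pruned lookup reads one month and one bucket rather than the whole table, which is the general effect the Doris team describe. How Doris actually lays out and prunes data is not covered here.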

Doris supports ANSI SQL syntax, including single-table aggregation, sorting and filtering, multi-table joins, and subqueries. It also supports complex SQL syntax such as window functions and grouping sets. Users can create user-defined functions such as UDFs and UDAFs, and Doris is compatible with the MySQL protocol.
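As an illustration of the kind of window function query mentioned above, the following sketch computes a running total per region. To stay self-contained it runs against Python's built-in sqlite3 module rather than Doris, and assumes the bundled SQLite is version 3.25 or later. The ad_report table, its columns and its data are invented for the example. Because Doris speaks the MySQL protocol, a similar statement would presumably be sent to it through an ordinary MySQL client, but that is an assumption and is not shown here.

```python
import sqlite3

# In-memory stand-in database; this is NOT a Doris connection.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE ad_report (region TEXT, report_day TEXT, clicks INTEGER)"
)
conn.executemany(
    "INSERT INTO ad_report VALUES (?, ?, ?)",
    [
        ("north", "2022-06-01", 10),
        ("north", "2022-06-02", 5),
        ("south", "2022-06-01", 7),
        ("south", "2022-06-02", 3),
    ],
)

# Window function: running total of clicks within each region, by day.
RUNNING_TOTAL_SQL = """
SELECT region,
       report_day,
       clicks,
       SUM(clicks) OVER (PARTITION BY region ORDER BY report_day)
           AS running_clicks
FROM ad_report
ORDER BY region, report_day
"""

rows = conn.execute(RUNNING_TOTAL_SQL).fetchall()
for row in rows:
    print(row)
conn.close()
```

Each output row carries both the day's clicks and the cumulative clicks for its region so far, without collapsing the rows the way a GROUP BY would.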

Doris supports fast loading of data from localhost, Hadoop, Flink, Spark, Kafka, SeaTunnel and other systems, and can also directly access data in MySQL, PostgreSQL, Oracle, S3, Hive, Iceberg, Elasticsearch and other systems without data replication. The data stored in Doris can also be read by Spark and Flink, and can be output to the upstream data application for display and analysis.


More Information

Doris Website

Related Articles

Apache InLong Becomes Top Level Project

Apache Flink ML 2.0 Released

Apache Ignite Changes SQL Engine  

.NET For Apache Spark Updated 



Last Updated ( Thursday, 30 June 2022 )