Apache Doris Reaches Top-Level Status
Written by Kay Ewbank   
Thursday, 30 June 2022

Apache Doris has achieved Top-Level Project status at Apache. The open source realtime analytical database is massively parallel processing (MPP) based, is an MPP-based and provides interactive SQL data warehousing for reporting and analysis.

The project was originally developed as the Palo Project within Baidu's advertising report business and was made open source then donated by Baidu to Apache foundation for incubation in July 2018, at which point it was renamed Doris.

dorislogo

Doris provides high concurrent low latency point query performance, as well as high throughput queries of ad-hoc analysis. It also provides batch data loading and real-time mini-batch data loading. Doris provides high availability, reliability, fault tolerance, and scalability. The main advantages of Doris are the simplicity (of developing, deploying and using) and meeting many data serving requirements in a single system.

The Doris team say it offers excellent performance because it is equipped with an efficient column storage engine, which not only reduces the amount of data scanning, but also implements an ultra-high data compression ratio. Using the partition and bucket pruning function, Doris can support ultra-high concurrency of online service business, and a single node can support up to thousands of QPS.

Doris supports ANSI SQL syntax, including single table aggregation, sorting, filtering and multi table join and sub queries. It also supports complex SQL syntax such as window function and grouping set. Users can also create UDF, UDAF and other user-defined functions, and Doris is also compatible with MySQL protocol.

Dpris supports fast loading of data from localhost, Hadoop, Flink, Spark, Kafka, SeaTunnel and other systems, and can also directly access data in MySQL, PostgreSQL, Oracle, S3, Hive, Iceberg, Elasticsearch and other systems without data replication. At the same time, the data stored in Doris can also be read by Spark and Flink, and can be output to the upstream data application for display and analysis.

 dorislogo

More Information

Doris Website

Related Articles

Apache InLong Becomes Top Level Project

Apache Flink ML 2.0 Released

Apache Ignite Changes SQL Engine  

.NET For Apache Spark Updated 

 

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


Data On Data Scientists
02/11/2022

How do data scientists spend their time? How do the companies they work for consume and contribute to open source software? How concerned are they about the potential impact of a shortage of suitably  [ ... ]



Python.NET 3.0.0 Released
15/11/2022

You could be forgiven for assuming that Python.NET was a dead project, or equally for never having heard of it. In September it emerged from hibernation with a major update - Release 3.0.0 - which sup [ ... ]


More News

picobook

 



 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Thursday, 30 June 2022 )