Apache Doris Reaches Top-Level Status
Written by Kay Ewbank   
Thursday, 30 June 2022

Apache Doris has achieved Top-Level Project status at Apache. The open source realtime analytical database is massively parallel processing (MPP) based, is an MPP-based and provides interactive SQL data warehousing for reporting and analysis.

The project was originally developed as the Palo Project within Baidu's advertising report business and was made open source then donated by Baidu to Apache foundation for incubation in July 2018, at which point it was renamed Doris.

dorislogo

Doris provides high concurrent low latency point query performance, as well as high throughput queries of ad-hoc analysis. It also provides batch data loading and real-time mini-batch data loading. Doris provides high availability, reliability, fault tolerance, and scalability. The main advantages of Doris are the simplicity (of developing, deploying and using) and meeting many data serving requirements in a single system.

The Doris team say it offers excellent performance because it is equipped with an efficient column storage engine, which not only reduces the amount of data scanning, but also implements an ultra-high data compression ratio. Using the partition and bucket pruning function, Doris can support ultra-high concurrency of online service business, and a single node can support up to thousands of QPS.

Doris supports ANSI SQL syntax, including single table aggregation, sorting, filtering and multi table join and sub queries. It also supports complex SQL syntax such as window function and grouping set. Users can also create UDF, UDAF and other user-defined functions, and Doris is also compatible with MySQL protocol.

Dpris supports fast loading of data from localhost, Hadoop, Flink, Spark, Kafka, SeaTunnel and other systems, and can also directly access data in MySQL, PostgreSQL, Oracle, S3, Hive, Iceberg, Elasticsearch and other systems without data replication. At the same time, the data stored in Doris can also be read by Spark and Flink, and can be output to the upstream data application for display and analysis.

 dorislogo

More Information

Doris Website

Related Articles

Apache InLong Becomes Top Level Project

Apache Flink ML 2.0 Released

Apache Ignite Changes SQL Engine  

.NET For Apache Spark Updated 

 

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


Nvidia Releases Updated AI Framework
02/08/2022

NVIDIA has announced the general availability of NVIDIA AI Enterprise 2.1. The improvements to the AI and data analytics software include new support for containers, and for public clouds.



JetBrains Launches Containerized Development Environment
25/07/2022

JetBrains has launched an on-premises beta version of Space, its integrated team environment and collaboration software.


More News

pythondata

 



 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Thursday, 30 June 2022 )