Apache Doris Reaches Top-Level Status
Written by Kay Ewbank   
Thursday, 30 June 2022

Apache Doris has achieved Top-Level Project status at Apache. The open source realtime analytical database is massively parallel processing (MPP) based, is an MPP-based and provides interactive SQL data warehousing for reporting and analysis.

The project was originally developed as the Palo Project within Baidu's advertising report business and was made open source then donated by Baidu to Apache foundation for incubation in July 2018, at which point it was renamed Doris.

dorislogo

Doris provides high concurrent low latency point query performance, as well as high throughput queries of ad-hoc analysis. It also provides batch data loading and real-time mini-batch data loading. Doris provides high availability, reliability, fault tolerance, and scalability. The main advantages of Doris are the simplicity (of developing, deploying and using) and meeting many data serving requirements in a single system.

The Doris team say it offers excellent performance because it is equipped with an efficient column storage engine, which not only reduces the amount of data scanning, but also implements an ultra-high data compression ratio. Using the partition and bucket pruning function, Doris can support ultra-high concurrency of online service business, and a single node can support up to thousands of QPS.

Doris supports ANSI SQL syntax, including single table aggregation, sorting, filtering and multi table join and sub queries. It also supports complex SQL syntax such as window function and grouping set. Users can also create UDF, UDAF and other user-defined functions, and Doris is also compatible with MySQL protocol.

Dpris supports fast loading of data from localhost, Hadoop, Flink, Spark, Kafka, SeaTunnel and other systems, and can also directly access data in MySQL, PostgreSQL, Oracle, S3, Hive, Iceberg, Elasticsearch and other systems without data replication. At the same time, the data stored in Doris can also be read by Spark and Flink, and can be output to the upstream data application for display and analysis.

 dorislogo

More Information

Doris Website

Related Articles

Apache InLong Becomes Top Level Project

Apache Flink ML 2.0 Released

Apache Ignite Changes SQL Engine  

.NET For Apache Spark Updated 

 

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


GR00T Could Be The Robot You Have Always Wanted
27/03/2024

We may not have flying cars, but we could well soon have robots that match up to predictions for the 21st century. Nvidia has announced GR00T, a cleverly named project to build robots using foundation [ ... ]



Azure AI And Pgvector Run Generative AI Directly On Postgres
26/03/2024

It's a match made in heaven. The Azure AI extension enables the database to call into various Azure AI services like Azure OpenAI. Combined with pgvector you can go far beyond full text search. Let's  [ ... ]


More News

raspberry pi books

 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Thursday, 30 June 2022 )