doris
by
apache

Description: Apache Doris is an easy-to-use, high performance and unified analytics database.

View apache/doris on GitHub ↗

Summary Information

Updated 2 hours ago
Added to GitGenius on January 4th, 2025
Created on August 10th, 2017
Open Issues/Pull Requests: 742 (+0)
Number of forks: 3,715
Total Stargazers: 15,035 (+0)
Total Subscribers: 283 (+0)
Detailed Description

The Apache Doris repository is an open-source, distributed SQL analytical database designed to support real-time analytics on large datasets. Developed by Apache Software Foundation, it aims to provide high-performance data querying capabilities, making it suitable for online analytical processing (OLAP) workloads and interactive analysis scenarios. The project emphasizes ease of use and integration with various data sources, facilitating seamless access to diverse data types.

Doris is engineered to handle massive volumes of structured and semi-structured data across multiple nodes in a cluster environment. Its architecture supports a variety of query execution strategies, optimizing for both throughput and latency. Key features include columnar storage, vectorized query execution, and advanced indexing mechanisms like bitmap indexes and B+ tree indexes. These optimizations enable Doris to deliver fast query performance while maintaining high scalability.

The system is designed with compatibility in mind, supporting multiple SQL dialects including MySQL, HiveQL, Presto, and SparkSQL. This ensures that users can leverage familiar query languages without extensive retraining or tooling changes. Moreover, Doris provides a robust data ingestion pipeline, allowing for real-time updates and efficient batch processing. Its support for both OLTP and OLAP workloads enables flexible deployment scenarios, catering to dynamic business needs.

Apache Doris also emphasizes security and reliability. It incorporates various authentication and authorization mechanisms, supporting integration with existing identity management systems. Additionally, the platform ensures data integrity through transactional features and ACID compliance for certain operations. High availability is achieved via distributed architecture and automated failover processes, minimizing downtime and ensuring continuous service.

The community around Apache Doris is active and growing, contributing to its development and support ecosystem. Users benefit from comprehensive documentation, tutorials, and an engaged forum where they can seek assistance or share insights. The project encourages contributions, offering a welcoming environment for developers interested in enhancing the platform's capabilities.

Overall, Apache Doris represents a powerful solution for enterprises seeking efficient, real-time data analysis tools. Its blend of performance, scalability, and ease of integration makes it an attractive option for organizations looking to harness their data assets effectively.

doris
by
apacheapache/doris

Repository Details

Fetching additional details & charts...