presto
by
prestodb

Description: The official home of the Presto distributed SQL query engine for big data

View on GitHub ↗

Summary Information

Updated 1 hour ago

Added to GitGenius on January 4th, 2025

Created on August 9th, 2012

Open Issues & Pull Requests: 2,914 (+0)

Number of forks: 5,540

Total Stargazers: 16,714 (+0)

Total Subscribers: 819 (+0)

Issue Activity (beta)

Open issues: 1,566

New in 7 days: 0

Closed in 7 days: 0

Avg open age: 1,131 days

Stale 30+ days: 1,544

Stale 90+ days: 1,511

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 2

Events in 7 days: 4

Top labels

bug (1,200)
stale (959)
feature request (498)
beginner-task (347)
prestissimo (291)
enhancement (272)
flaky-test (147)
function (143)

Most active issues this week

#28099 TestPrestoQueries.testAddDistinctForSemiJoinBuild is flaky - 3 events / 1 comments
#28019 docs: add missing documentation for configuration and session properties - 2 events / 2 comments
#28077 Refactor `PushDownWidenCast` to source-side coercion annotation, extensible to other operators - 2 events / 0 comments

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 0.0 hours

Mean response time: 162.0 days

90th percentile: 427.5 days

Tracked items: 1,300

Most active contributors

tdcmeehan - 568 events, 271 issues
hantangwangd - 315 events, 126 issues
steveburnett - 260 events, 99 issues
kaikalur - 217 events, 89 issues
aditi-pandit - 194 events, 104 issues

Related by overlapping contributors

Detailed Description

Presto is a distributed SQL query engine designed for querying big data across multiple sources. Written in Java, it enables interactive SQL analysis on data stored in Hadoop, Hive, and other data lakes without requiring data movement or transformation into a separate analytics system. The engine supports querying data in various formats including Parquet and integrates with the Apache ecosystem, making it a core component for organizations managing large-scale data infrastructure.

The project maintains active development with significant community engagement. According to activity tracking, the repository has processed over 1,300 issues and pull requests with a median response latency of 0.0 hours, though the mean response time of 3,887.8 hours reflects the variable nature of issue resolution timelines. Bug reports represent the most common issue type with 532 tracked instances, followed by 410 feature requests and 89 flaky test reports. The core maintenance team includes contributors like tdcmeehan with 568 tracked events, hantangwangd with 315 events, and steveburnett with 259 events, indicating sustained organizational investment in the project's evolution.

Building Presto requires Java 17 with either Oracle JDK or OpenJDK, Maven 3.6.3 or later, and Python 2.4 or higher for the launcher script. The project uses a standard Maven build process that downloads dependencies on the first build and supports configuration through a local Maven cache to avoid conflicts when building multiple Presto projects simultaneously. The comprehensive unit test suite can be skipped during development builds to accelerate iteration. The project integrates with IntelliJ IDEA as the recommended development environment and includes sample configuration files that work out-of-the-box for local development.

Presto's architecture includes a native C++ implementation called Presto native, which rewrites the worker component using Velox, a C++ database library providing high-performance data processing components. This dual-implementation approach allows organizations to choose between the Java-based standard implementation and the performance-optimized native version depending on their workload requirements.

The user interface is built with React components written in JSX and ES6, compiled into browser-compatible JavaScript during the Maven build process. The UI can be excluded from builds using the skipUI flag for faster compilation when frontend changes are not needed. The project provides development container support through a separate presto-dev repository for streamlined local development setup.

The repository is classified across multiple domains including query engines, data warehousing, performance optimization, real-time analysis, and multi-source querying. It demonstrates strong overlap with other major projects through shared contributors, linking it to microsoft/vscode, trinodb/trino, and microsoft/typescript ecosystems. The project operates under the Apache License Version 2.0, with contributions governed by formal contribution guidelines and community coordination through a dedicated Slack channel for developer discussions.

presto
by
prestodb

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

presto
by
prestodbprestodb/presto

Repository Details

presto by prestodb

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

presto by prestodbprestodb/presto

Repository Details

presto
by
prestodb

presto
by
prestodbprestodb/presto