Description: Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
XGBoost is an open-source software library designed to provide a highly efficient and scalable implementation of gradient boosting machines (GBM). The XGBoost project was initiated by Tianqi Chen, who aimed to enhance the performance of traditional GBMs through algorithmic improvements and system optimizations. This repository, hosted on GitHub under dmlc/xgboost, serves as the official home for the XGBoost library, offering comprehensive resources such as source code, documentation, and examples.
The core strength of XGBoost lies in its ability to handle large datasets efficiently while delivering higher predictive accuracy than standard GBM implementations. This comes from several design choices: tree pruning, which curbs overfitting by removing splits whose loss reduction does not justify the added model complexity; an approximate, histogram-based algorithm for finding splits, which speeds up training and lets XGBoost scale to large datasets; and built-in support for parallel and distributed computing. These features allow users to apply XGBoost in contexts ranging from small-scale applications to high-performance environments such as cloud computing infrastructures.
XGBoost supports multiple programming languages including R, Python, Java, Scala, Julia, Perl, and C++, making it accessible to a wide audience with diverse computational needs. For each language, the repository provides detailed installation instructions and usage guides that facilitate integration into existing workflows. The repository also includes extensive documentation covering topics such as parameter tuning, model interpretation, handling missing values, and using XGBoost in conjunction with other machine learning libraries.
Another key aspect of this repository is its emphasis on community contributions and open-source collaboration. Users are encouraged to report issues, propose enhancements, and contribute code through GitHub’s issue tracking and pull request mechanisms. This collaborative environment ensures that XGBoost remains up-to-date with the latest advancements in machine learning research and practices.
Additionally, the repository includes tutorials and examples demonstrating real-world applications of XGBoost across industries such as finance, healthcare, and e-commerce. These examples show how XGBoost can be applied to problems involving structured (tabular) data, where it frequently matches or outperforms other predictive modeling techniques in both accuracy and training speed.
In summary, the dmlc/xgboost repository is a comprehensive resource for anyone interested in utilizing gradient boosting machines through the powerful XGBoost library. By combining algorithmic innovations with robust support for parallel processing and cross-language compatibility, it stands out as a premier tool for machine learning practitioners seeking to enhance their models' performance on large-scale data. The open-source nature of the project further ensures continuous improvement and adaptation in response to community feedback and evolving research landscapes.