katib
by
kubeflow

Description: Automated Machine Learning on Kubernetes

View on GitHub ↗

Summary Information

Updated 30 minutes ago

Added to GitGenius on June 18th, 2024

Created on April 3rd, 2018

Open Issues & Pull Requests: 111 (+0)

Number of forks: 532

Total Stargazers: 1,689 (+0)

Total Subscribers: 52 (+0)

Issue Activity (beta)

Open issues: 77

New in 7 days: 0

Closed in 7 days: 0

Avg open age: 1,719 days

Stale 30+ days: 72

Stale 90+ days: 70

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 0

Events in 7 days: 0

Top labels

kind/feature (334)
kind/bug (318)
lifecycle/stale (162)
lifecycle/frozen (111)
area/katib (105)
priority/p1 (86)
help wanted (75)
area/front-end (60)

Most active issues this week

No issue events were indexed in the last 7 days.

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 0.0 hours

Mean response time: 186.3 days

90th percentile: 885.8 days

Tracked items: 124

Most active contributors

andreyvelich - 224 events, 77 issues
Electronic-Waste - 137 events, 41 issues
tenzen-y - 67 events, 26 issues
mahdikhashan - 43 events, 12 issues
helenxie-bit - 27 events, 9 issues

Related by overlapping contributors

Detailed Description

Kubeflow Katib is a Kubernetes-native automated machine learning platform that enables hyperparameter tuning, early stopping, and neural architecture search for machine learning workloads. Written primarily in Python, the project provides a framework-agnostic approach to AutoML that can optimize applications written in any programming language while natively supporting major ML frameworks including TensorFlow, PyTorch, XGBoost, JAX, and scikit-learn. The platform integrates seamlessly with Kubernetes Custom Resources and provides out-of-the-box support for Kubeflow Training Operator, Argo Workflows, and Tekton Pipelines, making it suitable for distributed training environments.

The project implements a comprehensive suite of search algorithms for different optimization tasks. For hyperparameter tuning, Katib supports Random Search, Grid Search, Bayesian Optimization, Tree of Parzen Estimators (TPE), Multivariate TPE, CMA-ES, Sobol's Quasirandom Sequence, HyperBand, and Population Based Training. For neural architecture search, it provides ENAS and DARTS algorithms. Early stopping capabilities include the Median Stopping Rule. These algorithms are powered by established optimization frameworks including Goptuna, Hyperopt, Optuna, and Scikit Optimize, allowing users to leverage well-tested optimization methodologies.

Katib is designed with accessibility in mind, offering both a control plane for Kubernetes deployments and a Python SDK available through PyPI as kubeflow-katib. This dual approach enables both infrastructure administrators to deploy Katib clusters and data scientists to programmatically create hyperparameter tuning experiments without deep Kubernetes expertise. The project includes comprehensive examples and getting started guides to facilitate adoption.

The repository shows active development and community engagement. GitGenius tracking data reveals a median issue and pull request response latency of 0.0 hours across 124 tracked items, indicating responsive maintainers. The most active contributor, andreyvelich, has logged 224 events, followed by Electronic-Waste with 137 events and tenzen-y with 67 events. Issue activity is dominated by feature requests (56 tracked items), with stale lifecycle issues (52 items) and bug reports (47 items) also receiving attention. The project maintains connections with related repositories through overlapping contributors, particularly with kubeflow/kubeflow, argoproj/argo-workflows, and microsoft/vscode.

The codebase is classified across multiple optimization and machine learning domains, reflecting its role as a comprehensive experimentation framework for AI model optimization and machine learning workflows. Katib supports reproducible research through its experiment framework capabilities and enables benchmarking of different optimization strategies. The project's integration with the broader Kubeflow ecosystem positions it as a central component for MLOps workflows on Kubernetes, addressing the need for scalable, cloud-native hyperparameter optimization in production environments. The project maintains active community engagement through bi-weekly AutoML and Training Working Group meetings and a dedicated Slack channel, with documented adopters and presentations demonstrating real-world usage.

katib
by
kubeflow

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

katib
by
kubeflowkubeflow/katib

Repository Details

katib by kubeflow

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

katib by kubeflowkubeflow/katib

Repository Details

katib
by
kubeflow

katib
by
kubeflowkubeflow/katib