unsloth
by
unslothai

Description: Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

View unslothai/unsloth on GitHub ↗

Summary Information

Updated 2 hours ago
Added to GitGenius on May 7th, 2025
Created on November 29th, 2023
Open Issues/Pull Requests: 964 (+0)
Number of forks: 4,374
Total Stargazers: 52,705 (+3)
Total Subscribers: 303 (+0)
Detailed Description

Unsloth is an open-source project that dramatically accelerates fine-tuning and reinforcement learning for large language models (LLMs) such as Llama, Mistral, Gemma, Qwen, and other transformer-based models. It achieves this through a combination of techniques focused on kernel fusion, memory optimization, and efficient data movement, training models roughly 2x faster while using about 70% less VRAM, without approximating the underlying math. The core philosophy is to "unsloth" existing models, making them train faster on existing hardware, rather than focusing solely on model architecture changes.

At the heart of Unsloth lie its custom GPU kernels, written in OpenAI's Triton language, which fuse multiple operations within the transformer block into single, optimized routines. Traditional training loops launch many small kernels, incurring significant overhead; Unsloth's kernel fusion minimizes these launches, reducing round trips between the CPU and GPU and maximizing GPU utilization. It targets key operations such as RoPE embeddings, RMS normalization, the MLP (multi-layer perceptron), and the cross-entropy loss, and pairs the fused forward passes with hand-derived backward passes that avoid much of the overhead of generic autograd. As in most high-performance GPU code, the kernels use a tiling strategy that breaks large matrices into smaller blocks, enabling better use of fast on-chip memory and greater parallelism.
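The tiling idea can be shown in a toy, pure-Python form (this is an illustration of the general technique, not Unsloth's actual Triton code): the matrix product is computed block by block so that each small tile of the inputs is reused across an inner loop, which on a GPU keeps the working set in fast on-chip memory.

```python
def tiled_matmul(a, b, tile=2):
    """Multiply two square matrices (lists of lists) tile by tile."""
    n = len(a)
    c = [[0.0] * n for _ in range(n)]
    # Walk over (tile x tile) blocks of the output, then accumulate
    # partial products so each block of A and B is reused many times
    # before moving on -- the essence of cache/shared-memory blocking.
    for i0 in range(0, n, tile):
        for j0 in range(0, n, tile):
            for k0 in range(0, n, tile):
                for i in range(i0, min(i0 + tile, n)):
                    for j in range(j0, min(j0 + tile, n)):
                        for k in range(k0, min(k0 + tile, n)):
                            c[i][j] += a[i][k] * b[k][j]
    return c
```

The output is identical to a naive triple loop; only the traversal order changes, which is also why fusion and tiling can speed training up without altering results.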

Memory efficiency is another crucial aspect of Unsloth's gains. The project centers on parameter-efficient fine-tuning: base weights can be loaded in 4-bit precision (QLoRA-style, via bitsandbytes) and frozen, while only small low-rank LoRA adapter matrices are trained. Unsloth also ships an optimized gradient-checkpointing scheme that offloads activations to system RAM, trading a small amount of recomputation and transfer time for a large reduction in GPU memory. Together these techniques shrink the memory footprint enough to fine-tune multi-billion-parameter models, at longer sequence lengths, on a single consumer GPU.
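A large share of the advertised 70% VRAM reduction comes from training only low-rank adapters instead of full weight matrices. A back-of-envelope sketch (the hidden size and rank below are illustrative, not official Unsloth figures) shows why:

```python
# LoRA replaces the update to a frozen (d_out x d_in) weight with two
# low-rank factors: B (d_out x rank) and A (rank x d_in).
def lora_trainable_params(d_in, d_out, rank):
    return rank * (d_in + d_out)

d = 4096          # hidden size typical of a ~7B model (illustrative)
full = d * d      # trainable params if one projection were fully tuned
lora = lora_trainable_params(d, d, rank=16)
print(full, lora, full / lora)  # 16777216 131072 128.0
```

At rank 16 the adapter holds 128x fewer trainable parameters than the full matrix, and the frozen base weights can additionally sit in 4-bit precision.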

Unsloth isn't a replacement for the existing Hugging Face training stack (transformers, PEFT, TRL). Instead, it's designed to be *integrated* with it: its `FastLanguageModel` class is a near drop-in for the usual `from_pretrained` workflow and returns a model that plugs directly into trainers such as TRL's `SFTTrainer`. This makes adoption relatively straightforward, as it doesn't require significant changes to the overall training pipeline. Fine-tuned models can then be merged and exported to formats such as GGUF (for llama.cpp and Ollama) or saved as standard checkpoints for serving with engines like vLLM.
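In practice, a fine-tuning run is typically set up through Unsloth's Python API. A hedged sketch of the usual pattern follows (it requires a CUDA GPU and the `unsloth` package; the model name and argument values are illustrative, so check the project README for current options):

```python
from unsloth import FastLanguageModel

# Load a (pre-quantized) base model with 4-bit weights.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3-8b-bnb-4bit",  # illustrative checkpoint
    max_seq_length=2048,
    load_in_4bit=True,          # QLoRA-style frozen 4-bit base weights
)

# Attach LoRA adapters; only these low-rank matrices are trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    lora_alpha=16,
)
# The returned model is then passed to a standard Hugging Face trainer
# (e.g. TRL's SFTTrainer) like any other transformers model.
```

Because the result behaves like an ordinary `transformers` model, the rest of the pipeline (datasets, trainer, logging) stays unchanged.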

The repository includes benchmarking results demonstrating significant improvements over standard Hugging Face implementations, including baselines using Flash Attention 2. These benchmarks show roughly 2x faster training with around 70% less VRAM across various models and hardware configurations, from consumer GPUs such as the Tesla T4 and RTX series up to NVIDIA A100 and H100. The project also provides documentation, example notebooks, and scripts for installing and running Unsloth. It's actively maintained and developed by a growing community, with regular updates and new model support being released. Ultimately, Unsloth aims to democratize fast LLM fine-tuning by making it easier and more affordable to adapt powerful models.
