Description: Sparsity-aware deep learning inference runtime for CPUs
The `deepsparse` repository by Neural Magic on GitHub is an inference runtime designed to deliver GPU-class performance for deep learning models on commodity CPUs by exploiting sparsity. Sparse (pruned) neural networks require far fewer multiply-accumulate operations than their dense counterparts, reducing computational cost while maintaining high accuracy. DeepSparse executes models in the ONNX format, so networks trained in frameworks like PyTorch can be exported and run without modification; it pairs naturally with Neural Magic's SparseML library, which applies pruning and quantization during training. The primary focus of `deepsparse` is on accelerating inference: its engine skips the zeroed weights that pruning introduces and uses CPU-optimized kernels rather than GPU hardware. Because it consumes standard ONNX files, the runtime integrates with existing deep learning workflows, allowing users to benefit from sparsity without significant changes to their models or training processes.
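The core idea — that a pruned weight matrix needs proportionally fewer multiply-accumulates — can be illustrated with a small sketch using SciPy's CSR format. This is only an analogy: DeepSparse's actual kernels are proprietary CPU code, and the matrix sizes and sparsity level here are arbitrary illustrative choices.

```python
import numpy as np
from scipy.sparse import csr_matrix

rng = np.random.default_rng(42)

# A dense weight matrix pruned to ~90% sparsity by magnitude.
w = rng.standard_normal((512, 512)).astype(np.float32)
w[np.abs(w) < np.quantile(np.abs(w), 0.9)] = 0.0

w_sparse = csr_matrix(w)  # stores only the surviving nonzero weights
x = rng.standard_normal((512, 1)).astype(np.float32)

dense_out = w @ x          # touches all 512 * 512 entries, zeros included
sparse_out = w_sparse @ x  # touches only the ~10% nonzero entries

print("nonzeros:", w_sparse.nnz, "of", w.size)
# Both paths compute the same result; the sparse one does ~10x less work.
assert np.allclose(dense_out, sparse_out, atol=1e-3)
```

A production runtime like DeepSparse goes much further than a generic CSR multiply (fused operations, vectorized CPU kernels, cache-aware scheduling), but the arithmetic-savings principle is the same.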
One of the key features of `deepsparse` is its ability to handle the sparsity patterns commonly produced by pruning real-world model architectures. These include unstructured sparsity, where individual weights are zeroed independently, and structured variants such as block sparsity, where contiguous groups of weights are removed together — each requiring a different execution strategy for efficient computation. By abstracting these complexities, `deepsparse` lets researchers and developers experiment with sparse models more easily, facilitating work in computer vision, natural language processing, and other domains where model size and computational efficiency are critical constraints.
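The difference between the two pattern families can be sketched in a few lines of NumPy. This is a conceptual illustration, not DeepSparse code; the matrix size, block shape, and keep ratios are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 8)).astype(np.float32)

# Unstructured sparsity: zero the 75% of weights with the smallest magnitude.
threshold = np.quantile(np.abs(w), 0.75)
unstructured = np.where(np.abs(w) >= threshold, w, 0.0)

# Block sparsity (4x4 blocks): zero entire blocks with the smallest L1 norm.
block_norms = np.abs(w).reshape(2, 4, 2, 4).sum(axis=(1, 3))  # L1 per block
keep = block_norms >= np.median(block_norms)                  # keep top half
block_sparse = w * np.kron(keep, np.ones((4, 4), dtype=np.float32))

density = lambda m: np.count_nonzero(m) / m.size
print(f"unstructured density: {density(unstructured):.2f}")
print(f"block density:        {density(block_sparse):.2f}")
```

Unstructured pruning typically preserves accuracy at higher sparsity levels, while block patterns map more directly onto vectorized hardware — a runtime that handles both lets users pick the trade-off per model.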
The library's architecture is designed to be modular and extensible, enabling users to customize their sparse operations as needed. This flexibility is crucial for adapting the library to new sparsity patterns or optimization techniques that may emerge in future research. Additionally, `deepsparse` offers comprehensive documentation and examples, making it accessible to both novice practitioners and experienced researchers who seek to push the boundaries of what's possible with sparse deep learning.
Performance improvements are a significant benefit of using `deepsparse`, as demonstrated through benchmarks included in the repository; the bundled `deepsparse.benchmark` utility measures throughput and latency for a given ONNX model. These benchmarks illustrate the speedup and resource savings achieved by sparsity-aware execution compared to traditional dense inference on the same CPU. Rather than relying on GPUs, the engine leverages CPU vector instructions (AVX2, AVX-512, and VNNI where available) and cache-friendly execution to maximize throughput and minimize latency.
Moreover, `deepsparse` aligns with the ongoing trend in deep learning toward efficient models that can operate under stringent resource constraints, such as edge devices or commodity servers without accelerators. By running sparse, quantized models — which are substantially smaller than their dense counterparts — without compromising performance, it helps address memory limitations and power consumption, both of which are critical for deploying AI in real-world applications.
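A back-of-the-envelope calculation shows why sparsity and quantization matter under memory constraints. The layer dimensions, sparsity level, and CSR-style index overhead below are illustrative assumptions, not measurements from any particular model.

```python
# Footprint of one hypothetical 1024x1024 weight layer.
rows, cols, sparsity = 1024, 1024, 0.9

dense_fp32 = rows * cols * 4  # bytes: every weight stored as float32

nnz = int(rows * cols * (1 - sparsity))
# CSR-like storage with int8 values: 1 byte per value,
# a 4-byte column index per value, plus 4-byte row pointers.
sparse_int8 = nnz * (1 + 4) + (rows + 1) * 4

print(f"dense fp32:  {dense_fp32 / 1024:.0f} KiB")
print(f"sparse int8: {sparse_int8 / 1024:.0f} KiB")
```

Even with per-value index overhead, the sparse-quantized layer is several times smaller — the kind of reduction that makes the difference between fitting in cache or RAM on a constrained device and not fitting at all.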
In conclusion, the `deepsparse` repository is a valuable resource for anyone looking to explore sparse deep learning models. Its combination of flexibility, performance optimization, and ease of integration makes it an ideal tool for both research and practical applications where efficiency is paramount. By continuing to evolve with advances in sparsity techniques and hardware capabilities, `deepsparse` stands out as a leading solution in the domain of efficient AI.