kaldi-asr/kaldi

Description: kaldi-asr/kaldi is the official location of the Kaldi project.

View on GitHub ↗Jump to charts ↓

Summary Information

Updated 2 hours ago

Added to GitGenius on May 30th, 2026

Created on April 20th, 2015

Open Issues & Pull Requests: 260 (+0)

Number of forks: 5,359

Total Stargazers: 15,431 (+0)

Total Subscribers: 677 (+0)

Issue Activity (beta)

Open issues: 47

New in 7 days: 0

Closed in 7 days: 0

Avg open age: 830 days

Stale 30+ days: 47

Stale 90+ days: 47

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 0

Events in 7 days: 0

Top labels

bug (48)
stale (34)
discussion (10)
waiting-for-feedback (5)
kaldi10-TODO (4)
stale-exclude (2)
wontfix (2)
enhancement (1)

Most active issues this week

No issue events were indexed in the last 7 days.

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: N/A

Mean response time: 14.8 days

90th percentile: 0.0 hours

Tracked items: 71

Most active contributors

kkm000 - 83 events, 17 issues
danpovey - 58 events, 21 issues
jtrmal - 51 events, 23 issues
danijel3 - 16 events, 8 issues
jimbozhang - 15 events, 2 issues

Related by overlapping contributors

Detailed Description

Kaldi is a comprehensive open-source toolkit for speech recognition and acoustic modeling, written primarily in C++ with Shell build infrastructure. The project serves as the official reference implementation for speech-to-text systems and related audio processing tasks, with extensive support for speaker identification, speaker verification, and phoneme alignment. The toolkit is designed to handle the full pipeline of speech recognition research and deployment, from feature extraction and signal processing through acoustic and language modeling to final speech decoding.

The repository demonstrates active maintenance and community engagement, with GitGenius tracking 71 recent issues and pull requests. The most active contributors include kkm000 with 83 recorded events, danpovey with 58 events, and jtrmal with 51 events. Bug reports represent the most common issue type with 47 tracked instances, followed by stale issues at 31 and discussion threads at 10. The median response latency for issues and pull requests is effectively immediate at 0.0 hours, though the mean response time of 354.0 hours reflects the variable nature of community-driven development where some items receive immediate attention while others may take longer to address.

Kaldi's build system supports a wide range of platforms and configurations. The toolkit can be compiled on UNIX systems including various Linux distributions, Darwin, and Cygwin, with separate Windows installation instructions provided. The project includes platform-specific guidance for Fedora 41, PowerPC 64-bit little-endian systems, Android cross-compilation using the Android NDK, and WebAssembly compilation via emscripten for in-browser execution. The toolkit integrates with multiple linear algebra libraries including OpenBLAS, ATLAS, and CUDA for GPU acceleration, making it adaptable to different computational environments.

The development workflow follows a structured contribution pattern documented in the repository. Contributors are expected to fork the main repository, create feature branches, and submit pull requests through GitHub's interface. The project adheres to the Google C++ Style Guide with documented exceptions specific to Kaldi, and contributors can validate their code using Google's cpplint.py tool. The repository maintains comprehensive documentation at kaldi-asr.org, including project information, technique descriptions, C++ coding tutorials, and Doxygen-generated API references.

Community support is facilitated through multiple channels including dedicated user and developer mailing lists accessible via the project website. The toolkit's example systems and build instructions are documented in the egs directory, providing practical starting points for users. The project's interconnection with other major repositories is evident through overlapping contributors with espnet/espnet, llvm/llvm-project, and pytorch/pytorch, indicating Kaldi's role as a foundational component in the broader speech processing and machine learning ecosystem.

kaldi-asr/kaldi

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

kaldi
by
kaldi-asrkaldi-asr/kaldi

Repository Details

kaldi-asr/kaldi

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

kaldi by kaldi-asrkaldi-asr/kaldi

Repository Details

kaldi
by
kaldi-asrkaldi-asr/kaldi