moonshine-ai/moonshine

Description: Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

View on GitHub ↗Jump to charts ↓

Summary Information

Updated 6 minutes ago

Added to GitGenius on March 2nd, 2026

Created on October 4th, 2024

Open Issues & Pull Requests: 7 (+0)

Number of forks: 476

Total Stargazers: 8,721 (+6)

Total Subscribers: 62 (+0)

Issue Activity (beta)

Open issues: 3

New in 7 days: 0

Closed in 7 days: 23

Avg open age: 103 days

Stale 30+ days: 1

Stale 90+ days: 0

Recent activity

Opened in 7 days: 0

Closed in 7 days: 10

Comments in 7 days: 0

Events in 7 days: 0

Top labels

enhancement (3)
documentation (1)

Most active issues this week

#137 Request: macOS Intel (x86_64) Support - 7 events / 2 comments
#196 Python: `mic_stream.add_audio()` performs synchronous transcription, blocking the PortAudio callback - 7 events / 2 comments
#158 fix(python): transcript_line_t struct mismatch in PyPI v0.0.49 causes SIGSEGV on multi-line transcription - 6 events / 1 comments
#164 Can the Android SDK set minSdk = 35 to a lower value to support Android 7 devices? - 6 events / 1 comments
#161 Documentation on `moonshine_transcribe_without_streaming` does not match actual behaviour - 5 events / 1 comments

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 6.7 days

Mean response time: 57.9 days

90th percentile: 223.5 days

Tracked items: 94

Most active contributors

petewarden - 96 events, 56 issues
evmaki - 56 events, 26 issues
keveman - 15 events, 9 issues
guynich - 12 events, 3 issues
LauraGPT - 11 events, 5 issues

Related by overlapping contributors

Detailed Description

Moonshine Voice is an open source AI toolkit written in C++ designed for developers building real-time voice agents and applications. The project provides speech-to-text, text-to-speech, and intent recognition capabilities optimized for very low latency performance. All processing runs on-device, eliminating the need for API keys, accounts, or cloud connectivity while maintaining privacy and enabling fast responses.

The toolkit addresses specific limitations of existing solutions like OpenAI's Whisper by implementing streaming-capable models that process audio incrementally rather than requiring fixed 30-second input windows. This streaming architecture allows the framework to cache computations and avoid redundant processing as users speak, delivering latency below 200 milliseconds on various platforms. The speech-to-text models are based on cutting-edge research published at arxiv.org/abs/2602.12241 and trained from scratch, achieving higher accuracy than Whisper Large V3 at the top end while offering models as small as 26 megabytes for constrained deployments.

Cross-platform support is a core strength of Moonshine Voice. The same library runs on Python, iOS, Android, macOS, Linux, Windows, Raspberry Pis, IoT devices, microcontrollers, DSPs, and wearables. The repository includes example applications for each major platform available as downloadable archives from GitHub Releases, with quickstart guides for Python, iOS, Android, Linux, macOS, Windows, and Raspberry Pi. The framework provides high-level APIs that bundle complete solutions for transcription, text-to-speech, voice cloning, speaker identification, command recognition, and conversational agents into a single library.

Language support spans eight languages for speech-to-text including English, Spanish, Mandarin, Japanese, Korean, Vietnamese, Ukrainian, and Arabic. Text-to-speech support extends to sixteen languages, adding German, French, Hindi, Italian, Dutch, Portuguese, Russian, and Turkish to the STT language list. This multilingual capability addresses another gap in Whisper's performance, particularly for Asian languages like Korean and Japanese where Whisper's accuracy drops significantly below usable thresholds.

According to GitGenius activity tracking across eighty issues and pull requests, the project maintains a median response latency of 30.8 hours with a mean of 1413.6 hours, indicating variable but generally responsive maintenance. The most active contributors are evmaki with 56 tracked events, petewarden with 47 events, and keveman with 15 events. Enhancement requests and documentation improvements represent the most frequently tracked issue labels. The project's contributor network overlaps with major repositories including microsoft/vscode, microsoft/typescript, and rust-lang/rust, suggesting involvement from developers with experience in large-scale systems.

The repository is classified across multiple domains including AI Assistant, Natural Language processing, LLM-powered applications, Data Analysis, and Business Intelligence, reflecting its role as infrastructure for voice-driven AI applications. The project maintains an active community with a Discord server for live support and includes comprehensive documentation through README files, Colab notebooks, and YouTube screencasts demonstrating platform-specific implementations.

moonshine-ai/moonshine

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

moonshine
by
moonshine-aimoonshine-ai/moonshine

Repository Details

moonshine-ai/moonshine

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

moonshine by moonshine-aimoonshine-ai/moonshine

Repository Details

moonshine
by
moonshine-aimoonshine-ai/moonshine