minimind
by
jingyaogong

Description: 🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h!

View on GitHub ↗

Summary Information

Updated 12 minutes ago

Added to GitGenius on October 25th, 2025

Created on July 27th, 2024

Open Issues & Pull Requests: 46 (+0)

Number of forks: 6,871

Total Stargazers: 53,118 (+4)

Total Subscribers: 244 (+0)

Issue Activity (beta)

Open issues: 24

New in 7 days: 0

Closed in 7 days: 0

Avg open age: 179 days

Stale 30+ days: 18

Stale 90+ days: 13

Recent activity

Opened in 7 days: 0

Closed in 7 days: 0

Comments in 7 days: 0

Events in 7 days: 0

Top labels

question (27)
enhancement (14)
bug (6)
good first issue (5)
documentation (4)
duplicate (3)
help wanted (3)

Most active issues this week

No issue events were indexed in the last 7 days.

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: 2.6 hours

Mean response time: 5.8 days

90th percentile: 13.0 days

Tracked items: 626

Most active contributors

jingyaogong - 1,265 events, 575 issues
dao86 - 32 events, 28 issues
tangjinou - 29 events, 8 issues
dyhuachi - 24 events, 1 issues
WangRongsheng - 23 events, 8 issues

Related by overlapping contributors

Detailed Description

MiniMind is an innovative, privacy-focused AI assistant designed to enable users to interact with their personal documents using large language models (LLMs) that run entirely on their local machine. Developed by Jingyao Gong, this open-source project addresses the growing concern over data privacy when utilizing cloud-based LLM services. By keeping all data and processing local, MiniMind ensures that sensitive information never leaves the user's device, offering a secure and confidential environment for document analysis and conversational AI. It essentially transforms a personal computer into a powerful, private knowledge base, allowing users to "chat" with their own data without compromise.

The core functionality of MiniMind revolves around its intuitive web-based user interface, built with Streamlit, which simplifies the process of uploading and managing documents. Users can upload a variety of file types, including PDFs, TXT, DOCX, CSV, JSON, and Markdown files. Once uploaded, MiniMind processes these documents to create a searchable knowledge base. A key feature is its support for multi-document interaction, allowing users to query across an entire collection of personal files. Furthermore, it offers customizable LLM settings, enabling users to select different models from Ollama and adjust parameters like temperature for varied response styles, alongside persistent chat history for continuous conversations.

Technically, MiniMind leverages a robust architecture centered on Retrieval Augmented Generation (RAG). When documents are uploaded, they are first chunked into smaller, manageable segments. These segments are then converted into numerical representations called embeddings using Sentence Transformers, which capture their semantic meaning. These embeddings are stored in a local vector database, ChromaDB. When a user poses a question, MiniMind retrieves the most semantically relevant document chunks from ChromaDB. These retrieved chunks, along with the user's query, are then fed as context to a locally running LLM, powered by Ollama. The LLM processes this combined information to generate a coherent and contextually relevant response, ensuring answers are grounded in the user's own documents.

The benefits of MiniMind are manifold. Primarily, it champions data privacy and security, eliminating the need to send proprietary or sensitive information to external servers. This local-first approach also provides users with complete control over their AI interactions and data. Economically, it offers a cost-effective alternative to subscription-based cloud LLM services, as it incurs no API usage fees once set up. Its accessibility, with installation options including Docker for ease of deployment, makes advanced LLM capabilities available to a broader audience. MiniMind is ideal for individuals, researchers, or small businesses who require a private, on-demand AI assistant for document analysis, research, or information retrieval without external dependencies.

In summary, MiniMind stands out as a practical and secure solution for personal AI assistance. By integrating local LLM inference with a user-friendly interface and a robust RAG pipeline, it empowers users to unlock insights from their documents while maintaining absolute data privacy. Its commitment to local processing, combined with its flexible document handling and customizable LLM settings, positions MiniMind as an invaluable tool for anyone seeking to harness the power of AI for personal knowledge management without compromising on security or control. As the project evolves, with a roadmap including features like web search and multi-modal input, its utility and impact are poised to grow further.

minimind
by
jingyaogong

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

minimind
by
jingyaogongjingyaogong/minimind

Repository Details

minimind by jingyaogong

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

minimind by jingyaogongjingyaogong/minimind

Repository Details

minimind
by
jingyaogong

minimind
by
jingyaogongjingyaogong/minimind