minimind
by
jingyaogong

Description: 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

View jingyaogong/minimind on GitHub ↗

Summary Information

Updated 1 hour ago
Added to GitGenius on October 25th, 2025
Created on July 27th, 2024
Open Issues/Pull Requests: 37 (+0)
Number of forks: 4,820
Total Stargazers: 39,843 (+1)
Total Subscribers: 219 (+0)
Detailed Description

MiniMind is an innovative, privacy-focused AI assistant designed to enable users to interact with their personal documents using large language models (LLMs) that run entirely on their local machine. Developed by Jingyao Gong, this open-source project addresses the growing concern over data privacy when utilizing cloud-based LLM services. By keeping all data and processing local, MiniMind ensures that sensitive information never leaves the user's device, offering a secure and confidential environment for document analysis and conversational AI. It essentially transforms a personal computer into a powerful, private knowledge base, allowing users to "chat" with their own data without compromise.

The core functionality of MiniMind revolves around its intuitive web-based user interface, built with Streamlit, which simplifies the process of uploading and managing documents. Users can upload a variety of file types, including PDFs, TXT, DOCX, CSV, JSON, and Markdown files. Once uploaded, MiniMind processes these documents to create a searchable knowledge base. A key feature is its support for multi-document interaction, allowing users to query across an entire collection of personal files. Furthermore, it offers customizable LLM settings, enabling users to select different models from Ollama and adjust parameters like temperature for varied response styles, alongside persistent chat history for continuous conversations.

Technically, MiniMind leverages a robust architecture centered on Retrieval Augmented Generation (RAG). When documents are uploaded, they are first chunked into smaller, manageable segments. These segments are then converted into numerical representations called embeddings using Sentence Transformers, which capture their semantic meaning. These embeddings are stored in a local vector database, ChromaDB. When a user poses a question, MiniMind retrieves the most semantically relevant document chunks from ChromaDB. These retrieved chunks, along with the user's query, are then fed as context to a locally running LLM, powered by Ollama. The LLM processes this combined information to generate a coherent and contextually relevant response, ensuring answers are grounded in the user's own documents.

The benefits of MiniMind are manifold. Primarily, it champions data privacy and security, eliminating the need to send proprietary or sensitive information to external servers. This local-first approach also provides users with complete control over their AI interactions and data. Economically, it offers a cost-effective alternative to subscription-based cloud LLM services, as it incurs no API usage fees once set up. Its accessibility, with installation options including Docker for ease of deployment, makes advanced LLM capabilities available to a broader audience. MiniMind is ideal for individuals, researchers, or small businesses who require a private, on-demand AI assistant for document analysis, research, or information retrieval without external dependencies.

In summary, MiniMind stands out as a practical and secure solution for personal AI assistance. By integrating local LLM inference with a user-friendly interface and a robust RAG pipeline, it empowers users to unlock insights from their documents while maintaining absolute data privacy. Its commitment to local processing, combined with its flexible document handling and customizable LLM settings, positions MiniMind as an invaluable tool for anyone seeking to harness the power of AI for personal knowledge management without compromising on security or control. As the project evolves, with a roadmap including features like web search and multi-modal input, its utility and impact are poised to grow further.

minimind
by
jingyaogongjingyaogong/minimind

Repository Details

Fetching additional details & charts...