opendatalab/MinerU

Description: Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

View on GitHub ↗Jump to charts ↓

Summary Information

Updated 45 minutes ago

Added to GitGenius on March 6th, 2026

Created on February 29th, 2024

Open Issues & Pull Requests: 54 (+0)

Number of forks: 6,313

Total Stargazers: 75,176 (+0)

Total Subscribers: 273 (+0)

Issue Activity (beta)

Open issues: 22

New in 7 days: 18

Closed in 7 days: 4

Avg open age: 61 days

Stale 30+ days: 6

Stale 90+ days: 1

Recent activity

Opened in 7 days: 17

Closed in 7 days: 4

Comments in 7 days: 13

Events in 7 days: 42

Top labels

bug (1,602)
enhancement (382)
question (63)
documentation (32)
P1 (14)
wontfix (13)
help wanted (9)
upstream bug (9)

Most active issues this week

#5268 运行时突然崩溃，health接口正常 - 7 events / 2 comments
#5273 models verify high 通过后本地 high parse 仍访问 HuggingFace 导致失败 - 6 events / 2 comments
#5269 如果把页眉和旁注保存到md文件 - 5 events / 2 comments
#5276 [NEXT] flash 解析文本类输入返回误导性 not_cached - 5 events / 1 comments
#5263 VL模式版面解析问题 - 4 events / 2 comments

Explore full issue details

Repository Insights (GitGenius)

Median issue/PR response: N/A

Mean response time: 5.8 hours

90th percentile: 0.0 hours

Tracked items: 2,289

Most active contributors

myhloli - 4,817 events, 2,046 issues
dt-yy - 258 events, 135 issues
drunkpig - 130 events, 90 issues
2257396011 - 54 events, 14 issues
James-Dao - 49 events, 3 issues

Related by overlapping contributors

Detailed Description

MinerU is a high-accuracy document parsing engine designed to transform complex documents into structured formats suitable for large language model workflows, retrieval-augmented generation systems, and agentic applications. The project converts PDFs, DOCX, PPTX, XLSX files, images, and web pages into markdown or JSON output with support for 109 languages through its dual VLM and OCR engine architecture.

The repository has grown to 73,389 stargazers and 6,160 forks as of the most recent tracking period, with consistent community engagement. GitGenius tracking shows a median issue and pull request response latency of 0.0 hours across 1,815 items, indicating rapid triage and response cycles. The most active contributor, myhloli, has logged 3,991 events, followed by dt-yy with 257 events and drunkpig with 129 events. Bug reports represent the dominant issue category with 1,129 tracked items, followed by enhancement requests at 376 items and general questions at 56 items. The project shares contributors with related repositories including PaddleOCR, Dify, and RAGFlow, indicating integration within a broader ecosystem of document processing and AI workflow tools.

MinerU's core parsing capabilities include native support for DOCX, PPTX, and XLSX formats with formula-to-LaTeX conversion and table-to-HTML transformation. The system handles scanned documents, handwriting recognition, multi-column layouts, and cross-page table merging while maintaining human reading order and automatically removing headers and footers. The dual engine approach combines vision language models with OCR for robust document understanding across diverse input types.

The platform offers multiple deployment and integration pathways. It provides MCP Server support for AI coding tools like Cursor and Claude Desktop, integrates with RAG frameworks including LangChain, LlamaIndex, RAGFlow, and Dify, and offers Python, Go, and TypeScript SDKs alongside CLI, REST API, and Docker deployment options. A web version at mineru.net provides zero-installation access, while desktop clients and Gradio WebUI interfaces serve different user needs. The system supports multiple inference backends including pipeline, vlm-engine, and hybrid-engine configurations, with compatibility for domestic AI chips including Ascend, Cambricon, Enflame, and others.

Recent releases demonstrate active development focused on performance optimization and capability expansion. Version 3.4 upgraded the OCR model to PP-OCRv6, achieving approximately 11 percent accuracy improvement on OmniDocBench v1.6 while doubling OCR processing speed. Version 3.3 introduced an effort parameter for the hybrid backend with medium and high parsing strength levels, delivering 35 to 220 percent speed improvements across different platforms while maintaining parsing accuracy. Version 3.1.0 transitioned the project from AGPLv3 to a custom MinerU Open Source License based on Apache 2.0, reducing adoption friction for both community and commercial deployments. The VLM model was upgraded to MinerU2.5-Pro-2604-1.2B with support for image and chart parsing, truncated paragraph merging, and cross-page table merging capabilities.

The project is classified across multiple domains including data mining, machine learning, AI platforms, data science, deep learning, workflow management, model training, data processing, and MLOps, reflecting its position as a comprehensive document understanding solution for enterprise and research applications.

opendatalab/MinerU

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

MinerU
by
opendatalabopendatalab/MinerU

Repository Details

opendatalab/MinerU

Summary Information

Issue Activity (beta)

Recent activity

Top labels

Most active issues this week

Repository Insights (GitGenius)

Most active contributors

Related by overlapping contributors

MinerU by opendatalabopendatalab/MinerU

Repository Details

MinerU
by
opendatalabopendatalab/MinerU