PaddleOCR by PaddlePaddle

Description: Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs....

Summary Information

Updated 36 minutes ago

Added to GitGenius on November 24th, 2025

Created on May 8th, 2020

Open Issues & Pull Requests: 226 (+0)

Number of forks: 10,995

Total Stargazers: 85,152 (+11)

Total Subscribers: 554 (+0)

Issue Activity (beta)

Open issues: 152

New in 7 days: 2

Closed in 7 days: 0

Avg open age: 61 days

Stale 30+ days: 103

Stale 90+ days: 29

Recent activity

Opened in 7 days: 1

Closed in 7 days: 0

Comments in 7 days: 2

Events in 7 days: 3

Top labels

status/close (3,800)
contrib/good-first-issue (309)
stale (189)
bug (184)
automated issue (112)
report (112)
task/deployment (67)
task/inference (64)

Most active issues this week

#18237 Error when converting a PPOCRV6 medium_det model trained with version 3.7.0 to ONNX after exporting it as an inference model - 3 events / 0 comments
#16823 Frequently Asked Questions on Inference and Deployment of PaddleOCR-VL - 1 event / 1 comment
#18190 PP-OCRv6_medium_rec to ONNX conversion: [ERROR][Paddle2ONNX] Due to the unsupported operators, the conversion is aborted. - 1 event / 1 comment
#18215 PP-DocLayoutV3 layout recognition problem - 1 event / 1 comment
#18241 PP-DocLayoutV3 ignores almost all references - 1 event / 0 comments

Repository Insights (GitGenius)

Median issue/PR response: 39.9 days

Mean response time: 176.5 days

90th percentile: 405.7 days

Tracked items: 2,677

Most active contributors

TingquanGao - 1,124 events, 529 issues
UserWangZz - 1,107 events, 559 issues
GreatV - 899 events, 429 issues
Bobholamovic - 548 events, 235 issues
scyyh11 - 457 events, 289 issues

Detailed Description

PaddleOCR is a comprehensive optical character recognition toolkit and document AI engine developed by PaddlePaddle that converts PDF documents and images into structured, machine-readable data suitable for large language models. The project has accumulated over 85,000 stars and serves as a foundational component for retrieval-augmented generation and agentic applications, with integrations into widely used platforms including Dify, RAGFlow, and Cherry Studio.

The repository's core functionality centers on two primary capabilities. First, it provides intelligent document parsing through its PaddleOCR-VL series of models; the latest version, PaddleOCR-VL-1.6, achieves 96.3% accuracy on OmniDocBench v1.6. This lightweight 0.9-billion-parameter vision-language model recognizes text, formulas, and tables, and handles challenging inputs such as ancient documents, rare characters, seals, and charts. The toolkit outputs structured data in both Markdown and JSON formats. Second, it offers universal text recognition across 100+ languages. PP-OCRv6 supports 50 languages with a single unified model, eliminating the need to switch models when processing multilingual documents. Compared to PP-OCRv5, it achieves a 4.6% improvement in detection accuracy and a 5.1% improvement in recognition accuracy, while delivering 5.2 times faster CPU inference.

The PP-StructureV3 algorithm provides structure-aware conversion capabilities, transforming complex PDFs and images into Markdown or JSON with fine-grained coordinate information including table cell positions and text locations. The toolkit also includes PP-DocLayoutV3 for handling irregular document shapes across five challenging scenarios: skew, warping, scanning artifacts, illumination variations, and screen photography.

GitGenius activity data covers 2,677 tracked issues and pull requests. The median response time for issues and PRs is 39.9 days, while the mean is 176.5 days and the 90th percentile is 405.7 days, so a minority of items wait far longer than the typical one. The most frequently applied issue labels are status/close with 3,800 occurrences, contrib/good-first-issue with 309 occurrences, and stale with 189 occurrences. The most active contributors include TingquanGao with 1,124 tracked events, UserWangZz with 1,107 events, and GreatV with 899 events.
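
To illustrate how a median, a mean, and a 90th percentile of response times can diverge, here is a minimal sketch using Python's standard `statistics` module. The sample values and the `summarize_response_times` helper are made up for illustration; they are not GitGenius data, and this is not necessarily how GitGenius computes its figures.

```python
import statistics


def summarize_response_times(days):
    """Return the median, mean, and 90th percentile of response times (in days)."""
    if len(days) < 2:
        raise ValueError("need at least two samples")
    # quantiles(n=10) returns the 9 cut points between deciles;
    # the last cut point is the 90th percentile.
    deciles = statistics.quantiles(days, n=10, method="inclusive")
    return {
        "median": statistics.median(days),
        "mean": statistics.mean(days),
        "p90": deciles[-1],
    }


# Hypothetical sample: most items are answered within weeks,
# but a few wait for a year or more.
sample = [1, 2, 5, 10, 30, 40, 60, 200, 400, 900]
summary = summarize_response_times(sample)
print(summary)
```

In a skewed sample like this one, a handful of long-waiting items pull the mean and the 90th percentile well above the median, which is the same pattern the three response-time figures for this repository show.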

The repository supports flexible deployment across multiple hardware backends including NVIDIA GPUs, Intel CPUs, Kunlunxin XPUs, and various AI accelerators. Recent releases have expanded functionality to include office document conversion to Markdown, DOCX export for parsed results, and browser-based inference through the PaddleOCR.js SDK. The toolkit provides three model tiers—tiny at 1.5 million parameters, small at 7.7 million parameters, and medium at 34.5 million parameters—enabling deployment across edge devices, mobile platforms, and server environments. Models are distributed through HuggingFace and ModelScope repositories, facilitating integration with the broader machine learning ecosystem. The project maintains documentation in multiple languages including English, Simplified Chinese, Traditional Chinese, Japanese, Korean, French, Russian, Spanish, and Arabic, reflecting its global user base.
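
As a rough illustration of how the three tiers might map to deployment targets, the sketch below picks the largest tier that fits a given parameter budget. The parameter counts come from the description above; the `MODEL_TIERS` table and the `pick_tier` helper are hypothetical and are not part of the PaddleOCR API.

```python
# Parameter counts in millions, as listed in the description above.
MODEL_TIERS = {"tiny": 1.5, "small": 7.7, "medium": 34.5}


def pick_tier(max_params_millions):
    """Return the name of the largest tier within the budget, or None if none fits."""
    fitting = [
        (size, name)
        for name, size in MODEL_TIERS.items()
        if size <= max_params_millions
    ]
    # Tuples compare by size first, so max() yields the largest fitting tier.
    return max(fitting)[1] if fitting else None


# Hypothetical budgets for an edge device, a phone, and a server.
for budget in (2, 10, 100):
    print(budget, pick_tier(budget))
```

A real selection would also weigh accuracy, latency, and memory on the target hardware; parameter count is used here only because it is the one figure the description gives for each tier.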