Global Leading OCR Toolkit & Document AI Engine

PaddleOCR converts PDF documents and images into structured, LLM-ready data (JSON/Markdown) with industry-leading accuracy. With 70k+ stars and the trust of top-tier projects like Dify, RAGFlow, and Cherry Studio, PaddleOCR is the bedrock for building intelligent RAG and agentic applications.

🚀 Key Features

📄 Intelligent Document Parsing (LLM-Ready)

Transforming messy visuals into structured data for the LLM era.

🔍 Universal Text Recognition (Scene OCR)

The global gold standard for high-speed, multilingual text spotting.

PaddleOCR Architecture

🛠️ Developer-Centric Ecosystem

📣 Recent updates

🔥 [2026.01.29] PaddleOCR v3.4.0 Released: The Era of Irregular Document Parsing

2025.10.16: Release of PaddleOCR 3.3.0
2025.08.21: Release of PaddleOCR 3.2.0

🚀 Quick Start

Step 1: Try Online

The official PaddleOCR website provides an interactive Experience Center and APIs. No setup is required; you can try it with one click.

👉 Visit Official Website

Step 2: Local Deployment

For local usage, refer to the official documentation that matches your needs.
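
As a rough sketch of what a local run can look like, assuming the `paddleocr` Python package (3.x) and a PaddlePaddle backend are already installed: the function names `run_ocr` and `keep_confident` below are hypothetical helpers written for this illustration, and the constructor flags are the ones recalled from the 3.x quick start, so check them against the version you install.

```python
from typing import List, Sequence


def keep_confident(texts: Sequence[str], scores: Sequence[float],
                   min_score: float = 0.5) -> List[str]:
    """Keep recognized lines whose confidence is at least min_score.

    Hypothetical helper: it works on plain parallel lists, however you
    extract them from the OCR output.
    """
    return [text for text, score in zip(texts, scores) if score >= min_score]


def run_ocr(image_path: str, out_dir: str = "output") -> None:
    """Run the general OCR pipeline on one image and save JSON results."""
    # Imported lazily so the helper above can be used without PaddleOCR.
    from paddleocr import PaddleOCR

    # Assumption: these flags disable the optional document preprocessing
    # steps in PaddleOCR 3.x; verify the names for your installed version.
    ocr = PaddleOCR(
        use_doc_orientation_classify=False,
        use_doc_unwarping=False,
        use_textline_orientation=False,
    )
    for res in ocr.predict(input=image_path):
        res.print()                # dump the structured result to stdout
        res.save_to_json(out_dir)  # write the result as JSON under out_dir
```

Calling `run_ocr("page.png")` (where `page.png` is a placeholder for your own image) should write one JSON file per input under `output/`. The confidence filter is kept separate from the pipeline call so it can be reused on any list of recognized lines and scores.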

🧩 More Features

🔄 Quick Overview of Execution Results

PP-OCRv5

PP-OCRv5 Demo

PP-StructureV3

PP-StructureV3 Demo

PaddleOCR-VL

PaddleOCR-VL Demo

✨ Stay Tuned

Star this repository to keep up with exciting updates and new releases, including powerful OCR and document parsing capabilities!

👩‍👩‍👧‍👦 Community

PaddlePaddle WeChat official account Join the tech discussion group

😃 Awesome Projects Leveraging PaddleOCR

PaddleOCR wouldn't be where it is today without its incredible community! 💗 A massive thank you to all our longtime partners, new collaborators, and everyone who's poured their passion into PaddleOCR — whether we've named you or not. Your support fuels our fire!

| Project Name | Description |
| --- | --- |
| Dify | Production-ready platform for agentic workflow development. |
| RAGFlow | RAG engine based on deep document understanding. |
| pathway | Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG. |
| MinerU | Multi-type document to Markdown conversion tool. |
| Umi-OCR | Free, open-source, batch offline OCR software. |
| cherry-studio | A desktop client that supports multiple LLM providers. |
| haystack | AI orchestration framework to build customizable, production-ready LLM applications. |
| OmniParser | Screen parsing tool for pure vision-based GUI agents. |
| QAnything | Question and answer based on anything. |
| Learn more projects | More projects based on PaddleOCR. |

👩‍👩‍👧‍👦 Contributors

🌟 Star

Star-history

📄 License

This project is released under the Apache 2.0 license.

🎓 Citation

@misc{cui2025paddleocr30technicalreport,
      title={PaddleOCR 3.0 Technical Report}, 
      author={Cheng Cui and Ting Sun and Manhui Lin and Tingquan Gao and Yubo Zhang and Jiaxuan Liu and Xueqing Wang and Zelun Zhang and Changda Zhou and Hongen Liu and Yue Zhang and Wenyu Lv and Kui Huang and Yichao Zhang and Jing Zhang and Jun Zhang and Yi Liu and Dianhai Yu and Yanjun Ma},
      year={2025},
      eprint={2507.05595},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2507.05595}, 
}

@misc{cui2025paddleocrvlboostingmultilingualdocument,
      title={PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model}, 
      author={Cheng Cui and Ting Sun and Suyin Liang and Tingquan Gao and Zelun Zhang and Jiaxuan Liu and Xueqing Wang and Changda Zhou and Hongen Liu and Manhui Lin and Yue Zhang and Yubo Zhang and Handong Zheng and Jing Zhang and Jun Zhang and Yi Liu and Dianhai Yu and Yanjun Ma},
      year={2025},
      eprint={2510.14528},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2510.14528}, 
}

@misc{cui2026paddleocrvl15multitask09bvlm,
      title={PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing}, 
      author={Cheng Cui and Ting Sun and Suyin Liang and Tingquan Gao and Zelun Zhang and Jiaxuan Liu and Xueqing Wang and Changda Zhou and Hongen Liu and Manhui Lin and Yue Zhang and Yubo Zhang and Yi Liu and Dianhai Yu and Yanjun Ma},
      year={2026},
      eprint={2601.21957},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2601.21957}, 
}
