Top Python OCR Libraries 2025

paddlepaddle/paddleocr 53K +439

added 5 months ago

Awesome multilingual OCR toolkit based on PaddlePaddle. It's an ultra-lightweight OCR system with support for 80+ languages, data annotation, and synthesis.

hiroi-sora/umi-ocr 36K +209

added 5 months ago

Free, open source, batch offline OCR text recognition tool.

ocrmypdf/ocrmypdf 30K +201

added 5 months ago

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted.
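
As a rough illustration of the workflow described above, the sketch below drives the `ocrmypdf` command-line tool from Python. The helper names `build_ocrmypdf_command` and `add_text_layer` are hypothetical, and the choice of `--skip-text` (leave pages that already contain text untouched) is an assumption about a typical batch job, not a recommendation from the project.

```python
import shutil
import subprocess


def build_ocrmypdf_command(src, dst, language="eng", skip_text=True):
    """Assemble an ocrmypdf CLI invocation (hypothetical helper)."""
    cmd = ["ocrmypdf", "--language", language]
    if skip_text:
        # Do not re-OCR pages that already carry a text layer.
        cmd.append("--skip-text")
    cmd += [str(src), str(dst)]
    return cmd


def add_text_layer(src, dst, **options):
    """Run ocrmypdf on src and write the searchable PDF to dst."""
    if shutil.which("ocrmypdf") is None:
        raise RuntimeError("ocrmypdf executable not found on PATH")
    # check=True raises CalledProcessError if ocrmypdf exits non-zero.
    subprocess.run(build_ocrmypdf_command(src, dst, **options), check=True)
```

Calling `add_text_layer("scan.pdf", "searchable.pdf", language="eng+deu")` would, assuming OCRmyPDF and the matching Tesseract language packs are installed, produce a copy of the scan with a searchable text layer.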

jaidedai/easyocr 27K +211

added 5 months ago

Ready-to-use OCR with 80+ supported languages and all popular writing scripts
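
A minimal sketch of what "ready-to-use" can look like in practice, assuming EasyOCR is installed and that `readtext` is called with its default detailed output of (bounding box, text, confidence) entries. The helpers `confident_text` and `read_text` and the 0.5 threshold are illustrative assumptions, not part of the library.

```python
def confident_text(results, min_confidence=0.5):
    """Keep only the text of detections at or above min_confidence.

    `results` is expected in the (bbox, text, confidence) shape that
    easyocr's Reader.readtext returns by default.
    """
    return [text for _bbox, text, conf in results if conf >= min_confidence]


def read_text(image_path, languages=("en",), min_confidence=0.5):
    """OCR an image with EasyOCR and return the confident text lines."""
    import easyocr  # third-party; imported lazily so the helper above works without it

    # The first run may download detection and recognition models.
    reader = easyocr.Reader(list(languages))
    return confident_text(reader.readtext(image_path), min_confidence)
```

Separating the filtering step from the OCR call keeps the threshold logic easy to test on recorded results without loading any models.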

lukas-blecher/latex-ocr 15K +35

added 5 months ago

Takes an image of a math formula and returns corresponding LaTeX code.

madmaze/pytesseract 6K +11

added 5 months ago

A Python wrapper for Google Tesseract
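
For orientation, a short sketch of how the wrapper is typically called, assuming `pytesseract`, Pillow, and the Tesseract binary are all installed. The helper names `build_config` and `image_to_text` are hypothetical, and the default page segmentation mode of 6 (treat the image as a single uniform block of text) is an assumption that will not suit every input.

```python
def build_config(psm=6, oem=3):
    """Build a Tesseract option string (hypothetical helper).

    psm: page segmentation mode; oem: OCR engine mode.
    """
    return f"--psm {psm} --oem {oem}"


def image_to_text(image_path, lang="eng", psm=6):
    """Return the text Tesseract recognizes in the image at image_path."""
    from PIL import Image  # third-party (Pillow), imported lazily
    import pytesseract     # third-party; requires the tesseract binary

    with Image.open(image_path) as img:
        return pytesseract.image_to_string(
            img, lang=lang, config=build_config(psm)
        )
```

For example, `image_to_text("receipt.png")` would return the recognized text as a single string, provided the English language data is available to Tesseract.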

sirfz/tesserocr 2K

added 5 months ago

A Python wrapper for the tesseract-ocr API
