Top Python OCR Libraries 2025

paddlepaddle/paddleocr 50K +543

added 2 months ago

Awesome multilingual OCR toolkit based on PaddlePaddle. It's an ultra-lightweight OCR system with support for 80+ languages, data annotation, and synthesis.

hiroi-sora/umi-ocr 34K +386

added 3 months ago

Free, open source, batch offline OCR text recognition tool.

ocrmypdf/ocrmypdf 29K +184

added 3 months ago

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted.
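
As a rough illustration of adding a text layer, the sketch below shells out to the `ocrmypdf` command-line tool. It assumes the `ocrmypdf` executable is installed and on PATH; the helper names `build_ocrmypdf_command` and `add_text_layer` are hypothetical and exist only for this example.

```python
import subprocess


def build_ocrmypdf_command(src, dst, language="eng", skip_text=True):
    """Return the argument list for an ocrmypdf call (hypothetical helper)."""
    cmd = ["ocrmypdf", "-l", language]
    if skip_text:
        # --skip-text leaves pages that already contain text untouched
        cmd.append("--skip-text")
    cmd += [str(src), str(dst)]
    return cmd


def add_text_layer(src, dst, **options):
    """Run ocrmypdf on src and write the searchable PDF to dst.

    Assumes the ocrmypdf executable is available on PATH.
    """
    subprocess.run(build_ocrmypdf_command(src, dst, **options), check=True)
```

Calling `add_text_layer("scan.pdf", "searchable.pdf")` would write a copy of the scan with an OCR text layer; building the command separately makes it easy to inspect before running it.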

jaidedai/easyocr 26K +97

added 3 months ago

Ready-to-use OCR with 80+ supported languages and all popular writing scripts
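
A minimal sketch of what "ready-to-use" can look like in practice, assuming the `easyocr` package is installed. `easyocr.Reader` and `readtext` are the library's own calls; the helpers `keep_confident` and `read_image` and the 0.5 threshold are illustrative assumptions, not part of EasyOCR.

```python
def keep_confident(results, min_confidence=0.5):
    """Keep only the text of detections at or above min_confidence.

    Expects (bounding_box, text, confidence) tuples, the shape that
    readtext returns by default.
    """
    return [text for _bbox, text, conf in results if conf >= min_confidence]


def read_image(image_path, languages=("en",), min_confidence=0.5):
    """Run EasyOCR on one image and return the confident text fragments."""
    import easyocr  # imported lazily; the first run downloads model weights

    reader = easyocr.Reader(list(languages))
    return keep_confident(reader.readtext(image_path), min_confidence)
```

Creating the `Reader` loads the models, so in a batch job it is usually worth constructing it once and reusing it across images rather than calling `read_image` in a loop.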

lukas-blecher/latex-ocr 14K +60

added 3 months ago

Takes an image of a math formula and returns corresponding LaTeX code.

madmaze/pytesseract 6K +13

added 2 months ago

A Python wrapper for Google Tesseract
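
For a sense of how the wrapper is typically used, here is a small sketch. It assumes `pytesseract` and Pillow are installed and that the Tesseract binary itself is available on the system. `pytesseract.image_to_string` is the library's call; `tidy_ocr_text` and `image_to_text` are hypothetical helpers added for illustration.

```python
def tidy_ocr_text(raw):
    """Strip surrounding whitespace and drop blank lines from OCR output."""
    lines = (line.strip() for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


def image_to_text(image_path, lang="eng"):
    """OCR a single image file and return cleaned-up text."""
    # Imported lazily so the pure helper above works without these packages
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as img:
        return tidy_ocr_text(pytesseract.image_to_string(img, lang=lang))
```

The `lang` value must match a language pack installed for Tesseract, for example `"eng"` for English.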

sirfz/tesserocr 2K +2

added 2 months ago

A Python wrapper for the tesseract-ocr API
