The Principal Dev – Masterclass for Tech Leads

The Principal Dev – Masterclass for Tech LeadsNov 27-28

Join

New Python OCR Libraries 2025

GitHub Libraries Python OCR Libraries

paddlepaddle/paddleocr 56K +633

added 7 months ago

Awesome multilingual OCR toolkit based on PaddlePaddle . It's a an ultra lightweight OCR system with support for 80+ languages, data annotation and synthesis.

sirfz/tesserocr 2K -2

added 7 months ago

A Python wrapper for the tesseract-ocr API

madmaze/pytesseract 6K +9

added 7 months ago

A Python wrapper for Google Tesseract

lukas-blecher/latex-ocr 15K +502

added 7 months ago

Takes an image of a math formula and returns corresponding LaTeX code.

ocrmypdf/ocrmypdf 31K +106

added 7 months ago

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted.

hiroi-sora/umi-ocr 36K +207

added 7 months ago

Free, open source, batch offline OCR text recognition tool.

jaidedai/easyocr 28K +67

added 7 months ago

Ready-to-use OCR with 80+ supported languages and all popular writing scripts

Join libs.tech

...and unlock some superpowers

GitHub

We won't share your data with anyone else.