The Principal Dev – Masterclass for Tech Leads

The Principal Dev – Masterclass for Tech Leads28-29 May

Join

New Python OCR Libraries 2026

GitHub Libraries Python OCR Libraries

paddlepaddle/paddleocr 72K +808

added 1 year ago

Awesome multilingual OCR toolkit based on PaddlePaddle . It's a an ultra lightweight OCR system with support for 80+ languages, data annotation and synthesis.

sirfz/tesserocr 2K +3

added 1 year ago

A Python wrapper for the tesseract-ocr API

madmaze/pytesseract 6K +2

added 1 year ago

A Python wrapper for Google Tesseract

lukas-blecher/latex-ocr 16K +30

added 1 year ago

Takes an image of a math formula and returns corresponding LaTeX code.

ocrmypdf/ocrmypdf 32K +135

added 1 year ago

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted.

hiroi-sora/umi-ocr 36K +207

added 1 year ago

Free, open source, batch offline OCR text recognition tool.

jaidedai/easyocr 29K +53

added 1 year ago

Ready-to-use OCR with 80+ supported languages and all popular writing scripts

Join libs.tech

...and unlock some superpowers

GitHub

We won't share your data with anyone else.