The Principal Dev – Masterclass for Tech Leads

The Principal Dev – Masterclass for Tech LeadsJuly 17-18

Join

Top Python PDF Libraries 2025

GitHub Libraries Python PDF Libraries

opendatalab/mineru 31K +1593

added 1 month ago

A high-quality tool for convert PDF to Markdown and JSON.

docling-project/docling 28K +1287

added 1 month ago

Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

ocrmypdf/ocrmypdf 28K +942

added 1 month ago

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted.

vikparuchuri/marker 24K +401

added 1 month ago

Marker PDF converts documents to markdown, JSON, and HTML quickly and accurately.

py-pdf/pypdf 8K +44

added 1 month ago

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

kozea/weasyprint 7K +39

added 1 month ago

WeasyPrint is a smart solution helping web developers to create PDF documents. It turns simple HTML pages into gorgeous statistical reports, invoices, tickets, etc.

Join libs.tech

...and unlock some superpowers

GitHub

We won't share your data with anyone else.