v0.0.0
pymupdf

pymupdf/PyMuPDF

library
AGPL-3.0
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python

9.9k

735

Updated 5 Jun 2026

data-science
epub
extract-data
font
mupdf
ocr
pdf
pdf-documents
pymupdf
python
table-extraction
tesseract
text-processing
text-shaping
xps

Appears in lists

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.