pymupdf

Here are 109 public repositories matching this topic...

pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

python pdf font data-science ocr tesseract epub mupdf text-processing pdf-documents extract-data table-extraction text-shaping xps pymupdf

Updated Jul 17, 2025
Python

ArtifexSoftware / pdf2docx

Open source Python library for converting PDF to DOCX.

pdf-converter docx pymupdf pdf-to-word extract-table

Updated May 28, 2025
Python

CBIhalsen / PolyglotPDF

(eBook and PDF translation) A multilingual eBook processing tool supporting all eBook formats. Features online and offline translation while preserving original layouts. Compatible with both scanned and digital PDFs. Elegant user interface. The world's highest-performing open-source layout-preserving eBook translator.

pdf latex translation math ebook formulas pymupdf openai-api deepseek

Updated Jul 3, 2025
Python

Krasjet / pdf.tocgen

A CLI toolset for automatically generating tables of contents for PDF files.

cli pdf table-of-contents scraping toc-generator pdf-files pdf-document pymupdf

Updated Nov 26, 2023
Python

lucasrla / remarks

Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, and SVG.

markdown pdf ocr highlighting annotations pdf-converter epub zotero obsidian ocrmypdf svg-images pymupdf remarkable-tablet roamresearch

Updated May 26, 2024
Python

genieincodebottle / parsemypdf

A collection of PDF parsing libraries, such as the AI-based docling, claude, openai, gemini, Meta's llama-vision, and unstructured-io, as well as pdfminer, pymupdf, and pdfplumber, for efficient snapshot, text, table, and metadata extraction.

ocr openai claude camelot pymupdf pypdf ocr-python markitdown gemini-pro gemini-ai llama-parse omniai unstructured-io docling llama-vision mistral-ocr smoldocling llama4

Updated Jul 1, 2025
Python

vb64 / markdown-pdf

Markdown to PDF renderer.

markdown pdf markdown-it pymupdf

Updated Apr 17, 2025
Python

Zain-Bin-Arshad / pdf-viewer

A pure-Python PDF viewer that provides the same functionality as other well-known PDF viewers.

python pdf pdf-viewer pure-python fitz pymupdf python-pdf pysimplegui pdf-viewer-python

Updated Jul 14, 2023
Python

devxzh / PDFTools

A small tool built on PyQt5 and PyMuPDF for batch-adding table-of-contents bookmarks, enhancing PDFs, and splitting and merging PDFs.

pdf bookmark pyqt5 pdf-merge pdf-split pymupdf add-catalog

Updated Aug 5, 2021
Python

shayanalibhatti / Designing-a-PDF-Audiobook-using-Python

A simple implementation of a PDF-to-audio converter.

python python3 pdf-reader audio-converter gtts pytesseract pymupdf pdf-to-audio pdf-text pytesseract-ocr

Updated Mar 30, 2021
Python

TheWatcherMultiversal / pdfgui_tools

pdfgui_tools is a user interface tool developed in Qt and Python that integrates with poppler-utils and PyPDF2 for PDF document management. It's a simple and user-friendly tool that includes various utilities.

linux pdf gnu-linux python3 pdf-document pypdf2 pymupdf qt6 pyside6 poppler-utils

Updated Feb 5, 2024
Python

xxao / pero

Unified Python drawing API

visualization python svg drawing pyqt5 pyside2 wxpython pymupdf pycairo pyqt6 pyside6

Updated Jul 20, 2025
Python

stroblme / UNote

Fills the gap left by the lack of an open-source PDF editor that can draw and add notes.

editor lightweight pdf dark-theme viewer draw note create freehand annotate productive pymupdf handwrite

Updated Jun 17, 2024
Python

lheredias / Luftmensch

Useful PDF-related productivity tool.

python pdf automation pyqt5 gui-application web-scraping windows-desktop pdfa pdf-merger pymupdf pdf-compression pysimplegui image-to-pdf-converter pdf-combiner luftmensch pdf-to-pdfa

Updated Oct 12, 2021
Python

gautam132002 / invoice-pdf-data-extraction

Automated extraction of specific information from invoices, achieving over 95% accuracy.

python automation data-extraction pdf-data-extraction pymupdf

Updated Jul 14, 2023
Python

politikundbildung / kindle_to_pdf

Creates PDF annotations from Kindle clippings

python-script kindle pymupdf

Updated Dec 20, 2022
Python

myogpatterns / layered-pdf-merge

Merges multiple PDFs into one combined PDF file while respecting layers (Optional Content Groups).

pdf pdf-converter pdf-merge pdf-format pymupdf pdf-tools pdf-layer pdf-ocg

Updated Jun 26, 2022
Python

renan-siqueira / python-pdf-tool

This project facilitates the extraction of text from PDF files using various Python libraries. It is designed to be flexible, allowing a choice among different text extraction libraries and supporting both a single PDF file and a directory containing multiple PDF files.

python pdf mit-license pdf-to-text pypdf2 pdf-extractor pdfminer pymupdf pdfplumber

Updated Nov 18, 2023
Python

DioCrafts / ai-book-summarizer

📚 AI-Powered Book PDF Knowledge Extractor & Summarizer. Transform your PDF books into structured knowledge effortlessly! This tool leverages AI to analyze books page by page, extracting key insights, definitions, and concepts, and organizes them into Markdown summaries for easier study.

python markdown pdf machine-learning natural-language-processing automation ai text-analysis openai text-summarization document-analysis study-materials pymupdf knowledge-extraction pdf-processing book-summary educational-tools pdf-summarization ai-powered-tools

Updated Jan 2, 2025
Python

kezb90 / PDF_To_Word

A Python-based tool that converts PDF files into editable Word documents, preserving text, images, and layout. Uses PyPDF2, PyMuPDF (fitz), python-docx, and Pillow to accurately transfer content from PDF to .docx. Ideal for transforming complex PDFs into Word format for easy editing.

python automation python-script text-extraction pdf-conversion python-docx pypdf2 pymupdf pdf-to-word image-extraction pdf-to-docx

Updated Nov 16, 2024
Python

