Extract text, images, tables, and metadata from PDF files using Python. Built with PyPDF2, PyMuPDF, pdfplumber, and pdfminer. Helpful for practicing document parsing and data extraction tasks.
-
Updated
Aug 6, 2025 - Python
Extract text, images, tables, and metadata from PDF files using Python. Built with PyPDF2, PyMuPDF, pdfplumber, and pdfminer. Helpful for practicing document parsing and data extraction tasks.
Add a description, image, and links to the pimupdf topic page so that developers can more easily learn about it.
To associate your repository with the pimupdf topic, visit your repo's landing page and select "manage topics."