borb is a library for reading, creating and manipulating PDF files in python.
-
Updated
Dec 1, 2024 - Python
borb is a library for reading, creating and manipulating PDF files in python.
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
Python library to interact with https://pdftables.com API
Merge images into one pdf file including useful optiıns via command line.
A Python-based tool that converts PDF files into editable Word documents, preserving text, images, and layout. Uses PyPDF2, PyMuPDF (fitz), python-docx, and Pillow to accurately transfer content from PDF to .docx. Ideal for transforming complex PDFs into Word format for easy editing.
Python DJVU to PDF converter which preserves OCR text and bookmark metadata (e.g. TOC)
The PDF-to-Image-Encryptor is a Python app that converts PDF pages to secure, encrypted images, ensuring privacy and security. It also decrypts images back to their original form.
Una herramienta en Python con interfaz gráfica para combinar múltiples PDFs en uno solo.
PDFGuard is a user-friendly Python application that helps you enhance the security of PDF files by removing potential security threats and hidden content. It does this by converting PDF pages into images and then creating new, sanitized PDFs from these images.
Simple script for extracting questions, answers and so on from test PDFs (for a subject called TS I have at uni) to a more usable format.
Add a description, image, and links to the pdf-conversion topic page so that developers can more easily learn about it.
To associate your repository with the pdf-conversion topic, visit your repo's landing page and select "manage topics."