pdf-analysis

Here are 25 public repositories matching this topic...

tfmorris / pdf2table

PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz

information-extraction table-extraction pdf-analysis

Updated Mar 15, 2024
Java

SreejaBethu / Smart-Report-Analyzer

An AI-powered LLM app to analyze and summarize Excel, CSV, and PDF reports using Hugging Face language models. Built with Streamlit.

python nlp question-answering data-analysis summarization huggingface streamlit pdf-analysis llm

Updated Mar 27, 2025
Python

michael-eble / pdf-analysis-word-extraction-word-frequencies

PDF analysis: extracts words and their frequencies from PDF files to prepare text data for topic analysis of annual reports of German car manufacturers (e.g. Volkswagen, Porsche, and Audi). Note that words are only extracted; stemming is not applied. To improve this, use nltk.stem.snowball.Snowba…

natural-language-processing german nlp-parsing pdf-analysis

Updated Sep 11, 2019
Python
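
The word-frequency step described in the entry above can be illustrated with a minimal sketch. This is not the repository's code: the function name `word_frequencies` and the sample text are hypothetical, the text is assumed to have already been extracted from the PDF by some other tool, and, as in the description, no stemming is applied.

```python
import re
from collections import Counter


def word_frequencies(text: str) -> Counter:
    """Count how often each word occurs in already-extracted text.

    Hypothetical helper for illustration only; no stemming is applied,
    so inflected forms of a word are counted separately.
    """
    # Lowercase the text, then collect runs of letters,
    # including German umlauts and the sharp s.
    words = re.findall(r"[a-zäöüß]+", text.lower())
    return Counter(words)


# Made-up sample sentence standing in for text extracted from a report.
sample = "Der Umsatz stieg. Der Gewinn stieg ebenfalls."
freqs = word_frequencies(sample)

# Print each word with its count, most frequent first.
for word, count in freqs.most_common():
    print(word, count)
```

Because stemming is skipped, forms such as "stieg" and "steigen" would be counted as different words, which is the limitation the description itself points out.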

jlmayorgaco / r-biblio-synth

This project focuses on automating the analysis and reporting of bibliometric data, specifically targeting the annual production of academic articles. The primary goal is to understand trends, anomalies, and patterns in bibliometric data through a combination of statistical modeling and exploratory data analysis.

data-science data-visualization data-analysis scopus r-language systematic-reviews anomaly-detection bibliometrics literature-review data-analysis-in-r report-generation regression-modeling pdf-analysis research-tools bibliometrix-package llm-integration trending-analysis reserach-

Updated Oct 3, 2025
R

lhiebert01 / GenAI_PDF_App

Advanced PDF analysis and question-answering application powered by Google's Gemini Pro AI. Upload PDFs and get intelligent, structured responses to your questions about the document content

python streamlit pdf-analysis langchain genai-chatbot gemini-pro

Updated Dec 2, 2024
Python

MaliosDark / Pdf-infected-Virus-Scanner-Online

A secure, AI-enhanced file scanning tool built on Flask, strengthened with ClamAV and PDF analysis, designed to vigilantly detect digital threats and potential vulnerabilities.

flask ai sqlite web-application clamav cybersecurity malware-detection ai-security threat-detection pdf-analysis digital-security file-scanning

Updated Sep 9, 2024
HTML

RaghuSharma14 / PDF-Reader

A PDF Reader application powered by AI, allowing users to upload PDF documents and extract meaningful information using advanced NLP models. Built with Streamlit, Transformers, and Langchain, this app provides a seamless interface for interacting with and analyzing PDF content.

machine-learning automation transformers text-extraction pdf-reader pdf-extraction streamlit pdf-analysis langchain natural-language-processing-nlp

Updated Apr 24, 2025
Python

MahirSalahin / chat-pdf

A RAG project. Chat PDF

chat-application faiss streamlit pdf-analysis langchain chat-pdf gemini-chat

Updated Aug 30, 2024
Python

bylickilabs / pdfAnalyzer

PDF Analyzer is an efficient Python tool for the automatic analysis of PDF documents.

python cli open-source metadata pdf text-mining automation reporting document-analysis document-processing file-analyzer pdf-extraction streamlit pdf-analysis file-inspector

Updated Jun 30, 2025
Python

FrancescoRomeo02 / multimodalragApp

Advanced multimodal RAG system for querying PDF documents with text, images, and tables using vector embeddings, semantic chunking, and LLMs via Groq API

nlp machine-learning ai computer-vision chatbot semantic-search multimodal rag groq streamlit pdf-analysis document-intelligence qdrant langchain

Updated Jul 29, 2025
Python

Ouns-AN / pdf-page-counter

Offline web app to count pages in PDF files using PDF.js

pdf counter csv offline simple drag-and-drop pyqt5 vanilla-js client-side pdfjs pdf-tools page-counter pdf-analysis simple-tools

Updated Oct 4, 2025
JavaScript

mkapulica / PDF-Page-Counter

An extremely fast and user-friendly PDF page counter app for multiple PDF files.

python pdf-tools pdf-analysis pdf-page-count

Updated Jun 10, 2024
Python

Rakshath66 / Chat-With-Your-PDF

Streamlit-based chatbot to interact with PDFs using Retrieval-Augmented Generation (RAG), FAISS, Sentence Transformers, and Mistral LLM

Updated Jul 3, 2025
Python

PKHarsimran / IOC-Inspector

Fast, SOC‑ready malicious document scanner that turns suspicious PDFs, DOC(X), XLS(X), and RTFs into IOC‑rich, SIEM‑friendly reports.

python cli ioc static-analysis cybersecurity malware-analysis threat-intelligence abuseipdb virus-total soc-tools pdf-analysis office-macros

Updated Jul 23, 2025
Python

colingalbraith / OpenRAGSearch

Local RAG-powered document analysis platform with PDF QA, Ollama integration, and citation-aware search.

semantic-search offline-app rag fastapi vector-search pdf-analysis document-intelligence langchain chromadb local-llm retrieval-augmented-generation ollama

Updated Jul 26, 2025
JavaScript

fresh-milkshake / page-parser

Intelligent PDF document analysis with AI-powered chart understanding

computer-vision pdf-analysis openai-api

Updated Aug 29, 2025
Jupyter Notebook

IrinaDragunow / multimodal-document-assistant

AI-Powered Document Assistant | Multimodal Processing (PDF + Images) | Enterprise Automation Demo | Proven ROI: 2,670% | Professional ML Portfolio

image-processing business-automation document-processing api-integration pdf-analysis document-automation streamlit-app openai-gpt4 multimodal-ai enterprise-ai-engineering business-roi ml-portfolio

Updated Aug 21, 2025
Python

1reverseengineer / pdfid-for-arch

ArchLinux packaged version of the kali-linux pdf analysis tool pdfid. Original author is DidierStevensSuite! His license applies!

pdf security protection malware-analysis pdf-analyzer pdf-analysis

Updated Apr 28, 2023
Python

rohanag03 / PDF-Insights

This project uses Google's Generative AI to analyze and answer questions about PDF content. It provides a user-friendly interface to upload PDFs and receive insightful answers generated by the Gemini AI model.

python gemini-api google-ai pdf-analysis generative-ai

Updated Jun 16, 2024
Python

marcusmcb / ai-pdf-tutor

Demo AI app that summarizes PDF documents via text & voice

text-to-speech ai data-parsing pdf-analysis

Updated Jul 1, 2025
TypeScript
