# pdf-analysis

Here are 25 public repositories matching this topic...

PDF Analysis: Extracts words and their frequencies from PDF files to prepare text data for topic analysis of annual reports from German car manufacturers (e.g. Volkswagen, Porsche, and Audi). Note that words are only extracted; stemming is not applied. To improve this, use nltk.stem.snowball.Snowba…

  • Updated Sep 11, 2019
  • Python
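
The word-frequency step described above can be illustrated with a minimal sketch. It is not taken from the repository: the function name `word_frequencies` and the sample sentence are hypothetical, and the text is assumed to have already been extracted from the PDF by some other tool. No stemming is applied, matching the description.

```python
import re
from collections import Counter


def word_frequencies(text: str) -> Counter:
    """Count how often each word occurs in already-extracted text.

    Hypothetical helper for illustration; not the repository's code.
    Words are lowercased, and digits and punctuation are ignored.
    """
    # [^\W\d_]+ matches runs of Unicode letters only, so German
    # umlauts and ß are kept while numbers and underscores are dropped.
    words = re.findall(r"[^\W\d_]+", text.lower())
    return Counter(words)


# Made-up sample sentence standing in for extracted annual-report text.
sample = "Der Umsatz stieg. Der Gewinn stieg ebenfalls."
freqs = word_frequencies(sample)
print(freqs.most_common(2))
```

Returning a `Counter` keeps the result usable both as a plain dictionary and as a ranked list via `most_common`, which is convenient when the frequencies feed a later topic-analysis step.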

This project focuses on automating the analysis and reporting of bibliometric data, specifically targeting the annual production of academic articles. The primary goal is to understand trends, anomalies, and patterns in bibliometric data through a combination of statistical modeling and exploratory data analysis.

  • Updated Oct 3, 2025
  • R

A PDF Reader application powered by AI, allowing users to upload PDF documents and extract meaningful information using advanced NLP models. Built with Streamlit, Transformers, and Langchain, this app provides a seamless interface for interacting with and analyzing PDF content.

  • Updated Apr 24, 2025
  • Python
