Skip to content

PDF metadata reader. Scan PDF files looking for their metadata.

Notifications You must be signed in to change notification settings

rubenhortas/pdfMetadata

Repository files navigation

pdfMetadata

pdfMetadata is a python script that gets and displays metadata from PDF files.

pdfMetadata can extract data from one PDF file or for all PDF files in a directory (scans recursively).

GitHub repo file count GitHub code size in bytes GitHub repo size

GitHub issues GitHub closed issues GitHub pull requests GitHub closed pull requests GitHub all releases

REQUIREMENTS

  • Python3
  • PyPDF2
  • colorama

INSTALLATION

Create a virtual environment (or venv)

python3 -m venv pmvenv

You only have to do this the first time

Activate the venv

  • Linux/macOS
source pmvenv/bin/activate
  • Windows
pmvenv\Scripts\activate

You have to activate the venv every time you want to use it

Install the requirements

pip install -r requirements.txt

You only have to do this the first time

USAGE

python3 pdfmetadata options
usage: pdfMetadata [-h] [-t [log_file.txt]] [-c [log_file.csv]] [-a] ARGUMENTS [ARGUMENTS ...]

Scan pdf files looking for their metadata.

positional arguments:
  ARGUMENTS             file[s] or path[s] to scan pdf files

options:
  -h, --help            show this help message and exit
  -t [log_file.txt], --txt [log_file.txt]
                        Saves the output into a plain text file.
  -c [log_file.csv], --csv [log_file.csv]
                        Saves the output into a csv file.
  -a, --show-all        Shows scanned non-PDF files.

Troubleshooting

In case of any problem create an issue

Discussions

If you want ask (or answer) a question, leave an opinion or have an open-ended conversation you can create (or join) a discussion

Support

If you find this application useful you can star this repo.