MHTCET Cutoff PDF to Excel Converter

Overview

This Python project reads a raw MHT CET cutoff PDF, extracts the college and branch cutoffs, and writes them to a JSON file (data.json). Lines that could not be parsed are excluded from the JSON data and saved in a "skipped" folder as pageNo.txt files, so they can be checked by hand. The final output is an Excel file (output.xlsx) containing the organized cutoff data.
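The parse-or-skip step described above can be sketched roughly as follows. This is a minimal illustration, not the code in main.py: the function name `split_pages`, the record fields, and the line pattern `CUTOFF_LINE` are all hypothetical, and the real PDF layout is likely more involved. The sketch takes page text as plain strings, which in practice would come from a PDF library such as pypdf.

```python
import re
from pathlib import Path

# Hypothetical line format, for illustration only:
# "<CATEGORY> <rank> (<percentile>)"
CUTOFF_LINE = re.compile(
    r"^(?P<category>[A-Z]+)\s+(?P<rank>\d+)\s+\((?P<percentile>\d+(?:\.\d+)?)\)$"
)


def split_pages(page_texts, skipped_dir):
    """Parse each page's lines into records.

    Lines that do not match CUTOFF_LINE are written to
    <skipped_dir>/<pageNo>.txt instead of being returned.
    """
    records = []
    skipped_dir = Path(skipped_dir)
    for page_no, text in enumerate(page_texts, start=1):
        skipped = []
        for line in text.splitlines():
            line = line.strip()
            if not line:
                continue
            match = CUTOFF_LINE.match(line)
            if match:
                records.append({
                    "page": page_no,
                    "category": match["category"],
                    "rank": int(match["rank"]),
                    "percentile": float(match["percentile"]),
                })
            else:
                skipped.append(line)
        if skipped:
            # Create the folder only when a page actually has skipped lines.
            skipped_dir.mkdir(parents=True, exist_ok=True)
            out_file = skipped_dir / f"{page_no}.txt"
            out_file.write_text("\n".join(skipped) + "\n", encoding="utf-8")
    return records
```

The returned list can then be written out with `json.dump`, while the skipped files show which lines on which page need manual attention.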

Usage

Run main.py.
Provide the path to the MHT CET cutoff PDF file.
This creates the data.json file.
Next, run DataMigrater.py to create the final Excel file (output.xlsx).
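For readers curious what the JSON-to-Excel step might look like, here is a minimal sketch. It assumes data.json holds a flat list of objects, which may not match the structure DataMigrater.py actually uses. The helper names `records_to_rows`, `write_xlsx`, and `convert` are hypothetical.

```python
import json


def records_to_rows(records):
    """Flatten a list of dicts into a header row plus one row per record."""
    header = []
    for record in records:
        for key in record:
            if key not in header:
                header.append(key)
    rows = [header]
    for record in records:
        # Missing keys become empty cells (None).
        rows.append([record.get(key) for key in header])
    return rows


def write_xlsx(rows, path="output.xlsx"):
    """Write rows to the active sheet of a new workbook."""
    # Imported here so records_to_rows works without openpyxl installed.
    from openpyxl import Workbook

    workbook = Workbook()
    sheet = workbook.active
    for row in rows:
        sheet.append(row)
    workbook.save(path)


def convert(json_path="data.json", xlsx_path="output.xlsx"):
    """Read the JSON file and write its records as a spreadsheet."""
    with open(json_path, encoding="utf-8") as handle:
        records = json.load(handle)
    write_xlsx(records_to_rows(records), xlsx_path)
```

Keeping the flattening separate from the workbook writing makes it easy to inspect the rows before anything is saved.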

Requirements

Linux (Debian-based distributions)

```
sudo apt-get update
sudo apt-get install python3-pip
pip3 install pypdf openpyxl
```

Windows

Install Python 3.x from the official Python downloads page.
Open a command prompt (cmd) or PowerShell.
Run the following command:
```
pip install pypdf openpyxl
```

macOS

Install Python 3.x (if not already installed) using Homebrew or the official website.
Open Terminal.
Run the following command:
```
pip3 install pypdf openpyxl
```
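
On any of the platforms above, a quick way to confirm that both dependencies were installed is to ask Python whether it can find them. The snippet below is an optional sketch using only the standard library. The helper name `missing_packages` is hypothetical and not part of this repository.

```python
import importlib.util

# Packages installed by the commands in this section.
REQUIRED = ["pypdf", "openpyxl"]


def missing_packages(names):
    """Return the names from `names` that Python cannot find for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

Calling `missing_packages(REQUIRED)` returns an empty list when both packages are available in the current Python environment.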

Feel free to contribute or report issues on GitHub!

Sample Data and Output

The out folder in this repository contains the following files:

Sample PDF (2023 CET CAP Round 1 Cut-off): The raw PDF file containing the MHT CET college and branch cutoffs for 2023 CAP Round 1. This is the input file that the program processes.
Final Output (output.xlsx): The Excel file produced by running main.py and then DataMigrater.py on the sample PDF. It contains the organized, structured cutoff data for colleges and branches.

Create a personalised college list in under 7 minutes using MHTCET-cutoff-pdf-to-excel and Excel: https://bit.ly/akhi-spedRunMHTCETgd

TheTechTiger/MHTCET-cutoff-pdf-to-excel
