Phishing-Classifier

Introduction

The Internet has become an indispensable part of our life, However, It also has provided opportunities to anonymously perform malicious activities like Phishing. Phishers try to deceive their victims by social engineering or creating mockup websites to steal information such as account ID, username, password from individuals and organizations. Although many methods have been proposed to detect phishing websites, Phishers have evolved their methods to escape from these detection methods. One of the most successful methods for detecting these malicious activities is Machine Learning. This is because most Phishing attacks have some common characteristics which can be identified by machine learning methods.

Tech Stack Used

Python
Flask
Machine learning algorithms
Pandas
Scikit-Learn

Installation

The Code is written in Python 3.7.6. If you don't have Python installed you can find it here. If you are using a lower version of Python you can upgrade using the pip package, ensuring you have the latest version of pip. To install the required packages and libraries, run this command in the project directory after cloning the repository:

Project Archietecture

Training Successful Screenshots

Prediction Successful Screenshots

Step 1: Clone the repository

git clone https://github.com/jatin-12-2002/phishing-classifier.git

Step 2- Create a conda environment after opening the repository

conda create -p env567 python=3.7.6 -y

conda activate env567/

Step 3 - Install the requirements

pip install -r requirements.txt

Step 4 - Run the application server

python main.py

Step 5. Train application

http://localhost:5000/train

Step 6. Prediction application

http://localhost:5000/predict

Result

Accuracy of various model used for URL detection

Feature importance for Phishing URL Detection

Conclusion

The final take away form this project is to explore various machine learning models, perform Exploratory Data Analysis on phishing dataset and understanding their features.
Creating this notebook helped me to learn a lot about the features affecting the models to detect whether URL is safe or not, also I came to know how to tuned model and how they affect the model performance.
The final conclusion on the Phishing dataset is that the some feature like "SSLfinal_State", "URL_of_Anchor", "web_traffic" have more importance to classify URL is phishing URL or not.
XGBoost Classifier currectly classify URL upto 97.1% respective classes and hence reduces the chance of malicious attachments.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
DataTransform_Training		DataTransform_Training
DataTransformation_Prediction		DataTransformation_Prediction
DataTypeValidation_Insertion_Prediction		DataTypeValidation_Insertion_Prediction
DataTypeValidation_Insertion_Training		DataTypeValidation_Insertion_Training
Prediction_Batch_files		Prediction_Batch_files
Prediction_Raw_Data_Validation		Prediction_Raw_Data_Validation
Training_Batch_Files		Training_Batch_Files
Training_Raw_data_validation		Training_Raw_data_validation
application_exception		application_exception
application_logging		application_logging
best_model_finder		best_model_finder
data_ingestion		data_ingestion
data_preprocessing		data_preprocessing
file_operations		file_operations
notebooks		notebooks
screenshots		screenshots
templates		templates
.gitignore		.gitignore
README.md		README.md
flask_monitoringdashboard.db		flask_monitoringdashboard.db
main.py		main.py
phising.csv		phising.csv
predictFromModel.py		predictFromModel.py
prediction_Validation_Insertion.py		prediction_Validation_Insertion.py
requirements.txt		requirements.txt
schema_prediction.json		schema_prediction.json
schema_training.json		schema_training.json
setup.py		setup.py
trainingModel.py		trainingModel.py
training_Validation_Insertion.py		training_Validation_Insertion.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Phishing-Classifier

Introduction

Tech Stack Used

Installation

Project Archietecture

Training Successful Screenshots

Prediction Successful Screenshots

Step 1: Clone the repository

Step 2- Create a conda environment after opening the repository

Step 3 - Install the requirements

Step 4 - Run the application server

Step 5. Train application

Step 6. Prediction application

Result

Conclusion

About

Releases

Packages

Languages

jatin-12-2002/phishing-classifier

Folders and files

Latest commit

History

Repository files navigation

Phishing-Classifier

Introduction

Tech Stack Used

Installation

Project Archietecture

Training Successful Screenshots

Prediction Successful Screenshots

Step 1: Clone the repository

Step 2- Create a conda environment after opening the repository

Step 3 - Install the requirements

Step 4 - Run the application server

Step 5. Train application

Step 6. Prediction application

Result

Conclusion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages