EDA and Modelling of Retracted Papers

Project Overview

This project performs exploratory data analysis (EDA) and predictive modeling on retracted papers. The aim is to understand the characteristics of retracted papers and to develop models to predict retractions.

Directory Structure

your-github-repo/
│
├── README.md
├── requirements.txt
│
├── data/
│   ├── raw/
│   └── processed/
│
├── src/
│   ├── eda/
│   │   └── EDA_retraction.py
│   ├── modeling/
│   │   ├── Preparation_modelling.py
│   │   ├── predictive_modelling_approach_2.py
│   │   ├── predictive_modeling_approach_3.py
│   │   └── confusion_matrix_random_forest.py
│
├── results/
│   ├── figures/
│   └── reports/
│
└── scripts/
    ├── run_modeling.py
    └── run_all.py

Setup Instructions

Prerequisites

Python 3.6 or higher
Git (optional, for cloning the repository)

Steps to Run the Project

Clone the Repository:

git clone https://github.com/your-username/your-repo.git
cd your-repo

Create a Virtual Environment (Recommended):
Install Dependencies:
```
pip install -r requirements.txt
```
Run the python file: For example
```
python scripts/run_all.py
```

Explanation of Key Scripts

EDA_retraction.py

This script performs exploratory data analysis on the retracted papers dataset. It generates visualizations and descriptive statistics to understand the characteristics of the data. Before this data cleaning has been done.

Preparation_modelling.py

This script prepares the data for modeling by preprocessing and transforming the dataset. It ensures the data is in the correct format for the predictive models.

predictive_modelling_approach_1.py

check https://github.com/bibekdhakal/research-retraction

predictive_modelling_approach_2.py

This script implements the second approach for predictive modeling. It trains and evaluates a machine learning model to predict retractions.

predictive_modeling_approach_3.py

This script implements the third approach for predictive modeling. It trains and evaluates another machine learning model to predict retractions. Applying clustering techniques (e.g., K-Means) to group similar data points, thereby capturing underlying patterns in the data.

confusion_matrix_random_forest.py

This script generates a confusion matrix for the Random Forest model. It evaluates the performance of the model and visualizes the results.

run_all.py

This script orchestrates the execution of all the key scripts in the correct order. It ensures that the entire workflow from data preparation to model evaluation is completed.

Results

The results of the analysis and modeling are saved in the results directory. This includes figures. Based on this a report has been made using Texmaker.

Contact

For any questions or issues, please contact at maheshtwari99@gmail.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EDA and Modelling of Retracted Papers

Project Overview

Directory Structure

Setup Instructions

Prerequisites

Steps to Run the Project

Explanation of Key Scripts

EDA_retraction.py

Preparation_modelling.py

predictive_modelling_approach_1.py

predictive_modelling_approach_2.py

predictive_modeling_approach_3.py

confusion_matrix_random_forest.py

run_all.py

Results

Contact

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
data		data
results		results
scripts		scripts
src		src
.DS_Store		.DS_Store
.gitattributes		.gitattributes
README.md		README.md
requirements.txt		requirements.txt

mahesh989/EDA-and-Modelling-of-Retracted-Papers

Folders and files

Latest commit

History

Repository files navigation

EDA and Modelling of Retracted Papers

Project Overview

Directory Structure

Setup Instructions

Prerequisites

Steps to Run the Project

Explanation of Key Scripts

EDA_retraction.py

Preparation_modelling.py

predictive_modelling_approach_1.py

predictive_modelling_approach_2.py

predictive_modeling_approach_3.py

confusion_matrix_random_forest.py

run_all.py

Results

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages