Skip to content

This repository provides code for a machine learning model that predicts the likelihood of stroke occurrence based on various risk factors.

Notifications You must be signed in to change notification settings

sarax0/brain-stroke-prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Brain Stroke Prediction

This project implements machine learning models to predict the likelihood of stroke occurrence based on various risk factors. It explores techniques for data exploration, pre-processing, model development, and hyperparameter tuning.

image

Exploratory Data Analysis (EDA)

Implements binning for numerical variables. Performs feature encoding for categorical data. Visualizes data distributions using kernel density estimation (KDE). Applies data transformations like MinMaxScaler and log scaling for improved modeling.

Data Preprocessing ( Data Preparation)

Addresses class imbalance issues to optimize model performance.

Model Development ( Machine Learning Models)

Implements and compares various classification models including:

  • Logistic Regression
  • Random Forest Classifiers
  • Decision Tree Classifiers
  • KNeighbors Classifiers
  • XGBoost Classifier
  • Gaussian Naive Bayes Utilizes GridSearchCV for hyperparameter tuning to optimize model performance.

Ensemble Learning ( Combining Models for Strength)

Implements Voting Classifier to leverage the strengths of multiple models for a more robust prediction.

Models' Performance Comparison

--- RUS ROS SMOT Original
Logistic Regression 75% 74% 74% -
KNeighbors Classifiers 66% 84% 78% -
XGBoost Classifier 75% 91% 92% -
ANN 72% - 88% 93.9%

About

This repository provides code for a machine learning model that predicts the likelihood of stroke occurrence based on various risk factors.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •