Skip to content

This project understands how the student's performance (test scores) is affected by other variables such as Gender, Ethnicity, Parental level of education, Lunch and Test preparation course.

Notifications You must be signed in to change notification settings

Revanth-Reddy-Pingala/End_to_End_ML_Project_Student_Performance_Prediction

Repository files navigation

End_to_End_ML_Project_Student_Performance_Prediction

Problem statement

  • This project understands how the student's performance (test scores) is affected by other variables such as Gender, Ethnicity, Parental level of education, Lunch and Test preparation course.

Steps followed for this Machine Learning Project

  • Understanding the Problem Statement
  • Dataset
  • Data Checks to perform
  • Exploratory Data Analysis
  • Data Pre-Processing
  • Model Training
  • Selecting the best model

Sequence of steps followed until Deployment

  • Github and Code Set Up
  • Project Structure, Logging And Exception Handling
  • Project Problem Statement, EDA And Model Training
  • Data Ingestion
  • Data Transformation using Pipelines
  • Model Training and Model Evaluating Component
  • Model Hyper Parameter Tuning
  • Created Prediction Pipeline using Flask Web App

Dataset

Dataset information

  • gender : sex of students -> (Male/female)
  • race/ethnicity : ethnicity of students -> (Group A, B,C, D,E)
  • parental level of education : parents' final education ->(bachelor's degree, some college, master's degree, associate's degree, high school)
  • lunch : having lunch before test (standard or free/reduced)
  • test preparation course : complete or not complete before test
  • math score
  • reading score
  • writing score

Data Checks Performed

  • Missing values
  • Duplicates
  • data type
  • the number of unique values of each column
  • statistics of data set
  • various categories present in the different categorical column

Exploratory Data Analysis (EDA)

Final Conclusions from EDA

  • Student's Performance is related with lunch, race, parental level education
  • Females lead in pass percentage and also are top-scorers
  • Student's Performance is not much related with test preparation course
  • Finishing preparation course is benefitial.

Models used for Training

  • Linear Regression
  • Lasso
  • Ridge
  • K-Neighbours Regressor
  • Decision Tree
  • Random Forest Regressor
  • XGB Regressor
  • CatBoosting Regressor
  • AdaBoost Regressor

Run Locally in your computer

Clone the project

  git clone https://github.com/Revanth-Reddy-Pingala/End_to_End_ML_Project_Student_Performance_Prediction

Go to the project directory

  cd my-project

After setting up environment and installing packages Run

  python app.py

Screenshots

Screenshot 2024-01-29 at 1 59 49 PM Screenshot 2024-01-29 at 2 00 22 PM

Links

linkedin Blogger Instagram

About

This project understands how the student's performance (test scores) is affected by other variables such as Gender, Ethnicity, Parental level of education, Lunch and Test preparation course.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published