GitHub - ItsGreyedOut/Predicting-Hospial-Readmission-for-Diabetes

About this Project:

Our team was interested in predicting hospital readmissions for diabetic patients. We focused on features that impact readmission within a 30 day period, based on the patient's state after being discharged from the hospital.

Business Implications:

The cost of hospital readmission accounts for a large portion of hospital inpatient services spending. Diabetes is not only one of the top ten leading causes of death in the world, but also the most expensive chronic disease in the United States. Hospitalized patients with diabetes are at higher risk of readmission than those without diabetes. American hospitals spend over $41 billion on diabetic patients who are readmitted within 30 days of discharge. Being able to determine factors that lead to higher readmission in such patients, and predicting which patients will get readmitted can help hospitals save millions of dollars while improving quality of care. Therefore, reducing readmission rates for diabetic patients has great potential to reduce medical cost.

Technologies:

Python/Pandas/Sklearn
Keras
Google Colab
SQLite
SQLAlchemy
Flask
D3
HTML/CSS/Bootstrap
Heroku

Approach:

Identify data sources and dependencies
Perform EDA, determine feature set and transform diabetes data
Compile, train and evaluate the model
Compare models for optimization of accuracy metric
Serialize and deserialize model using Keras and SQLlite
Create Flask App and connect routes to model
Create interactive web app using Javascript D3, html and css
Visualize dashboard in Heroku

Data Source

https://www.kaggle.com/iabhishekofficial/prediction-on-hospital-readmission/data Our dataset has 102k rows of data and 49 features.

Architectural Diagram

Preprocessing the Data

Reduced the data set to include only the intersted features we will use for prerdiction (race, age, gender, weight, time_in_hospital, max_glu_serum, insulin, diabetesMed)
Check for and Dropped invalid values (?)
Converted readmitted column to binary field
Determine the number of unique values in each column. Dropped colum (max_glu_serum) since there was only 1 unique value
Convert categorical data to numberic with 'pdget_dummies' - one hot encoding
Split the preprocessed data into a training and testing dataset
Create a StandardScaler instances
Fit the StandardScale
Scale the data

Compile, Train and Evaluate the Model

Define the model
Compile the model
Train the model
Evaluate the model using test data
Export our model to HDF5 file

Using the Model

Website Design

Limitations, Assumptions & Challenges

Limited time of project

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
Resources		Resources
models		models
static/images		static/images
templates		templates
.gitignore		.gitignore
Cleaned_Data.ipynb		Cleaned_Data.ipynb
Procfile		Procfile
README.md		README.md
app.py		app.py
diabetes_model.ipynb		diabetes_model.ipynb
diabetes_model_v.2.ipynb		diabetes_model_v.2.ipynb
logreg_model.py		logreg_model.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About this Project:

Business Implications:

Technologies:

Approach:

Data Source

Architectural Diagram

Preprocessing the Data

Compile, Train and Evaluate the Model

Using the Model

Website Design

Limitations, Assumptions & Challenges

About

Releases

Packages

Languages

ItsGreyedOut/Predicting-Hospial-Readmission-for-Diabetes

Folders and files

Latest commit

History

Repository files navigation

About this Project:

Business Implications:

Technologies:

Approach:

Data Source

Architectural Diagram

Preprocessing the Data

Compile, Train and Evaluate the Model

Using the Model

Website Design

Limitations, Assumptions & Challenges

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages