A Customer Segmentation Application using Kmeans Clustering built with Dash
In this project, I worked with data from the Survey of Consumer Finances (SCF). The SCF is a survey sponsored by the US Federal Reserve. It tracks financial, demographic, and opinion information about families in the United States. The survey is conducted every three years, and we'll work with an extract of the results from 2019.
This Dash App utilizes a Machine Learning model built with KMeans to segment customers based on various features
The App can be can be viewed here
From the US Federal Reserve website:
The Survey of Consumer Finances (SCF) is normally a triennial cross-sectional survey of U.S. families. The survey data include information on families’ balance sheets, pensions, income, and demographic characteristics. Information is also included from related surveys of pension providers and the earlier such surveys conducted by the Federal Reserve Board. No other study for the country collects comparable information. Data from the SCF are widely used, from analysis at the Federal Reserve and other branches of government to scholarly work at the major economic research centers.
- Compare characteristics across subgroups using a side-by-side bar chart.
- Build a k-means clustering model.
- Conduct feature selection for clustering based on variance.
- Reduce high-dimensional data using principal component analysis (PCA).
- Design, build and deploy a Dash web application.
Assuming you have git and heroku cli installed just carry out the following steps:
- Clone the repository
git clone https://github.com/Nneji123/Customer-Segmentation-Dash-Application.git
- Change the working directory
cd Customer-Segmentation-Dash-Application
- Login to Heroku
heroku login
heroku container:login
- Create your application
heroku create your-app-name
Replace your-app-name with the name of your choosing.
- Build the image and push to Container Registry:
heroku container:push web
- Then release the image to your app:
heroku container:release web