Gradient Boosting on the Lending Club dataset

In this Mini Project we will explore the use of pre-processing methods and Gradient Boosting on the popular Lending Club dataset. We are provided with two files: loan train.csv and loan test.csv.

We have be to pre-process the data appropriately, and then apply gradient boosting to classify whether a customer should be given a loan or not.

The target attribute is in the column loan status, which has values “Fully Paid” for which we can assign +1 to, and “Charged off” for which we can assign -1 to. The other records with loan status values “Current” (in both train and test) are not relevant to this problem.

We will be mainly working on the following:

(a) Pre-process the data as needed to apply the classifier to the training data.

(b) Apply gradient boosting using the function sklearn.ensemble.GradientBoostingClassifier
for training the model.

Lending Club Dataset

Training Data : loan_train.csv

Testing Data : loan_test.csv

Analysis

Proper Analysis can be found in the Report

Authors

@AyanPahari

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Gradient Boosting.ipynb		Gradient Boosting.ipynb
README.md		README.md
Report.pdf		Report.pdf
loan_test.csv		loan_test.csv
loan_train.csv		loan_train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gradient Boosting on the Lending Club dataset

Lending Club Dataset

Analysis

Authors

About

Releases

Packages

Languages

AyanPahari/Gradient-Boosting-on-the-Lending-Club-dataset.

Folders and files

Latest commit

History

Repository files navigation

Gradient Boosting on the Lending Club dataset

Lending Club Dataset

Analysis

Authors

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages