Skip to content

Predict the survival status of the passengers aboard the Titanic.

License

Notifications You must be signed in to change notification settings

Chau-Ngoc/titanic-survival-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Welcome to the Titanic Survival Status Classification

In the notebook, we will explore the Titanic dataset collected from Kaggle's Titanic competition. The data includes train.csv and test.csv.

The train.csv is provided with the outcomes (the ground truth) for each passenger. This data is used to train the models.

The test.csv does not include the outcomes, it's the models' job the predict.

Also included is gender_submission.csv, a set of predictions that assume all and only female passengers survive, as an example of what a submission file should look like.

The models used in this notebook are SVC, RandomForestClassifier and LogisticRegression and are evaluated on accuracy score. The best model achieves 82% accuracy.

Important Note: due to the fact that Github does not render plotly's plots, to be able to see the plotly's plots, please visit nbviewer

Releases

No releases published

Packages

No packages published