Welcome to the Titanic Survival Status Classification

In this notebook, we explore the Titanic dataset from Kaggle's Titanic competition. The data includes train.csv and test.csv.

The train.csv is provided with the outcomes (the ground truth) for each passenger. This data is used to train the models.

The test.csv does not include the outcomes; it is the models' job to predict them.

Also included is gender_submission.csv, a set of predictions that assume all and only female passengers survive, as an example of what a submission file should look like.
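The rule behind gender_submission.csv can be sketched in a few lines. This is a minimal illustration, not code from the notebook: it assumes the standard Kaggle column names (`PassengerId`, `Sex`, `Survived`), uses a few made-up rows in place of the real test.csv, and relies only on the standard library. The helper name `gender_baseline` is hypothetical.

```python
import csv
import io

# Made-up rows shaped like test.csv (only the columns this rule needs).
test_csv = io.StringIO(
    "PassengerId,Sex\n"
    "892,male\n"
    "893,female\n"
    "894,male\n"
)


def gender_baseline(rows):
    """Predict Survived=1 for female passengers and 0 for everyone else."""
    return [
        {"PassengerId": row["PassengerId"],
         "Survived": 1 if row["Sex"] == "female" else 0}
        for row in rows
    ]


predictions = gender_baseline(csv.DictReader(test_csv))

# Write the predictions in the two-column layout of a submission file.
out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=["PassengerId", "Survived"],
                        lineterminator="\n")
writer.writeheader()
writer.writerows(predictions)
submission_text = out.getvalue()
print(submission_text)
```

To produce a real file, the same rows could be read from data/test.csv and written to disk instead of an in-memory buffer, assuming the CSV files live in the data folder.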

The models used in this notebook are SVC, RandomForestClassifier, and LogisticRegression, all evaluated by accuracy score. The best model achieves 82% accuracy.
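As a rough sketch of how such a comparison might look with scikit-learn, the example below fits the three model classes and scores each with `accuracy_score` on a held-out split. Everything beyond the model names is an assumption: the features are synthetic stand-ins (not the Titanic data), the scaling step, hyperparameters, and split ratio are illustrative choices, and the resulting scores say nothing about the 82% figure reported for the notebook.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for preprocessed features and labels (NOT the real data).
rng = np.random.default_rng(0)
X = rng.normal(size=(400, 4))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)

# Hold out a validation split, since test.csv has no outcomes to score against.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Assumed setup: default-ish hyperparameters, scaling for SVC and LogisticRegression.
models = {
    "SVC": make_pipeline(StandardScaler(), SVC()),
    "RandomForestClassifier": RandomForestClassifier(n_estimators=100, random_state=0),
    "LogisticRegression": make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)),
}

# Accuracy is the fraction of validation passengers classified correctly.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = accuracy_score(y_val, model.predict(X_val))

best_name = max(scores, key=scores.get)
for name, score in scores.items():
    print(f"{name}: {score:.3f}")
print("best:", best_name)
```

Scoring on a validation split carved out of train.csv is one common choice here; cross-validation would be a reasonable alternative when the dataset is small.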

Important Note: GitHub does not render Plotly plots. To see them, please view the notebook on nbviewer.

Chau-Ngoc/titanic-survival-classification

Folders and files

Latest commit

History

Repository files navigation

Welcome to the Titanic Survival Status Classification

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages