left to right: Yingxi Zhao, Wei Lu, Tian Su, Fangjing Xu, Xiongxing Li, Xiaoxuan Guo, Tengran Liu, Xuemin Zhang
This repo host the materials for the joblogic-x data science class 2017
Data folder hosts practice datasets.
Class01-03 are the preparation classes. They contain the basic knowledge of the github, python and python packages such as pandas.
Bootcamp_day1-day4 are the four day data science bootcamp. Contents include data wrangling, modeling, and object oriented programming using python.
Data & dictionary
split data
missing imputation
basice feature selection
dummification
data exploration
fit baseline model use default hyper-parameters
initial model evaluation (validation set)
Co-optimize:
- hyper-parameter tuning
- feature selection
Visualization of optimization