ds_school

test assignments for the data science school - Roonyx

Assignment 1.

Choose data set from Kaggle repo. Data set shouldn't be analysed before in the available tutorials.
Explore main data set features and target labels. (+ 0.5 score per unique method. Min score 2, Max score 4)
Feature engineering for the new parameters (+ 1 score per unique feature. Min score 2, Max score 4)
Choose and display statistics for the observations. In this step you need to create Statistical hypothesis and test them. Hypothesis should be meaningful and display some patterns in the data set (+ 3 score per hypothesis. Min score 6, Max score 12)
Visualise explored features and hypothesis (+ 1 score per plot. Min score 6, Max score 10)

Useful resources:

Assignment 2.

With selected data set:

Calculate entropy for full data set and for 2 selected groups. What is the information gain for such split? (+2 score)
Calculate Gini index for the same groups and compare results (+2 score)
Train a decision tree (DecisionTreeClassifier, random_state = 17) (+2 score)
Find the optimal maximum depth using 5-fold cross-validation (GridSearchCV) (+2 score)
Display final tree as an image (+2 score)

Useful resources:

Assignment 3.

With selected data set (if applicable, or change from school repo):

Create and train BaggingClassifier (+2 score)
Create and train RandomForestClassifier (+2 score)
Create and train Linear classifier (+2 score)
Create and train k Nearest Neighbors classifier (+2 score)
Compare models accuracy
Create an ensemble of models and estimate classification accuracy
Display different accuracy metrics for model (+ 1 score per metric. Min score 2, Max score 4)

Assignment 4.

With selected data set (if applicable, or change from school repo):

Create and train BaggingRegressor (+2 score)
Create and train RandomForestRegressor (+2 score)
Create and train Logistic Regression model (+2 score)
Create and train k Nearest Neighbors Regression (+2 score)
Compare models accuracy
Create an ensemble of models and estimate classification accuracy
Display different accuracy metrics for model (+ 1 score per metric. Min score 2, Max score 4)

Useful resources:

Assignment 5.

With selected data set (if applicable, or change from school repo):

Create and train AdaBoostClassifier (+2 score)
Create and train XGBoostClassifier (+2 score)
Create and train LightGBM Classifier (+2 score)
Create and train CatBoostClassifier (+2 score)
Compare accuracy for models (+2 score)

Final Assignments

Marketing data (one of the data topic per group):

Upwork analysis
Facebook CTF analysis

Face recognition task

Emotion recognition
Age recognition and gender recognition
Pose estimation and motion extraction

Sequence models

voice timbre detection
voice script recognition
??

Kaggle competition. Join one of the open competitions and create a kernel.

Name		Name	Last commit message	Last commit date
Latest commit History 114 Commits
1 Exploratory Data Analysis		1 Exploratory Data Analysis
2 DecisionTree		2 DecisionTree
3 Classification models		3 Classification models
4 Regression models		4 Regression models
5 Boosting		5 Boosting
NN		NN
Neural_networks		Neural_networks
.gitignore		.gitignore
Neural_Net.py		Neural_Net.py
README.md		README.md
hfi_cc_2018.csv		hfi_cc_2018.csv
lecture3-bagging_randomforest.ipynb		lecture3-bagging_randomforest.ipynb
lecture4-Linear-Regression.ipynb		lecture4-Linear-Regression.ipynb
lecture5-K-Means.ipynb		lecture5-K-Means.ipynb
lecture5-Principal-Component-Analysis.ipynb		lecture5-Principal-Component-Analysis.ipynb
m.csv		m.csv
m1.csv		m1.csv
m2.csv		m2.csv
master.csv		master.csv
test.csv		test.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ds_school

Assignment 1.

Assignment 2.

Assignment 3.

Assignment 4.

Assignment 5.

Final Assignments

About

Releases

Packages

Contributors 9

Languages

alinapetukhova/ds_school

Folders and files

Latest commit

History

Repository files navigation

ds_school

Assignment 1.

Assignment 2.

Assignment 3.

Assignment 4.

Assignment 5.

Final Assignments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 9

Languages

Packages