This is the solution repository of the 2nd place team monkeytyping, licensed under the Apache License 2.0.
Python 2.7
Tensorflow 1.0
Numpy 1.12
GNU Bash
For an understanding of our system, read the report of our solution:
Our source code:
Training scripts (training a model may take 3-5 days) are in
youtube-8m-wangheda/training_scripts
youtube-8m-zhangteng/train_scripts
Eval scripts for selecting best performing checkpoints
youtube-8m-wangheda/eval_scripts
youtube-8m-zhangteng/eval_scripts
Infer scripts for generating intermediate files used by ensemble scripts
youtube-8m-wangheda/infer_scripts
youtube-8m-zhangteng/infer_scripts
Ensemble scripts
youtube-8m-ensemble/ensemble_scripts
There are some conventions that we use in our code:
models are saved in
./model
train1 data is saved in
/Youtube-8M/data/frame/train
/Youtube-8M/data/video/train
validate1 data is saved in
/Youtube-8M/data/frame/validate
/Youtube-8M/data/video/validate
test data is saved in
/Youtube-8M/data/frame/test
/Youtube-8M/data/video/test
train2 data is saved in
/Youtube-8M/data/frame/ensemble_train
/Youtube-8M/data/video/ensemble_train
validate2 data is saved in
/Youtube-8M/data/frame/ensemble_validate
/Youtube-8M/data/video/ensemble_validate
intermediate results are stored in
/Youtube-8M/model_predictions/ensemble_train/[method]
/Youtube-8M/model_predictions/ensemble_validate/[method]
/Youtube-8M/model_predictions/test/[method]
- Train a single model
- evaluate the checkpoints to get the best one
- infer the checkpoint to get intermediate result.
- Write a configuration file
- train a stacking model
- evaluate the stacking model and pick the best checkpoint
- infer the checkpoint to get a submission file
Some of the single models are developed by Heda and some by Teng, so they are distributed in two folders.
Bagging models are in youtube-8m-wangheda/bagging_scripts
.
Boosting and distillation models are in youtube-8m-wangheda/bagging_scripts
.
Cascade models are in youtube-8m-wangheda/cascade_scripts
.
Stacking models are in youtube-8m-ensemble/ensemble_scripts
.