The 15th place solution of Sberbank Contest on Kaggle https://www.kaggle.com/c/sberbank-russian-housing-market Kaggle Discussions thread: https://www.kaggle.com/c/sberbank-russian-housing-market/discussion/35700
Models:
- model_1_price_per_meter.ipynb - the model fitting price_doc/full_sq
- model_2_price.ipynb - the model fitting price_doc
- model_3_price_kaggle.ipynb - my version of the public kernel https://www.kaggle.com/aharless/latest-iteration-in-this-silly-game/code
- mixing_final_models.ipynb - final averaging of the models
Data: I'm really sorry for that but I had to delete the data according to the competition rules. Here is the list of files needed for the models:
- train_sber_corrected.csv - train data after correction from Sberbank https://www.kaggle.com/c/sberbank-russian-housing-market/discussion/34364
- test_sber_corrected.csv - test data after correction from Sberbank https://www.kaggle.com/c/sberbank-russian-housing-market/discussion/34364
- macro.csv - macroeconomic data from Sberbank
- train_corr_clean.csv - train_sber_corrected.csv after cleaning with https://www.kaggle.com/keremt/very-extensive-cleaning-by-sberbank-discussions
- test_corr_clean.csv - test_sber_corrected.csv after cleaning with https://www.kaggle.com/keremt/very-extensive-cleaning-by-sberbank-discussions
And the output of models will be:
- model_1_output.csv - output of model_1_price_per_meter.ipynb
- model_2_output.csv - output of model_2_price.ipynb
- model_3_output.csv - output of model_3_price_kaggle.ipynb
- model_entire.csv - the weighted average, output of mixing_final_models.ipynb