Skip to content

GussLii/Amazone-movie-score-prediction-for-cs506

Repository files navigation

CS506 Midterm

Starter Code Instructions

  1. Download the train.csv and test.csv files from Kaggle into the data/ folder
  2. Run test_setup.py to make sure you can load the files and print the first few rows of train.csv and test.csv and view their shapes + some visualization
  3. feature_extraction.py will help you to generate features as well as generate X_test.csv which is test.csv but with the features from train.csv and whatever other features you added.
  4. Run predict-knn.py to predict the score using KNN
  5. Run the midterm_for_grade for best prediction

Good luck!