Simple document clustering using singular value decomposition (SVD) on a corpus of CNN stories.
cleaning.py: Processes the directory of CNN stories and produces a JSON file
model.py: Main program, which performs the clustering
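
For readers unfamiliar with the approach, here is a minimal sketch of SVD-based document clustering. It is not the code in model.py. The function names (`tfidf_matrix`, `svd_embed`, `kmeans`, `cluster_documents`), the TF-IDF weighting, and the use of k-means as the final grouping step are all assumptions made for illustration, and the four toy sentences stand in for the CNN corpus.

```python
import numpy as np


def tfidf_matrix(docs):
    """Build a TF-IDF matrix: one row per document, one column per term."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    index = {term: j for j, term in enumerate(vocab)}
    counts = np.zeros((len(docs), len(vocab)))
    for i, toks in enumerate(tokenized):
        for tok in toks:
            counts[i, index[tok]] += 1
    doc_freq = (counts > 0).sum(axis=0)        # documents containing each term
    idf = np.log(len(docs) / doc_freq) + 1.0   # smoothed inverse document frequency
    return counts * idf, vocab


def svd_embed(matrix, n_components):
    """Project documents onto the top singular directions (truncated SVD)."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def kmeans(points, n_clusters, iters=20):
    """Plain k-means with deterministic farthest-point initialisation."""
    centers = [points[0]]
    for _ in range(1, n_clusters):
        nearest = np.min(
            [np.linalg.norm(points - c, axis=1) for c in centers], axis=0
        )
        centers.append(points[np.argmax(nearest)])
    centers = np.array(centers)
    labels = np.zeros(len(points), dtype=int)
    for _ in range(iters):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(n_clusters):
            if np.any(labels == j):
                centers[j] = points[labels == j].mean(axis=0)
    return labels


def cluster_documents(docs, n_components, n_clusters):
    """TF-IDF -> truncated SVD -> k-means; returns one cluster label per document."""
    matrix, _ = tfidf_matrix(docs)
    embedded = svd_embed(matrix, n_components)
    return kmeans(embedded, n_clusters)


if __name__ == "__main__":
    # Toy stand-in for the cleaned corpus (hypothetical data).
    docs = [
        "stocks fell as markets reacted to rates",
        "markets rallied and stocks rose on rates",
        "the team won the match in extra time",
        "the coach praised the team after the match",
    ]
    print(cluster_documents(docs, n_components=2, n_clusters=2))
```

On the toy input, the two finance sentences should share one label and the two sports sentences the other. A real corpus would likely need stop-word removal and a sparse SVD routine rather than the dense `np.linalg.svd` used here.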
#TODO Write a blog post explaining this project