Recommender-System

A content-based movie recommendation system built on the MovieLens dataset.

Dataset Description:

The files movie.csv, rating.csv, and tag.csv are used to build the system.
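
A content-based system usually needs one text description per movie before any features can be extracted. The sketch below shows one way that could look, assuming movie.csv has movieId, title and genres columns and tag.csv has movieId and tag columns. Those column names, the build_documents helper, and the inline sample rows are assumptions made for illustration and are not confirmed by this README.

```python
import csv
import io
from collections import defaultdict

# Made-up rows standing in for movie.csv and tag.csv (column names assumed).
MOVIE_CSV = """movieId,title,genres
1,Movie A,Adventure|Animation|Children
2,Movie B,Adventure|Children|Fantasy
"""
TAG_CSV = """userId,movieId,tag
10,1,Pixar
11,1,toys
12,2,board game
"""


def build_documents(movie_file, tag_file):
    """Return {movieId: text}, where text joins a movie's genres and tags."""
    # Collect every tag applied to each movie, lowercased.
    tags = defaultdict(list)
    for row in csv.DictReader(tag_file):
        tags[row["movieId"]].append(row["tag"].strip().lower())

    # Genres are assumed to be pipe-separated in a single column.
    documents = {}
    for row in csv.DictReader(movie_file):
        genres = row["genres"].lower().split("|")
        documents[row["movieId"]] = " ".join(genres + tags[row["movieId"]])
    return documents


docs = build_documents(io.StringIO(MOVIE_CSV), io.StringIO(TAG_CSV))
for movie_id, text in docs.items():
    print(movie_id, text)
```

With real data, the io.StringIO objects would be replaced by the opened CSV files.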

scikit-learn's TfidfVectorizer is used for feature extraction, followed by dimensionality reduction.

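TF-IDF (term frequency-inverse document frequency): These are word frequency scores that highlight the more interesting words, i.e. words that are frequent in one document but rare across the rest.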
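
To make the scoring concrete, here is a small pure-Python sketch of TF-IDF. It uses the smoothed IDF variant that scikit-learn documents as its default, ln((1 + n) / (1 + df)) + 1, but skips the final L2 normalization, so the numbers will not match TfidfVectorizer output exactly. The tfidf function and the sample sentences are illustrative only.

```python
import math
from collections import Counter


def tfidf(documents):
    """Return one {term: score} dict per document."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))

    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)  # raw term counts within this document
        scores.append({
            # Smoothed IDF: ln((1 + n) / (1 + df)) + 1
            term: count * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return scores


docs = ["space adventure robot", "space comedy", "space romance comedy"]
weights = tfidf(docs)
for doc, w in zip(docs, weights):
    print(doc, "->", {term: round(score, 3) for term, score in w.items()})
```

The word "space" appears in every document, so it gets the lowest weight, while a word that appears in only one document, such as "robot", scores higher.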

The number of input variables in a dataset is referred to as its dimensionality.

A large number of input features can degrade the performance of machine learning algorithms.

So, dimensionality reduction techniques are applied to enhance the performance of the algorithms.
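
The feature extraction and reduction steps could be sketched as follows. This README does not name the reduction technique, so TruncatedSVD is an assumption here (it is a common choice for sparse TF-IDF matrices), as are the cosine similarity step, the n_components value, and the made-up sample texts.

```python
from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Made-up per-movie text standing in for combined genres and tags.
documents = [
    "space adventure robot",
    "space adventure alien",
    "romance comedy wedding",
    "romance comedy paris",
]

# Sparse TF-IDF matrix: one row per movie, one column per distinct term.
tfidf_matrix = TfidfVectorizer().fit_transform(documents)

# Assumed technique: project the TF-IDF features onto 2 latent components.
svd = TruncatedSVD(n_components=2, random_state=0)
reduced = svd.fit_transform(tfidf_matrix)

# Cosine similarity between movies in the reduced space.
similarity = cosine_similarity(reduced)

print(tfidf_matrix.shape, "->", reduced.shape)
print(similarity.round(2))
```

Movies whose texts share terms end up close together in the reduced space, so the most similar rows of the similarity matrix can serve as recommendations.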