Skip to content

Intenzo21/sentiment-analysis-app

Repository files navigation

For the purpose of the project, we considered a real-life balanced dataset consisting of 50,000 labeled gourmet food reviews from Amazon extracted from the work of McAuley and Leskovec. A food review is labeled as 1 (positive) if it received four or more stars and 0 (negative) otherwise.

We developed, trained, and tuned (Randomized Search) a number of Machine Learning (ML) models such as Logistic Regression (LR), Support Vector Machine (SVM), and Long Short-Term Memory (LSTM). These models were then evaluated using a holdout test set previously created from the whole available data. For the evaluation, we considered various metrics such as accuracy, precision, recall, ROC AUC score, etc. Finally, we compared the models to determine the best-performing one (LR) that was then deployed using Flask and Render.