Element Coffee Latte Art, Image Source:
Element Coffee requested an analysis on the reviews for their brick and mortar coffee shop in Northfields London. Data was extracted using Google Cloud and the Google My Business API. NLP was used to classify the sentiment of reviews as either 'NEGATIVE' or 'POSITIVE'. with most (91.4%) reviews scoring a 5 Star Rating. A Word Cloud of the reviews was created to display the most important words and concepts that occur in the reviews.
- Importing modules and libraries
- Accessing Data with Google Auth 2.0 Playground
- Save and Import Data to and from .csv file
- Data Cleaning
- Exploratory Data Analysis 1
- NLP Sentiment-Analysis with Transformers Library
6.1 Pre-Processing for NLP - Exploratory Data Analysis 2
- WordCloud Image Generation
8.1 WordCloud for All Reviews
- transformers_sentiment_analysis.ipynb: Jupyter Notebook file for review analysis
- recursive_review_data.csv: Data file containing review data from Element Coffee
- img: folder containing images generated by WordCloud
- Primary ETL was performed with the Google Cloud And Google My Business API.
- Once obtained Data was cleaned and pre-processed for NLP semantic analysis using DistilBert.
- Each review was assigned a positive or negative label and a score.
- Word Clouds were generated for all reviews using WordCloud library.
-
Over 90% of Element Coffee's 104 reviews had a 5 star rating.
-
The model was able to label 74/78 reviews as 'POSITIVE with an average positive score of 0.999.
-
Two reviews were correctly labeled negative out of a total of four negative reviews. Two falsely labeled 'NEGATIVE' reviews were ambiguous and difficult for the model to interpret.
-
WordCloud suggests that 'lovely', 'friendly' 'staff' 'warm' 'atmosphere' are top words that occur in the reviews.
Wordcloud generation of all reviews data
The Element Coffee business is doing well with their ratings. Particularly the atmosphere and friendlieness of the staff is highly revered. Customers use words that suggest a high degree of certainty about their review of the coffee shop.
Using these Data to inform the online store is paramount in the coming times of lockdown and COVID19. It is not certain that the brick and mortar store will remain open therefore feeding these results back into the online presence and website is the best way to proceed.