Restaurant ratings in Mexico by real consumers including additional information about each restaurant and their cuisines, and each consumer and their preferences.
- What can you learn from the highest-rated restaurants? Do consumer preferences have an effect on ratings?
- What are the consumer demographics? Does this indicate a bias in the data sample?
- Are there any demand & supply gaps that you can exploit in the market?
- If you were to invest in a restaurant, which characteristics would you be looking for?
Our data set consists of the following observations which include:
- Consumer_ID - Unique identifier for each consumer
- City - City where the consumer lives
- State - State where the consumer lives
- Country - Country where the consumer lives
- Latitude - Latitude where the consumer lives
- Longitude - Longitude where the consumer lives
- Smoker - Whether the consumer smokes or not
- Drink_Level - Whether the consumer is an abstemious, casual, or social drinker
- Transportation_Method - Whether the consumer transports on foot, by public transport, or by car
- Marital_Status - The consumer's marital status (single or married)
- Children - Whether the consumer has dependent/independent children or kids
- Age - The consumer's age
- Occupation - The consumer's occupation (student, employed, or unemployed)
- Budget - The consumer's budget (low, medium, high)
- Consumer_ID - Unique identifier for each consumer
- Preferred_Cuisine - Types of food the consumer prefers
- Consumer_ID - Unique identifier for each consumer
- Restaurant_ID - Unique identifier for each restaurant
- Overall_Rating - The overall rating by the consumer for the restaurant (0=Unsatisfactory, 1=Satisfactory, 2=Highly Satisfactory)
- Food_Rating - The food's rating by the consumer for the restaurant (0=Unsatisfactory, 1=Satisfactory, 2=Highly Satisfactory)
- Service_Rating - The service rating by the consumer for the restaurant (0=Unsatisfactory, 1=Satisfactory, 2=Highly Satisfactory)
- Restaurant_ID - Unique identifier for each restaurant
- Name - The restaurant's name
- City - The restaurant's city
- State - The restaurant's state
- Country - The restaurant's country
- Zip_Code - The restaurant's zip code
- Latitude - The restaurant's latitude
- Longitude - The restaurant's longitude
- Alcohol_Service - Whether the restaurant serves no alcohol, wine & beer, or a full bar
- Smoking_Allowed - Whether any smoking is allowed, including in the bar or in smoking sections
- Price - The restaurant's price (low, medium, high)
- Franchise - Whether the restaurant is a franchise
- Area - Whether the restaurant is in an open or closed area
- Parking - Whether the restaurant offers any sort of parking (none, yes, public, valet)
- Restaurant_ID - Unique identifier for each restaurant
- Cuisine - Types of food the restaurant serves
- Get data -> More -> All -> Folder -> Connect -> Path leading to the folder dataset -> Click ok
- Click on transform data -> Duplicate the file -> Click on Binary to expand the dataset (Repeat the set for the no of datasets)
- Remove blanks
- make the first row a header
AgeGroup =
SWITCH(
TRUE(),
consumers[Age] <= 18, "Children and Adolescents",
consumers[Age] <= 30, "Young Adults",
consumers[Age] <= 45, "Adults",
consumers[Age] <= 60, "Middle-aged Adults",
"Seniors"
)
Service_Rating_Category = SWITCH(
TRUE(),
ratings[Service_Rating] = 0, "Unsatisfactory",
ratings[Service_Rating] = 1, "Satisfactory",
"Highly Satisfactory"
)
Overall_Rating_Category = SWITCH(
TRUE(),
ratings[Overall_Rating] = 0, "Unsatisfactory",
ratings[Overall_Rating] = 1, "Satisfactory",
"Highly Satisfactory"
)
Food_Rating_Category = SWITCH(
TRUE(),
ratings[Food_Rating] = 0, "Unsatisfactory",
ratings[Food_Rating] = 1, "Satisfactory",
"Highly Satisfactory"
)
- What is the distribution of consumers by city and state?
Most of the population is from San Luis Potosí, San Luis Potosí, while the second largest group is from Cuernavaca, Morelos.
- How does the age distribution of consumers vary by state?
In all three states, young adults under 30 years of age form the majority of the population. In two states, San Luis Potosí and Morelos, the second largest demographic consists of seniors, aged over 60 years.
- What percentage of consumers are smokers or non-smokers in each city?
The vast majority of consumers from all four cities are non-smokers, with Jiutepec having a 100% non-smoking population. In Cuernavaca city, smokers make up 25% of the population.
- How common is parking availability at restaurants in different cities?
The majority of restaurants across all cities lack parking facilities, while some have parking available. In San Luis Potosí and Cuernavaca, two restaurants offer valet parking, while public parking is available in San Luis Potosí, Ciudad Victoria, and Cuernavaca.
- How does the availability of parking correlate with restaurant price levels across different cities or states?
- What is the distribution of restaurants by city, state, or country?
- How do restaurant franchises compare to non-franchises in terms of consumer ratings
- What are consumers' preferred cuisines based on their demographic profiles?
- How does the type of alcohol service offered vary by region?
- What transportation methods are most commonly used by consumers in different regions?
- How does the presence of alcohol service influence consumer ratings in different regions?
- What percentage of restaurants allow smoking in each city or state?
- What are the common occupations of consumers in different regions?
- How does the drink level (abstemious, casual, social) vary across different cities or states?
- How does marital status correlate with smoking or drinking habits?
- Is there a relationship between consumers' occupations and their budget levels?
- What are the top 5 restaurants by food rating?
- What are the top 5 restaurants by service rating?
- What are the top 5 restaurants by overall rating?