Is the difference between ratings across different platforms for Guardians of Galaxy the norm? And are the differences in ratings systematic, or are the ratings largely unrelated?
To shed light on the question, we scraped the list of American films on Wikipedia produced between 1950 and 2020. (See here, for example.) We then scraped the Google Knowledge Panel using APIfy for each of the movies. (The scripts for scraping the data are posted here.)
In all, we have 16,319 movies and movie ratings from 19 platforms. For seven platforms, however, we have less than 100 movie reviews. We subset our initial analysis on 48,613 (average) ratings from the 12 platforms for which we have ratings for 100 or more movies.
We estimate the correlation between the average platform ratings. (See below for the (Pearson's) correlation matrix.) The median correlation between average ratings of two platforms was .37. (The median Spearman's correlation was .35.)
Movies with the largest difference between Rotten Tomatoes and IMDb Rating.
title | rotten_tomatoes_rating | IMDb_rating |
---|---|---|
After Last Season | 92 | 1.6 |
The Catered Affair | 17 | 7.5 |
Sparkle | 10 | 6.9 |
Only the Strong | 8 | 6.7 |
The Life of David Gale | 19 | 7.6 |
The Vanishing of Sidney Hall | 11 | 6.9 |
I'll Never Forget You | 14 | 7.1 |
The Professor | 10 | 6.7 |
Gang of Roses | 75 | 2.4 |
Movies with the largest difference between Rotten Tomatoes and Google Rating.
title | rotten_tomatoes_rating | p_google_likes |
---|---|---|
After Last Season | 92 | 6 |
Faithful | 7 | 95 |
Supercross | 5 | 92 |
Material Girls | 4 | 91 |
Only the Strong | 8 | 94 |
Big Mommas: Like Father, Like Son | 5 | 91 |
The Covenant | 4 | 90 |
McHale's Navy | 3 | 89 |
Equalizer 2000 | 11 | 96 |
Grind | 8 | 93 |
To see how movies have been rated over time, we plotted the average of a movie-quality index over time (see below). To build an index of movie quality, we rescaled the ratings from the three platforms for which we have the most data---IMDB, Rotten Tomatoes, and Google---to lie between 0 and 1 and averaged it. The figure shows that movie quality steadily declined between 1950 and 2000. And since then, there has been a modest uptick.
But if you separate the platform ratings, you get three different trend lines. Google ratings are largely flat over time, with a modest decline since 2010. Rotten tomatoes ratings follow a very different trend with a sharp decline between 1950 and 2000 and a sharp uptick since 2000. IMDB ratings follow yet another trend---they decline till the 1980s and gently slope upward after that.
The differing over-time trend lines across platforms are disconcerting. But to further understand the issue, we plot correlations across platform ratings by decade (see below). The correlation between Rotten Tomatoes Ratings and IMDB ratings is relatively similar across time. But the other two trends in correlations are much more volatile. The correlation between Rotten Tomatoes and Google changes sizably, from .09 in the 1950s to .36 in the 1960s. The correlation between IMDB and Google ratings swings even more, from .22 in the 1950s to .65 in the 2020s. Overall, it seems the correlation between IMDB and Google ratings becomes stronger over time.
If we check correlation across platforms by maturity ratings, we see some variation between Rotten Tomatoes and Google, but overall it doesn't seem that maturity rating have much explanatory power.
Next, we investigate how the correlation between ratings across platforms varies by genre. We limit ourselves to the top 10 most popular genres. We see fairly sizably variation in the correlation between IMDB and Google, with the correlation varying between .29 and .71. The range of correlations between Rotten Tomatoes and Google is also very wide---.04 to .48.
For IMDB, Google, and Rotten Tomatoes, here's the summary of the principal components.
PC1 | PC2 | PC3 | |
---|---|---|---|
Standard deviation | 1.3951 | 0.9045 | 0.48538 |
Proportion of Variance | 0.6487 | 0.2727 | 0.07853 |
Cumulative Proportion | 0.6487 | 0.9215 | 1.00000 |
I estimated the correlation between ratings for movies that earned the most (unadjusted for inflation) on the domestic box office. (Presumably variation in ratings for top 100 is smaller than if we had the entire sample. And that means the correlation is estimated over a small range. It is also likely that lots of people rated these movies so the measures are likely precise.)
The correlation between metacritic and rotten tomatoes is just .48. The correlation between imdb and google ratings is just .33. The highest correlation is surprisingly between rotten tomatoes and imdb ratings.
imdb | rotten | metacritic | ||
---|---|---|---|---|
imdb | 1.00 | 0.74 | 0.36 | 0.33 |
rotten | 0.74 | 1.00 | 0.48 | 0.23 |
metacritic | 0.36 | 0.48 | 1.00 | 0.18 |
0.33 | 0.23 | 0.18 | 1.00 |
Released under CC BY 2.0.