The Super Bowl is a hallmark of American culture. Every year close to 100 million individuals tune in to watch the three hour extravaganza. Although people tune in for the action on the field, it is not the only aspect of the spectacle that garners attention. A lot of viewers are purely interested in what happens in the time when the game is not played, the advertisements. This year commercials reached a new record level of expensive, with a thirty second advertisement costing $7 million on average. We decided to analyze a Superbowl ads dataset, hoping to understand what factor influences advertisement. Using this dataset, we answer three main research questions:
Through these three questions and the dataset, we analyze Superbowl ad data from this century.
The data was collected in 2021 for the twenty previous years of Super Bowl commercials. The selection of observations was done based on the ten brands that aired the most spots over the given time period resulting in 233 observations. Researchers watched the advertisements taking note if certain criteria were present including if it was funny or involved danger. In terms of the variables, we cleaned the data to include the year, brand, all seven themes, view count, like count, and dislike count. We added a new variable that groups years based on half decade in order to analyze trends. Half decade was chosen over decade as the data is only from 2000 to 2020.
For several reasons, we narrowed down the 233 observations to 103. First, we removed any observations that had a record on N/A for any criteria or quantitative variable such as like or view count. The second reason we narrowed our data also points to our largest limitation. As with any video posted on YouTube, the view, like, and dislike count can increase every day. Therefore, taking a snapshot of the data from three years ago, does not represent the videos where they are today. Similarly, some videos have exponentially higher counts which can occur within days or hours. Some videos had counts that were unreasonably low which had the potential to skew the data. Therefore, the decision was made to limit our dataset to observations within the range of 1 million and 100,000 for both view counts.
## # A tibble: 5 × 17
## year brand funny show_pr…¹ patri…² celeb…³ danger animals use_sex view_…⁴
## <dbl> <chr> <int> <int> <int> <int> <int> <int> <int> <dbl>
## 1 2018 Toyota 0 0 0 0 0 0 0 173929
## 2 2006 Bud Light 1 0 0 0 1 1 0 142310
## 3 2020 Coca-Cola 1 0 0 1 0 1 0 304254
## 4 2010 Hynudai 0 1 1 0 0 0 0 68458
## 5 2007 Budweiser 1 1 0 0 0 1 1 184689
## # … with 7 more variables: like_count <dbl>, dislike_count <dbl>, title <chr>,
## # numThemes <int>, half_decade <chr>, like_ratio <dbl>, dislike_ratio <dbl>,
## # and abbreviated variable names ¹show_product_quickly, ²patriotic,
## # ³celebrity, ⁴view_count
In order to understand the relationship between view count and like count for commercial popularity, we used a scatter plot. The majority of the observations are concentrated in a very low like count range. Although conclusions cannot be made without significant testing, there is a strong positive correlation between like and view counts. Therefore, in part of determining the popularity of a video based on the like count, view count does provide some information as to popularity. However, like count and dislike count have a stronger correlation. This could potentially point to the fact that interaction of any kind has more of a relationship than like count indicating popularity.
## view_count like_count dislike_count
## view_count 1.0000000 0.8364820 0.6925321
## like_count 0.8364820 1.0000000 0.9135665
## dislike_count 0.6925321 0.9135665 1.0000000
This figure illustrates the relationship between the different themes and how popular they are. For this mosaic plot, the x-axis are the seven different themes, and the y-axis are the ratios between likes and views. With the majority of videos falling into the 0.0 to 0.003 bin, we still notice that some themes receive a higher proportion rating than others. Specifically, videos with celebrities or using patriotism are noticeably more likely to have a higher proportion than 0.003.
To further confirm this, we use a Pearson Residual mosaic plot. Because the residuals are greater than 2, we note that the “celebrity” theme has a higher proportion of videos that land in the 0.006 to 0.009 range than expected. This is somewhat expected, as we inferred similar results from the previous mosaic plot. No other pairings had significantly higher or lower residuals than expected.
This figure shows the distribution of commercials by the number of themes used. We find that the most popular amount of themes is three, followed by two and then four. We also found that zero commercials used all seven themes, and two commercials used zero themes. We also looked at the average number of views per group, and found that the most successful commercials in this regard used four themes, followed by five and two themes. It is also worth noting that the most popular number of themes (three), only had an average amount of views - so maybe future commercials should consider using four themes.
This figure shows how often each theme was used each year, as a proportion of all of the commercials that came out that year. It’s clear that the most popular themes throughout the 2000s were commercials that were funny and showed the product quickly. Commercials that were patriotic, and used celebrities were generally the least popular. It is worth noting that there were spikes in the proportion of funny commercials that showed the product quickly in 2003 and 2013.
After observing that ‘funny’ and ‘show_product_quickly’ were the most popular themes, we now focus on how often those specific two themes were used over the span of the dataset. We see that they are often used a similar amount each year, with the exception of 2004 (twice as many funny commercials) and 2014 (0 commercials were funny and 3 showed the product quickly). It is plausible to assume that commercials intending to be funny perhaps would not show the product in the opening seconds of the video, but this data shows otherwise.
We wanted to learn more about whether or not brands stick to the same themes over the years, which suggests that we should examine the relationship between themes and brands over the time period of the dataset. After conducting an analysis of which themes present the greatest differences over the years, we found that celebrity was a particular theme that experienced a lot of drastic change, especially in the last five years, which was in line with our personal expectations of watching the Super bowl. Thus, we decided to explore how celebrity involvement in Super Bowl ads change over time by brand. (To see the rest of the themes, take a look at the Appendix.)
The above graph suggests that over the years, brands have started to display celebrities more often in their Super Bowl ads. In particular, we see that for Bud Light, Budweiser, Coca-Cola, Doritos, Hyundai, and Toyota, they only started using celebrities to market from 2015 onwards. Why do brands do this? Is it because brands with celebrities get more recognition? In order to test for significance, we conducted a two-sample t-test to understand whether ads with celebrities get more views than ads without celebrities. Our findings are illustrated in the table below.
##
## Welch Two Sample t-test
##
## data: view_count by celebrity
## t = -0.84041, df = 39.989, p-value = 0.4057
## alternative hypothesis: true difference in means between group 0 and group 1 is not equal to 0
## 95 percent confidence interval:
## -973119.9 401519.1
## sample estimates:
## mean in group 0 mean in group 1
## 592300.6 878101.0
The above table suggests that there is no significant difference in view count between ads with celebrities and those without. Consequently, if total view count is not significantly affected by the appearance of a celebrity, what about the number of likes? Thus, this got us thinking about whether or not this seemingly intentional shift to include more celebrities led to a greater enjoyment of the ads, as measured by like_count divided by total view_count in order to mitigate differences in youtube engagement over the years.
The above graph suggests that there is not really a huge difference between whether celebrities are shown or not, as evidenced by how closely the red and blue lines track each other. The huge jump in celebrity = true we saw in the year 2002 was a viral ad by Britney Spears and Pepsi which garnered a like ratio of 0.015, which is much higher than the average of 0.0034. Thus, if there is not a huge difference in like ratio, we wanted to test whether a dislike ratio would produce a difference. Therefore, we produced a similar graph but with the dislike ratio plotted.
The above graph suggests that on average, an ad without a celebrity shown has a higher dislike ratio than one with a celebrity shown. This effect is most prominent since 2017, as shown by the big divergence in the red line and the blue line while the slope remains constant.
Our research yielded many interesting conclusions. Regarding the first research question, we found that an ad having three themes was the most popular, but more successful ads have four themes. Regarding the second research question, we discovered that the themes funny and show-product-quickly were the most popular, and that the two themes appear together more often than one would expect. For the research question, we explored how brands used the theme celebrity over the years. We found that celebrities were used more in recent years (post 2015), but that ads with celebrities did not receive more views than ads without. In addition, the proportion of people who liked an ad did not differ substantially when a celebrity was shown and not shown, but people seemed to dislike ads more when a celebrity was not shown vs. when it was not. Overall, our research into Superbowl ads helped us better understand what type of themes to include, and how to better generate reviews from viewers.
While this dataset gave us a good introduction to analyzing Super Bowl commercials, specifically their themes and what makes them successful - there is a lot more research that can be done on this subject. To begin with, it would be worthwhile to see other themes recorded in future datasets, for example - does the commercial relate to football? Do they use popular trends from the year? There are many possibilities in this regard.
When measuring the success of the commercial, we think it would be relevant to compare the views of the commercial to the overall popularity of the football game itself, as we believe that more popular Super Bowls will generally garner more views for their commercials. Additionally, it would be valuable to see how many views the YouTube video gets in the first 24 hours of publishing (as that is typically when the commercials are still freshest in everyone’s mind) instead of to the date of data recording. This would help standardize the counting of views for each commercial.
Additionally, it would also be valuable to have data on commercials outside of the 10 brands in the dataset. Only having 10 brands limits the amount of data points we can work with, and every brand did not have a commercial each year. Being able to see data on other, potentially smaller, brands would open up more opportunities for analysis - as we would be able to better answer the questions of how themes are used and what makes a commercial popular with more data points. We would also be able to compare how smaller brands, who are perhaps making their first Super Bowl commercial, fare in making popular commercials compared to the more established brands.