This research paper aims to examine the world happiness dataset, which includes the following variables:
The data is a continuous report that ranks 155 countries by their happiness levels. The report is used to help inform policy-making decisions for countries by governments, organizations, and civil society. The measurements above are being used effectively to assess the progress of nations and to help explain the personal and national variations in happiness.
With this dataset, we are exploring three research questions:
The Covid-19 pandemic took the world by surprise and continues to cause chaos in many different aspects of our lives with new variants. Since this data set pertains to happiness scores for all the countries within it, we seek to understand how the average happiness score (aka, Life Ladder in the dataset) has changed over time with the plot below.
This time-series analysis conveys the average happiness scores for all regions of the world from 2005 to 2020. Average happiness scores are colored by region. A dotted line is added at year = 2019 to focus in on the trend that exists from 2019-2020. A key takeaway for this plot is that many regions of the world had fluctuating average happiness scores from 2005-2019, which makes sense because there are many different events that can take place in certain regions to impact the overall average happiness score for a given year. Overall, average happiness ranges from 4 - >7.0.
For 2019-2020, the Middle East and and North Africa’s scores increased, the Commonwealth States’ scores increased, Central and Eastern Europe’s scores also sloped upward. South Asia’s average happiness score increased and so did Sub-Saharan Africa’s score. East Asia’s score also continued to increase which is interesting because East Asia was put in the hot seat by the media for Coronavirus, and so one would think that their scores would go down due to the negative PR. Southeast Asia’s average happiness score went down, and so did Latin America and the Carribean’s scores.
We observe that North America and Western Europe have higher average happiness scores than the rest of the world on average, and during 2019-2020, they mostly flat-lined and continued in the same direction they were going in before. Some regions’ average happiness scores decreased, whereas many other countries average happiness scores actually increased, interestingly enough. This chart fits into our research question of how average happiness has changed over time, especially over course of the pandemic.
Our research question aims to find if happiness levels are the same throughout the world. Ideally, it would comforting to know that happiness is homogeneous across the Earth and no matter where one is they can find happiness just as likely as another place. Thus, we will study further into whether happiness levels are the same across region.
To start off, we want to examine an areal map of the world and color it by the average happiness score of countries.
From the areal plot above, it seems like we can conclude that North America, South America, Europe, Australia, and New Zealand have higher average happiness scores compared to other regions in the world. The majority of parts of Asia and Africa appear to have lower average happiness scores compared to other regions.
We may look at other plots to get a better idea of the distribution of average happiness score given region instead, and then run test of equal mean and equal variance for the regions.
From the graph above, we conclude very similiar findings to the areal map. Having a higher average happiness score than other regions we find North America and ANZ, Western Europe, and Latin America and Caribbean. Having a lower average happiness score than other regions we find South Asia and Sub-Saharan Africa. At the same time, this graph can tell us more information about the regions and we also find that information about range, mean, outliers, and quartiles. We won’t dive into all regions, but we note that North America and ANZ is skewed left and has a very small range of average happiness levels, Middle East and North Africa ranges the most out of the regions, and Sub-Saharan Africa, South Asia, and Latin America and Caribbean are the regions that have outliers.
Thus, through our visualizations we can conclude that happiness levels are not the same across region. In other words, there are certain regions that are more prone to have higher happiness levels or lower happiness levels that are significantly different from other regions.
We were able to answer our research questions above. From the following series of graphs and analysis, we were able to identify that happiness levels differ by region with our boxplot analysis. We were also able to find out that in general more underdeveloped regions have lower happiness levels (by life ladder score) and more developed regions have higher happiness levels, as highlighted in our analysis of the faceted density plot between life ladder and log GDP per capita. Lastly, we also found that countries with more corruption and oppression have populations with lower happiness levels, expressed through the regression models.