Data was gathered from eia (Energy Information Administration) and USGS (United States Geological Survey) APIs (Application Programming Interfaces), and combined into a dataset. The data was filtered to contain only wind farms and turbines located in the contiguous United States. Variables utilized from USGS include latitude, longitude, elevation, retrofit status, and year retrofitted. Variables utilized from eia include wind farm megawatt production, geographic region, and wind farm capacity.
This project asks the questions:
What is the landscape of wind farms and turbines for the contiguous United States?
What is the landscape of megawatt production across the contiguous United states?
Are there seasonal trends by state or region?
Are there significant increases in megawatt production with retrofitting?
To investigate the landscape of wind farms in the contiguous United States, a heat map was constructed showing wind turbine locations.
This map shows a large concentration of wind turbines in California (CA) and Texas (TX) with a smaller, though still noticeable concentration at the borders, of Minnesota, Iowa, South Dakota, and Nebraska; a smaller concentration at the borders of Illinois and Indiana as well as the borders of Washington and Oregon; and an even smaller concentration in Wyoming and Colorado.
As turbines rely heavily on wind currents, and one would expect these currents to vary with elevation, a histogram showing the distribution of turbine elevation on the density scale was constructed.
This histogram shows that few turbines are installed at sea level. The data is right skewed, indicating that most turbines are higher than sea level, but not particularly high, with a mean and median of 687 m and 561m, respectively. The peak of the density closely matches the elevation of Pittsburgh (372.8 m). Most turbines are located at elevations higher than Los Angeles, CA (93 m) and Dallas, TX (131 m). A KS test returns a p value of approximately zero, and we can, therefore, reject the null hypothesis and conclude that the histogram does not follow a normal distribution.
We want to further investigate the time series data between California and Texas, and motivate whether their trends apply to all turbine locations.
In particular, we are researching: Do different areas of the United States follow the same trends for megawatt production? If not, what seasonality trends can we find out about each state?
We notice that across all time periods, Texas consistently outperforms California in megawatt production. However, their trends appear to be fairly similar, where each state’s peaks and troughs occur around the same time.
Let’s create a seasonal decomposition graph of this dataset to explore their seasonal trends. More particularly, we will separate the two lines by plant state locations.
We see that Texas and California follow similar trend patterns. However, it may be helpful to notice that the global trend of megawatt production in Texas in 2000-2005 saw a drastic increase in production compared to California.
Another difference we see is between their seasonal trends, where the trend peaks of Texas are slightly before the peaks of California. This may relate to the geographic location of the two states, which we will explore in the next section.
After comparing the trends of California and Texas windfarm productions, we want to learn more about windfarm productions, and investigate which factors are related to windfarm productions.
In this analysis of windfarm productions, we use the data on the annual productions in 2019 as it includes the most windfarms. First, we plot a scatterplot of the windfarm production overlayed on top of a map.
In the plot above, the size of the circle representing each windfarm is proportional to their annual production in 2019. As we can see, the windfarms with high productions are mostly in the South, particularly the states of Texas, Oklahoma, and Kansas. Some windfarms in some northern states also have relatively high productions, namely Iowa and Illinois. In comparison, although there are many windfarms in the West (Washington, California) and the Northeast, in general they have lower productions.
Aside from the state and location of the windfarms, there may be other factors that affect windfarm productions. We look at the variable p_year
, which is the year that the turbines in the windfarm became operational and began providing power. In the side-by-side boxplots below, we grouped the year into 4 categories, as well as the states into 4 regions.
The plot above confirms that Southern and North Central states have higher productions in 2019 than Western and Northeastern states. For Southern and North Central states, windfarms that became operational in later years have higher productions than windfarms that began operations earlier. This does not seem to be the case for Western and Northeastern states, as windfarms with different starting years seem to have a similar distribution of production.
A turbine is considered partially retrofitted if it has undergone rotor and/or nacelle replacement or installation of other new equipment. The common goal of retrofitting is to extend the lifespan of the turbine and improve efficiency and energy production if the turbine is fitted with a larger rotor diameter. In this section of the report, we are interested in turbines that have indicated to be retrofitted to answer teh following questions:
Are there energy production trends of turbines that have been retrofitted?
Are turbines being retrofitted with equipment different from turbines that have not been retrofitted that would allow for increased energy production beyond the capabilities of their original construction? Can we observe this difference by clustering turbines by attributes such as rotor diameter, sweep area, hub height, turbine height?
Around 8% of the turbines in the U.S. Wind Turbine Database is known to have been partially retrofitted after the turbine was constructed by the manufacturer.
We investigate the effects on energy production of retrofitting turbines. Specifically, we are interested in whether after retrofitting, energy production increases. Out of 1167 wind farms from the dataset, 75 wind farms have retrofitted turbines.
To investigate changes in energy production pre and post retrofit, we chose to visualize the energy production the top six wind farms with the most retrofitted turbines. A vertical line indicates the first day of the year which turbines on the farm were retrofitted.
As shown by the dark line displaying a rolling average of monthly energy production, retrofit occurs after a period of decline in production and production post retrofit increases previous levels. This is suggests that retrofit is a means of prolonging turbine energy production lifespan. Our question now is whether the retrofitted new equipment is larger or more powerful than those from original construction such that retrofitted turbines may produce more energy than it’s original production capacity.
Now we explore the physical characteristics of the retrofitted and never retrofitted turbines, specifically the quantitative variables documenting energy production capacity (t_cap), hub height (t_hh), rotor diameter (t_rd), rotor sweep area (t_rsa), and turbine total height (t_ttlh). We are interested to see if retrofitted turbines are being fitted with replacements different from never retrofitted turbines that would allow for increased energy production such as larger rotors.
From the PCA plot of the first two principal components that together explain over 95% of the variation in the physical characteristics of turbines, we notice that retrofitted turbines and never retrofitted turbines are not well clustered these principal components. Specifically, the majority of partially retrofitted turbines appear to fall along an increasing line against the vectors of coefficients of the variables. From the coefficient vectors, we see the variables are highly correlated with each other, especially turbine capacity (t_cap) with rotor sweep area (t_rsa) and rotor diameter. This indicates that turbines with higher energy output capacity have larger rotors and are also taller as capacity is also highly correlated with turbine total height (t_ttlh) and turbine hub height (t_hh).
Since retrofit is not well clustered from our principal component analysis of turbine physical attributes, we suspect that retrofitted turbines’ physical attributes are not very different from those turbines which have not been retrofitted. We interpret this to potentially indicate that turbines are not being retrofitted with equipment which are different from their original parts and that here may be other factors that better cluster retrofit such as the age of the turbine.
This project examined the location, elevation, and megawatt production of wind turbines and wind farms in the contiguous United States. It was concluded that the majority of wind turbines are located in the states California and Texas, at elevations of peak density closely matching the elevation of Pittsburgh (372.8 m). Further investigation into whether topography, which could influence accessibility of turbines for inspection and maintenance, could play a role in turbine farm location.
An analysis of megawatt production showed Texas has a consistently higher megawatt production than California.
The windfarm productions are correlated to their geographical regions, as well as the year they became operational.
Retrofitting wind turbines is known to be a way to extend the lifespan and increase energy production of the turbine. From our analyses, we conclude that turbines are commonly retrofitted during a period of energy production decline, and post retrofit energy production does climb back to previously stable levels. However, considering it is well known that physical characteristics such as rotor diameter/sweep area and height are important factors determining turbine energy production, principal component analysis on energy production capacity (t_cap), hub height (t_hh), rotor diameter (t_rd), rotor sweep area (t_rsa), and turbine total height (t_ttlh) did not cluster retrofit very well. We interpret this as retrofitted turbines are not being fitted with new equipment such as larger rotors or hubs that distinguished them from turbines that have not been retrofitted. Instead there may be other factors that better cluster retrofit such as turbine age. Perhaps this indicates that retrofitting is primarily to revive turbines to their newly constructed state rather than attempt to improve upon their original construction, or other variables in the turbine’s construction could prevent retrofitting with larger equipment that would allow for increased capacity for energy production from their original construction.
There are many questions on retrofitted wind turbines that investigating this one variable can expand to its own research project.
Our data capture the turbines in a single moment of time which makes analyzing pre and post retrofit beyond the monthly energy production difficult. If annual inspection records exist with data on turbines physical characteristics, we would like to compare the same attributes used in our PCA analysis to explicitly investigate the changes after retrofit. With further data on the retrofitted components, we may investigate the monetary cost and energy production return of retrofit, whether it is an investment more wind farms beyond the Midwest/Great Plains and Texas regions should consider to improve their turbine efficiency and lifespan.
Jossi, Frank. “Industry Report: Midwest and Great Plains Lead Wind Energy Expansion.” Energy News Network, 19 Apr. 2017, https://energynews.us/2017/04/19/industry-report-midwest-and-great-plains-lead-wind-energy-expansion/.