Research Article | Open Access
Patrick Cavanagh, Corey Lang, Xinran Li, Haoran Miao, John David Ryder, "Searching for the Determinants of Climate Change Interest", Geography Journal, vol. 2014, Article ID 503295, 11 pages, 2014. https://doi.org/10.1155/2014/503295
Searching for the Determinants of Climate Change Interest
A meaningful CO2 mitigation policy is unlikely at the national level in the United States. What is currently happening and what is much more likely to occur in the future are city and regional level efforts of mitigation and adaptation. This paper aims to understand the geographic and socioeconomic characteristics of metropolitan areas and regions that lead to engagement with the issue of climate change. We use geographically explicit, internet search data from Google to measure information seeking behavior, which we interpret as engagement, attention, and interest. Our spatial Hot Spot analysis creates a map that potentially could be harnessed by policymakers to gauge mitigation support or adaptation potential. The results of our multivariate analysis suggest that socioeconomic factors are the strongest determinants of search behavior and that climate and geography have little to no impact. With regard to political ideology, we find evidence of a nonlinear, inverse-U relationship with maximum search activity occurring in metropolitan areas with a near even political split, suggesting that parity may be good for engagement.
The United States has failed to achieve meaningful carbon legislation to limit emissions and mitigate damage. For many Americans and certainly politicians, the whole idea of climate change remains contentious, and therefore a path leading to meaningful legislation is difficult to envision. Despite the stasis at the federal level, cities and regions are finding the political will to take action on both mitigation and adaptation. Nine states in the Northeast have banded together in the Regional Greenhouse Gas Initiative (RGGI) and implemented a cap-and-trade policy to reduce emissions from electricity generation. Similarly, California recently enacted the Global Warming Solutions Act (AB 32) that will use market mechanisms to reduce all emissions to 1990 levels by 2020. New York City’s PlaNYC and Boston’s Climate Action Plan focus both on reducing emissions and improving resiliency in the face of rising sea levels and other climate change impacts. These local and regional actions are consistent with the ideas of Kahn , who argues that cities will drive adaptation in an effort to compete for footloose residents. Thus, understanding the characteristics of metropolitan areas that make them more or less likely to engage in climate change policy is critically important.
This paper seeks to add to that understanding by utilizing internet search data from Google Trends. Specifically, we examine how the characteristics of metropolitan areas, in terms of demographics (income, education, and race), political preferences, geography, climate, and carbon intensity of their economy, determine search volumes for climate change-related words. We downloaded data from Google Trends on search activity for every media market (roughly equivalent to a metropolitan area) in the Continental United States (a total of 205 media markets). Data were gathered for the search terms “global warming” and “climate change,” as well as “weather” which has different motivations behind it but still serves as a baseline comparison and validity check for the data and methods.
Traditionally, researchers have gathered data on beliefs and interest level through survey techniques, and there are two reasons why internet search data offer benefits over survey data. First, a survey is a stated preference, where the respondent may or may not tell the truth. In contrast, internet searches represent a type of revealed preferences. The internet is private and anonymous, so there is no reason for searches to conform to existing notions of what type of person (e.g., conservative) should believe in climate change. Second, when a person uses Google to seek information, this is active engagement with a subject, which is far more important for learning about and adapting to climate change than answering a yes or no question when someone calls you on the phone. The major drawback of internet search data is that they do not represent a clear opinion. We discuss in more detail how we view inference using Google Trends and how others have used this tool in Section 2.
Our first analysis examines search activity in a spatial context and seeks to determine if there are clusters of high or low search interest. The intent is to begin a process of providing policymakers and environmental activists with a map indicating regions where mitigation support may be high or low or prudent adaptation likely or unlikely to occur. The results indicate that clusters of high interest, or hot spots, are the Northeast, the Great Lakes region of the Midwest, California, and the Pacific Northwest. The Northeast and California already have enacted mitigation policies (RGGI, AB32), which generates confidence in the method and data and further may suggest that the Great Lakes region and Pacific Northwest are amenable to similar policies. The South, stretching from Texas to Alabama, is a cluster of low interest. This finding is perhaps surprising given the regency of Hurricane Katrina and projected increase in hurricane intensity and may suggest that education and outreach efforts are needed in this area.
Second, we analyzed the determinants of search behavior using multivariate regression with socioeconomic, geographic, and climate characteristics as independent variables. Our results suggest that socioeconomic factors are much stronger determinants of climate change-related searches than climate, geography, and CO2 emissions. Specifically, we find that climate change-related searches are positively correlated with a metropolitan area’s proportion of college graduates, proportion of Democrats, and proportion of whites and are negatively correlated with a metropolitan area’s average income. Metropolitan areas on the coast are no more or less likely to seek information about climate change. Similarly, a hot or cold climate and high or low CO2 emissions per capita do not affect information seeking. Our findings contrast with those of O’Connor et al. , who argue that demographic factors tend to have modest relationships with climate change perceptions. Prior research has found a strong link between personal experiences of local weather and attitudes towards the legitimacy of climate change [3, 4]. While we do not examine the effect of weather, our results add to this literature by showing that climate does not appear to influence search behavior.
Given the prominence of politics in debates over climate change, we examine in some detail how political affiliation affects internet search behavior related to climate change. While many surveys have shown a substantial gap between Democrats and Republicans in the belief in climate change and willingness to take action , we speculated that this result could be due to posturing to project what political elites would expect respondents to say and using private search data could reveal a different pattern. We perform a semiparametric analysis and find that, for the search term “global warming,” search volume is greatest when metropolitan areas are more centrist and the peak occurs around 43% Democrat, with lower search volumes as metropolitan areas become more Partisan. For the search term “climate change,” however, search activity increases as metropolitan areas lean more Democratic over the whole range of the political spectrum. These results first suggest that “climate change” may be more of a branded political word than “global warming.” Second, with “global warming,” our results may indicate that metropolitan areas that are more moderate are seeking information and engaging in healthy debate.
The main contribution of this paper is to add to the strong literature that seeks to understand beliefs of Americans about climate change [6–9]. By using a novel data set, we are able to have far more geographic detail than typical surveys, which allows us to expand on the spatial variation in climate change perception found by Hamilton and Keim  and examine correlation with a broader range of geographic and socioeconomic factors. Further, the data represent revealed preferences and active engagement, which are far more valuable when assessing learning and adaptive capacity.
2. Benefits and Considerations of Internet Search Data
For many Americans, the internet has become the primary means of gathering information. Among American internet users across all age groups, 90% utilize search engines for information seeking, and 83% of these queries are entered using Google Search . Search engines give people considerable freedom to search for myriad topics without many of the limits and barriers associated with traditional information seeking methods.
Google Trends data are a measure of public attentiveness, engagement, interest, and issue salience. Internet search data have been used to gauge religious influence within specific geographic areas , determine if ballot measures were more likely to be voted on by voters , measure Hurricane information seeking behavior following tropical storm events , predict unemployment and sales , and forecast cinema admissions . Internet searches have also been popular in the field of epidemiology, particularly in tracking the spread of the flu virus [17, 18].
Beyond measuring interest, internet search data are a revealed preference alternative to traditional survey data and can be used to gather information on taboo or controversial topics. Stephens-Davidowitz  uses Google Trends to measure racist attitudes and improve 2008 presidential voting predictions. Most related to the present research, Kahn and Kotchen  and Lang  use Google data to investigate interest in the politically charged topic of global warming. Kahn and Kotchen  find that online interest in global warming declines as unemployment rises, and interestingly the decline is larger for Democrats. Lang  finds that weather fluctuations cause people to seek information about climate change and that different types of weather events lead to different responses in areas of different political ideologies and average educational attainment.
Additionally, Google Trends provide reliable, near real-time updates [22, 23], which is prohibitively costly for the vast majority of surveys. Further, internet search data can more effectively gather indicators from underrepresented groups  and avoid interviewer bias .
Most of the research to date using Google Trends has focused on time variation and prediction. While there are advantages to a time series or panel data approach, often a substantial portion of variation in search is not used. For example, for the search term “global warming” over our sample period, 65% of variation is explained by media market and month-year fixed effects (dummy variables) alone, leaving only 35% of variation to identify a relationship. The purpose of this paper is to investigate how geography and spatial relationships may explain some of that variation.
Despite the many advantages over surveys, internet search data are fundamentally more different than opinions and do not necessarily correlate with support for an issue.1 For example, not everyone searching “climate change” would support a mitigation policy, but the data can still provide useful trends . Search data is an indicator of the area’s willingness to invest some time and effort into actively executing a search engine query and consciously processing the results . Using search data to gauge this motivated search behavior has proven to be a valuable tool for researchers and will likely become increasingly popular as additional applications are explored and validated.
Turning the attention specifically to our focus of examining search data for “climate change,” “global warming,” and “weather,” different search terms can be classified by the user’s intention [27, 28]. “Climate change” and “global warming” are classified as informational; people search these to learn more about a concept. In contrast, “weather” is classified as resource or navigational; that is, people search this to obtain a quick fact (e.g., the daily forecast) or navigate to a website like weather.com, as opposed to something larger like wanting to understand the science behind meteorology. Understanding this classification is critical for the present study because it implies that the data from Google Trends are an aggregation of individuals seeking information and wanting to engage with the issue.
3. Data and Spatial Analysis
This section first describes the Google Trends data we collected. Second, we present maps that illustrate the degree of spatial variation in search activity and a Hot Spot analysis that looks for spatial clustering of search activity. Finally, we describe the extensive geographic and socioeconomic data collected for use in our multivariate regression analysis of the determinants of search activity.
3.1. Google Trends
We downloaded internet search activity in the United States using Google Trends for the keywords “global warming,” “climate change,” and “weather.” The smallest geographic unit that is offered by Google Trends for the USA is media market. Media markets consist of several contiguous counties, are typically associated with a metropolitan area, and are used primarily to gauge television viewership habits. The USA is broken into 210 media markets; however, this paper focuses on the 205 media markets in the Continental USA. We will refer to media markets as metropolitan areas, interchangeably.
Google Trends reports a time series index of search activity on a 0–100 scale, with 100 being the maximum search interest observed over that time and all other numbers being proportional to that maximum. Google also reports an average of this index, which can be interpreted as the average search interest over time, and we refer to it as the average query share (AQS). However, at the moment, AQSs for each media market cannot be compared because they are all relative to their own maximum. Because we are interested in across space comparisons, we downloaded each media market’s AQS simultaneously with the AQS for Washington DC. Now each media market’s AQS is relative to Washington DC. We create our key variable Google Search index (GSI), which will become the dependent variable in our multivariate regression, using the following formula: where is average query share for media market with respect to media market . The term in parentheses is a necessary scalar in case Washington DC does not have the maximum search interest compared to . If Washington DC does have the maximum search interest, then the second term equals one. We restricted the time window of Google searches to be from January 2008 to November 2012. January 2008 was chosen because several independent variables were measured in 2008 and prior to this time search volume was relatively low for global warming and climate change. We downloaded the data in November 2012, so that was the natural end point. For a fuller explanation of how and why we calculate GSI this way, please see the online appendix.
Google is mindful of not disclosing too much information. This is one reason why media markets are the smallest geographic unit at which data are available. Another important impact of Google’s concern is that if search volume is lower than a threshold, which is undisclosed, then the AQS reported is zero. This effectively truncates GSI at zero. Weather is a popular search term, and there are no zeros for the 205 media markets. However, global warming and climate change are substantially less popular and have 18 and 60 zeros, respectively. All media markets with zero AQS are still included in the analysis, but our regression models are designed to account for this truncation.
While we have scaled GSI relative to Washington DC in order to facilitate across space comparisons, the magnitudes of GSI values are difficult to interpret. To this end, we normalize the distribution of weather, global warming, and climate change GSI data to each have a standard deviation of 10. With this change, we can more easily interpret differences in GSI levels in terms of standard deviation. Further, when we present the regression analysis, we can interpret the coefficients in terms of standard deviation.
Table 1 presents summary statistics for the GSI of our three search terms: weather, global warming, and climate change. We present means and standard deviations, as well as minimums and maximums. The distribution of GSI for global warming and climate change seems skewed, as in both cases the maximum is about six standard deviations greater than the mean. The means of the three GSI terms cannot be meaningfully compared. While the ordinal relationship is correct, with weather having the highest GSI followed by global warming and then climate change, the magnitudes of differences are incorrect. In reality, on a national scale, average query share for weather is 67 times higher than global warming, which is twice as large as climate change.2
|Notes. GSI is gathered from Google Trends and is derived from raw Google search data generated by individuals across the United States from January 2008 to November 2012. We normalized the standard deviation to 10 to allow the spread of the distribution of a variable with a large mean and correspondingly large standard deviation to be compared more appropriately with the spread of the distribution of another variable with smaller mean and correspondingly smaller standard deviation. Average income data comes from The Bureau of Economic Analysis. Carbon dioxide emissions were gathered from the Vulcan fossil fuel CO2 emissions data, produced at Arizona State University in conjunction with collaborators at Colorado State University and Lawrence Berkeley National Laboratory. The Democrat data is from Dave Liep’s Atlas of US Presidential Elections 2008. College graduate data is from the 5-year American Community Survey from 2006 until 2010. The climate variables were gathered daily from the National Oceanic and Atmospheric Administration’s Global Historical Climatology Network.|
3.2. Spatial Trends in GSI
Using GIS, we created a map illustrating the spatial variation in GSI for each of our three search terms. These maps are displayed in Figure 1. The data are displayed as classified by quintiles with darker areas indicating areas of high search interest. High levels of interest in weather are generally found in the higher latitudes of the country, while pockets in the Southern, Southwestern, and Eastern seaboard portions of the country show relatively little interest in weather. Trends for searches of global warming show high areas of interest centering in the interior portion of the country, particularly in the Midwest, Northwest, and Great Lakes regions. Areas of low global warming interest exist particularly in the Southern and Northern Rocky Mountain regions. Similarly, searches for climate change show little interest in the Southern and Northern Rocky Mountain regions, while peak areas of interest are focused in the Southwest, West Coast, and Northeast regions. Speculatively, high search interest in the Northeast could be a result of changing snowfall levels in that part of the country and in the Midwest could be a direct result of major droughts in the corn and grain belt of the country.
Moving beyond simple spatial comparisons of GSI, we would like to identify statistically significant regions or clusters of interest and disinterest to inform policymakers where mitigation support may be high or low or prudent adaptation likely or unlikely. To this end, we completed a Hot Spot analysis of each Google variable using GIS, which identifies regions where several neighboring media markets have GSI that is a similarly large standard deviation from the population mean. In order to determine those hot and cold regions, a Getis-Ord statistic is calculated for each media market.3 For a hot (cold) spot to be designated, a media market must have a large positive (negative) and must be next to other media markets with similarly large .
Figure 2 is the result of the Hot Spot analysis. The solid black and gray areas indicate hot spots, statistically significant clusters of high GSI, and the similarly colored hashed areas are cold spots, statistically significant areas of small GSI. The Hot Spot analysis for weather showed clusters of high values in the northern latitudes of the country, mostly in the regions east of the Rockies and west of the Great Lakes. Clusters of low interest in weather searches are found in the Southern portions of California and Nevada, much of New Mexico, Southern Texas, and Florida, as well some parts of Appalachia. For global warming GSI, statistically significant clusters of high values were found in the Midwest, particularly south and west of the Great Lakes. Clusters of low values for global warming exist in the Southern region of the country stretching west from Georgia into central Texas, as well as an area encompassing much of Montana. Analysis for climate change searches shows four distinct regional hotspots. A similar cluster of disinterest exists for climate change in the South, starting in central Texas and becoming more significant heading east into Louisiana and Mississippi, before losing significance over Georgia. Three major areas of climate change interest center around California, the Northeast, and Washington State. In two of these areas, regional CO2 mitigation policies have been adopted (AB32 and RGGI).4
While it is unclear why these regions are hot and cold spots and what the determinants of high and low interest are, we feel that these maps are of inherent value. Similar to the work that has been done in Development Economics with geographic targeting of aid for increased efficacy [29, 30], these maps identify regions of high interest that can be harnessed for political support and areas of low interest that campaigns could target to rally support or may need targeted adaptation assistance. Areas of high interest have the potential for regional collaboration such as the efforts in California and New England described above. Conversely the areas of low interest indicate areas where awareness campaigns and increased community outreach may be needed to inspire change. As an unfortunate example, the epicenter of Hurricane Katrina is an area of low interest. EPA and FEMA officials may want to target this area to raise awareness, increase understanding of impacts, and encourage action in this region which is likely to be hard hit by the effects of climate change.
To better explain the variation of GSI across media markets, we collected a series of variables that are intuitively pertinent to the motivation to seek information about climate change. All the variables, their units, means and standard deviations, and minimums and maximums are listed in Table 1. The sources of the data are as follows. Total population, percent of Whites, percent of those over 65 years old, and percent of males come from the 2010 Census. Average income comes from the Bureau of Economic Analysis. CO2 emissions per capita come from the Vulcan fossil fuel data project . Democrat vote share is the share of votes received by the Democratic Party in the 2008 Presidential Election, and these data come from Dave Liep’s Atlas of US Presidential Elections. Percent of college graduates is the percent of the population that has completed college and was drawn from the five-year American Community Survey 2006–2010. All of the above variables were downloaded at the county level. They were then aggregated to the media market level using a crosswalk between counties and media markets provided by Gentzkow and Shapiro . Counties were weighted by population share within media market in order to attain the correct media market-wide measures.
We downloaded weather data from the National Oceanic and Atmospheric Administration’s Global Historical Climatology Network for all years 2000–2011. For each day in the sample, we averaged all weather station data at the media market level. To estimate approximate climate, average annual temperature was calculated as the average temperature across all days and all years, and annual total precipitation and annual total snow were calculated as the cumulative rain and snow for a given year, averaged over all years. Finally, coast is an indicator variable created using GIS, which equals one if a media market contains an ocean coastline.
4. Multivariate Analysis
In this section, we present the results of our multivariate regression analysis investigating the determinants of GSI for weather, global warming, and climate change. Table 2 presents the results. The first column shows the results for weather GSI and uses OLS. Because we have standardized each GSI term such that the standard deviation is 10.0, the coefficients are best interpreted in terms of the standard deviation.
|Notes. Column (1) uses OLS. Columns (2) and (3) use Tobit regression. The sample size is 205 for all columns. Standard errors are in parentheses. ; ; .|
The coefficient for weather searches on the coast variable is 2.80, which indicates that coastal media markets on average have a GSI 0.28 standard deviation higher than inland media markets. The coefficients on average annual temperature and its square are −1.42 and 0.021, respectively, which indicate a convex quadratic function of temperature with a minimum at 33.8°C. This suggests that the coldest media markets have the highest search interest, all else equal, and interest declines at a decreasing rate as the climate warms for essentially the whole range of temperatures (only two media markets have an average temperature above 33.8°C). The significance of these climate variables on weather queries lends credence to the Google data and our modeling approach. However, the coefficients on precipitation and snow are insignificant. The results in Column (1) additionally show that several socioeconomic variables are significant determinates of weather GSI. Originally, this idea is surprising, but when considering lifestyle patterns of different groups this result is understandable. For example, the coefficient on percent over 65 is 0.33 and is statistically significant, indicating that metropolitan areas with larger senior populations are more likely to seek information about the weather. With the caveat of ecological inference, this may suggest that seniors, who may be more affected by adverse weather than others, are more likely to search for weather on the internet. The coefficient on Democrat vote share is negative and significant, suggesting the Democratic leaning metropolitan areas are less likely to search weather. The coefficient on log population is negative and significant, indicating that metropolitan areas with smaller populations are more likely to search for weather. This potentially could be a result of a larger agricultural sector in low population areas, less infrastructure to deal with adverse weather, or simply less access to alternative forecasts from, say, local news. These results point out that people with varying socioeconomic characteristics may also care about weather in different ways. This evidence solidified our intuition that socioeconomic variables may be important determinants of global warming and climate change GSI.
Columns (2) and (3) display the results of Tobit regressions of global warming and climate change GSI. We use a Tobit regression because a number of our GSI values are zero, indicating they were censored by Google Trends for privacy reasons if a media market does not have enough search volume to reach a minimal threshold.5 For global warming GSI, income, percent of college graduates, percent of Whites, and percent of those over 65 years old are statistically significant. For every thousand dollar increase in media market income, global warming GSI decreases by 0.08 standard deviations. As the percent of college graduates increases one percent, global warming GSI increases by 0.061 standard deviations. Global warming GSI increases by 0.021 and 0.065 standard deviations as the share of Whites and the share of those over 65 increase by one percent, respectively. The coefficient on log population is also significant, but this is likely a function of Google’s volume cutoff for disclosure and small media market’s inability to have search volume above that mark. Surprisingly, geographic and climate variables are not correlated with search interest.
Column (3) examines climate change GSI and finds that the determinants are similar to those of global warming GSI. Income has a negative effect and share of Whites has a positive effect on climate change GSI. As with global warming GSI, percent of college graduates has a significant, positive effect, but for climate change GSI the magnitude is 2.6 times larger, which suggests that climate change is a more common search term in well-educated metropolitan areas. A more substantial difference is that Democrat vote share is a significant determinant of climate change GSI; a one percent of increase in Democrat vote share raises GSI by 0.028 standard deviations. This is intuitive as Democrats tend to be more supportive of climate change science and policies.6
4.1. Relationship between Climate Change GSI and Global Warming GSI
Many people think of global warming and climate change as synonymous, and certainly the results of Table 2 suggest a similar set of determinants. However, we wanted to examine if there are differences in the types of metropolitan areas that would search one term over the other. Table 3 gives the results of a model with climate change GSI as the dependent variable and global warming GSI as an explanatory variable, along with the full set of covariates from Table 2.
|Notes. The results shown come from ordinary least squares regression with climate change GSI as the dependent variable. The data with 0 in GSI of climate change or global warming are removed, which results in number of observations being 145. Standard errors are in parentheses. ; ; .|
The coefficient on global warming GSI indicates that while these two measures of GSI are strongly positively correlated, the correlation is far from perfect. Democrat vote share, percent of college graduates, and percent of Whites are still significant determinants of climate change GSI even when controlling for global warming GSI. These results suggest that there could be socioeconomic divides in the types of people that choose to search for climate change versus global warming. Specifically, metropolitan areas tend search more for climate change when the population is more liberal, more educated and more white. The education result seems intuitive as climate change is a more technical term, and people living in metropolitan areas with a highly educated population would be more likely to be exposed to this term.
4.2. Is There an Optimal Mix of Political Ideology for Engagement?
While the results so far have shown a positive correlation between Democrat vote share and GSI, given the prominence of politics in debates over climate change, we chose to examine this relationship in more detail. We first obtained residuals from the regression of GSI on all covariates except Democrat vote share, to partial out the correlation due to other covariates. Then we created nonparametric plots of those residuals against Democrat vote share. These results are given in Figure 3. Figure 3(a), which displays the plot for global warming GSI, shows that GSI increases steeply as metropolitan areas become less Republican, up until around a Democrat vote share of 43%, at which point GSI begins declining gradually. This inverse-U shaped relationship explains why Democrat vote share was an insignificant determinant of global warming GSI in Table 2. For climate change GSI, displayed in Figure 3(b), the relationship is always increasing, indicating that climate change GSI increases as metropolitan areas lean more Democratic over the whole range of the political spectrum. These results first suggest that climate change is more of a branded political word than global warming. Second, with global warming, our results indicate that metropolitan areas that are more moderate are seeking information and engaging in this topic, perhaps implying healthy debate.
This paper uses a novel data set of geographically explicit online information searches about climate change to assess the extent of geographic clustering of search interest and the climatological and socioeconomic determinants of search interest. Additionally, the data are distinctly private and anonymous, which may curtail any posturing that occurs during surveys about this divisive issue. Our results identify regions and the characteristics of metropolitan areas that actively seek information. One goal with these results is to inform the political process of passing meaningful carbon mitigation legislation by determining which areas of the United States and which types of cities are likely to support legislation and which will need to be campaigned for. Regardless of future legislation, some impacts of climate change are inevitable. Our results additionally identify the types of metropolitan areas that may adapt better to changes due to their knowledge and also which metropolitan areas and regions may need assistance.
In light of the findings of this paper, there are several areas of future research that would be valuable. Ultimately, one of the broader goals of this and other similar papers is to understand what shapes public opinion in order to pursue meaningful mitigation policies. This paper measures information seeking behavior, but this is fundamentally different from an opinion. Future research should seek to assess how the process of acquiring knowledge matters for policy support. Also, this paper measures static differences in information seeking behavior. It is just as critical to understand how behavior changes over time and what types of areas are most likely to change their behavior. On the adaptation side, future research could examine how information acquisition affects decisions about energy efficiency choices or flood protection or a host of smart adaptation strategies.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
This paper is a contribution of the Rhode Island Agricultural Experiment Station (no. 5410).
- Another concern with Google Trends is the extent to which it can represent the general population to the same degree as survey takers. The research examining this concern has been done in developing countries, where survey data are more prevalent and internet access is limited to a much smaller proportion of the population than in the United States, seemingly increasing the burden on search data. Carrière-Swallow and Labbé  and Mellon  validate search data as an equal or better indicator of a broad range of topics.
- These comparisons were made using the whole United States for the time period January 2008 through November 2012.
- The Getis-Ord local statistic is given as where is the attribute value for feature is the spatial weight between feature and is equal to the total number of features, and The statistic is a -score, so no further calculations are required. The statistic returned for each media market in the dataset is a -score. For larger and statistically significant positive -scores, intense clustering of high values or a hot spot is indicated. For smaller and statistically significant negative -scores, intense clustering of low values or a cold spot is indicated.
- An analysis of temporal trends suggested that search interest did not change upon passage of either RGGI or AB32, which indicates that the results of the Hot Spot analysis are not a product of those policies.
- While Google does not truncate at zero, the index that is reported by Google Trends is truncated at zero and that is why we use zero as the truncation point. The interpretation of each coefficient is in terms of marginal changes in the latent (uncensored) variable.
- As an additional validity check, we downloaded data for the search term “Mars,” which we view as a scientific search term, but without any political bend. If the determinants of Mars GSI were similar as those for global warming GSI and climate change GSI, it would question whether the results were specific to climate change or just search terms in general. The results indicate that only income and log population are determinants of Mars GSI, suggesting that global warming/climate change is somewhat unique and using Google Trends may lead to important insights about spatial disparities in engagement.
The purpose of the Supplementary Material is to provide additional details on the calculation of GSI. Beyond what is in the main text, we further explain how the data were derived from Google Trends. With the use of examples with screenshots of the Google Trends interface, we argue why our normalization of AQS is necessary.
- M. E. Kahn, Climatopolis: How Our Cities Will Thrive in the Hotter Future, Basic Books, New York, NY, USA, 2010.
- R. E. O'Connor, R. J. Bord, B. Yarnal, and N. Wiefek, “Who wants to reduce greenhouse gas emissions,” Social Science Quarterly, vol. 83, no. 1, pp. 1–17, 2002.
- K. Akerlof, E. W. Maibach, D. Fitzgerald, A. Y. Cedeno, and A. Neuman, “Do people “personally experience” global warming, and if so how, and does it matter?” Global Environmental Change, vol. 23, no. 1, pp. 81–91, 2013.
- P. J. Egan and M. Mullin, “Turning personal experience into political attitudes: the effect of local weather on Americans' perceptions about global warming,” The Journal of Politics, vol. 74, no. 3, pp. 796–809, 2012.
- R. E. Dunlap and A. M. McCright, “A widening gap republican and democratic views on climate change,” Environment, vol. 50, no. 5, pp. 26–35, 2008.
- J. A. Krosnick, A. L. Holbrook, L. Lowe, and P. S. Visser, “The origins and consequences of democratic citizens' policy agendas: a study of popular concern about global warming,” Climatic Change, vol. 77, no. 1-2, pp. 7–43, 2006.
- R. Shwom, D. Bidwell, A. Dan, and T. Dietz, “Understanding U.S. public support for domestic climate change policies,” Global Environmental Change, vol. 20, no. 3, pp. 472–482, 2010.
- C. P. Borick and B. G. Rabe, “A reason to believe: examining the factors that determine individual views on global warming,” Social Science Quarterly, vol. 91, no. 3, pp. 777–800, 2010.
- R. J. Brulle, J. Carmichael, and J. C. Jenkins, “Shifting public opinion on climate change: an empirical assessment of factors influencing concern over climate change in the U.S., 2002–2010,” Climatic Change, vol. 114, no. 2, pp. 169–188, 2012.
- L. C. Hamilton and B. D. Keim, “Regional variation in perceptions about climate change,” International Journal of Climatology, vol. 29, no. 15, pp. 2348–2352, 2009.
- K. Purcell, J. Brenner, and L. Rainie, Search Engine Use 2012, Pew Internet and American Life Project, Washington, DC, USA, 2012.
- C. P. Scheitle, “Google's insights for search: a note evaluating the use of search engine data in social research,” Social Science Quarterly, vol. 92, no. 1, pp. 285–295, 2011.
- S. Reilly, S. Richey, and J. B. Taylor, “Using google search data for state politics research: an empirical validity test using roll-off data,” State Politics and Policy Quarterly, vol. 12, no. 2, pp. 146–159, 2012.
- K. Sherman-Morris, J. Senkbeil, and R. Carver, “Who's googling what? What internet searches reveal about hurricane information seeking,” Bulletin of the American Meteorological Society, vol. 92, no. 8, pp. 975–985, 2011.
- H. Choi and H. Varian, “Predicting the present with Google Trends,” Economic Record, vol. 88, no. 1, pp. 2–9, 2012.
- C. Hand and G. Judge, “Searching for the picture: forecasting UK cinema admissions using Google trends data,” Applied Economics Letters, vol. 19, no. 11, pp. 1051–1055, 2012.
- J. Ginsberg, M. H. Mohebbi, R. S. Patel, L. Brammer, M. S. Smolinski, and L. Brilliant, “Detecting influenza epidemics using search engine query data,” Nature, vol. 457, no. 7232, pp. 1012–1014, 2009.
- K. J. Pelc, “Googling the WTO: what search-engine data tell Us about the political economy of institutions,” International Organization, vol. 67, no. 3, pp. 629–655, 2013.
- S. Stephens-Davidowitz, “The effects of racial animus on a black presidential candidate: using Google search data to find what surveys miss,” Working Paper, Harvard University, 2012.
- M. E. Kahn and M. J. Kotchen, “Business cycle effects on concern about climate change: the chilling effect of recession,” Climate Change Economics, vol. 2, no. 3, pp. 257–273, 2011.
- C. Lang, “Do weather fluctuations cause people to seek information about climate change?” Climatic Change, vol. 125, pp. 291–303, 2014.
- J. Mellon, “Where and when can we use google trends to measure issue salience?” PS—Political Science and Politics, vol. 46, no. 2, pp. 280–290, 2013.
- C. D. Swearingen and J. T. Ripberger, “Google Insights and U.S. senate elections: does search traffic provide a valid measure of public attention to political Candidates?” Social Science Quarterly, vol. 95, no. 3, pp. 882–893, 2014.
- S. Manzano and J. D. Ura, “Desperately seeking sonia? Latino heterogeneity and geographic variation in web searches for judge sonia sotomayor,” Political Communication, vol. 30, no. 1, pp. 81–99, 2013.
- M. Scharkow and J. Vogelgesang, “Measuring the public agenda using search engine queries,” International Journal of Public Opinion Research, vol. 23, no. 1, pp. 104–113, 2011.
- H. A. Carneiro and E. Mylonakis, “Google trends: a web-based tool for real-time surveillance of disease outbreaks,” Clinical Infectious Diseases, vol. 49, no. 10, pp. 1557–1564, 2009.
- D. E. Rose and D. Levinson, “Understanding user goals in Web search,” in Proceedings of the 13th International World Wide Web Conference Proceedings, pp. 13–19, May 2004.
- J. T. Ripberger, “Capturing Curiosity: Using Internet Search Trends to Measure Public Attentiveness,” Policy Studies Journal, vol. 39, no. 2, pp. 239–259, 2011.
- J. L. Baker and M. E. Grosh, “Poverty reduction through geographic targeting: how well does it work?” World Development, vol. 22, no. 7, pp. 983–995, 1994.
- C. Lang, C. B. Barrett, and F. Naschold, “Targeting maps: an asset-based approach to geographic targeting,” World Development, vol. 41, no. 1, pp. 232–244, 2013.
- K. R. Gurney, D. L. Mendoza, Y. Zhou et al., “High resolution fossil fuel combustion CO2 emission fluxes for the United States,” Environmental Science and Technology, vol. 43, no. 14, pp. 5535–5541, 2009.
- M. Gentzkow and J. M. Shapiro, “Introduction of television to the United States Media Market, 1946–1960,” in Inter-University Consortium for Political and Social Research [Distributors], University of Chicago, Chicago, Ill, USA, 2008.
- Y. Carrière-Swallow and F. Labbé, “Nowcasting with Google trends in an emerging market,” Journal of Forecasting, vol. 32, no. 4, pp. 289–298, 2013.
Copyright © 2014 Patrick Cavanagh et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.