
Apr 7 2009

Economic Methods

Open Source Forecasting

Arnold Kling


Google’s Hal Varian and Hyunyoung Choi write,

We find that Google Trends data can help improve forecasts of the current level of activity for a number of different economic time series, including automobile sales, home sales, retail sales, and travel behavior…

we expect that there are several other interesting ideas out there. So we suggest that forecasting wannabes download some Google Trends data and try to relate it to other economic time series.
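For anyone inclined to try the exercise, here is a minimal sketch of what "relating" a search series to an economic series might look like. It is not the specification from the Varian and Choi paper. It compares a simple baseline (last period's value only) with the same model plus a same-period search index, scoring both on one-step-ahead predictions that use only past data for fitting. Everything here is an assumption made for illustration: the data are simulated rather than downloaded from Google Trends, and the names `ols`, `rolling_mae`, and `simulate` are hypothetical helpers.

```python
import random


def ols(X, y):
    """Least-squares coefficients from the normal equations (X'X) b = X'y."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in X) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(X, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        for r in range(k):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][k] / A[i][i] for i in range(k)]


def rolling_mae(y, x, use_trends, start):
    """Mean absolute error of one-step-ahead predictions.

    For each period t >= start, the model is refit on periods before t only,
    so every prediction is out of sample.
    """
    errors = []
    for t in range(start, len(y)):
        # Regressors: constant, last period's y, optionally this period's search index.
        rows = [[1.0, y[s - 1]] + ([x[s]] if use_trends else [])
                for s in range(1, t)]
        beta = ols(rows, y[1:t])
        feats = [1.0, y[t - 1]] + ([x[t]] if use_trends else [])
        pred = sum(b * f for b, f in zip(beta, feats))
        errors.append(abs(y[t] - pred))
    return sum(errors) / len(errors)


def simulate(n=120, seed=0):
    """Simulated data (NOT real Google Trends data): y depends on its own lag and on x."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [0.0]
    for t in range(1, n):
        y.append(0.6 * y[t - 1] + 0.8 * x[t] + rng.gauss(0, 0.3))
    return y, x


y, x = simulate()
mae_base = rolling_mae(y, x, use_trends=False, start=40)
mae_trends = rolling_mae(y, x, use_trends=True, start=40)
print("baseline MAE:", round(mae_base, 3))
print("with search index MAE:", round(mae_trends, 3))
```

Because the simulated series is built to depend on the search index, the second model wins by construction. With real data the interesting question is whether that gap survives out of sample, and whether the search index adds anything beyond a baseline that uses only standard data.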

READER COMMENTS


rpl

Apr 7 2009 at 6:27pm

I was pretty skeptical of the notion of “forecasting” anything using Google data, but the Google article is not really talking about forecasting as such, but rather getting data on current happenings more quickly than you could get it by waiting on the official statistics. That seems a lot more plausible.

My first thought was to wonder how well Google searches really correlate with actual behavior. I’ve only skimmed the paper, but it looks like their model contains some adjustable parameters, and the phrase “cross-validation” doesn’t appear anywhere in the text. That isn’t a good sign, but one would have to read the procedure more carefully to untangle the subtleties.

My second thought was, how easily could these statistics be manipulated? If people come to rely on Google Trends and its models, then the operators of, say, a large bot-net could generate a bunch of bogus searches to create the appearance of a fake recovery in, say, retail sales. Stocks of retailers would presumably surge, creating an opportunity for the perpetrators to profit using short sales or judicious options purchases.

Google is justifiably proud of its mammoth data set, but I don’t think they’ve given much thought to quality-controlling the data. QC can be a real headache even in cases where the data source is well understood. For example, in meteorology, bad ASOS and radiosonde observations sometimes make it into models, and good ones are sometimes erroneously rejected. Both types of error have been known to compromise forecasts, and I would aver that human users are even less predictable than weather instruments. Therefore, I would conjecture that the QC problem will be a deal-breaker for using Google Trends data as a significant economic indicator.

Bman

Apr 8 2009 at 3:21pm

Dear Dr. Kling,

Thanks for the excellent blog. Related to this post, you might want to have a look here:

http://messymatters.com/2009/03/21/the-future-is-yesterday/

Many of the series that correlate with Google Trends data can be forecast just as well or better using standard data and simple techniques.

Thank you.

Patri Friedman

Apr 10 2009 at 8:55pm

My old team at Google :). (Although I worked on other stuff – auctions, competitiveness of the search market)

rpl – you are totally wrong about the QC. Keep in mind that Google Searches => Google Ads => Google revenue and billing Google’s advertisers. False searches mean false billing for ads. As a company that cares about providing long-term value, Google goes to enormous effort to identify many kinds of fraudulent search (“search spam”, “ad spam”) and eliminate it from their records.

Sure, it is imperfect, but QC is *not* ignored. Lots of effort goes into QCing that data because advertisers are billed based on it.

