Forecasting Principles - Evidence-based Forecasting
Rules for Cheaters: How to Get a High R2

Taken from J. Scott Armstrong, 1985, Long-Range Forecasting, 2nd ed., p. 487.

Some people are impressed by a high R2. There are simple things that can be done to raise R2; other than that, they are of no value. The most important thing is not to use these rules (please don't), but to be aware that others use them.

  1. Discard outliers after you examine the regression results.
  2. Aggregate the data, especially when it reduces sample size significantly.
  3. Experiment by trying lots of variables.
  4. Try different functional forms.
  5. Use stepwise regression and retain all coefficients with t statistics greater than 1.0 (Haitovsky, 1969).
  6. Include a lot of variables in the final equation.
  7. Use R2 rather than .

These rules should yield R2 values of over 99% for time series data and about 90% for cross-sectional data. My advice is that you should not use R2 for time series data. The dangers outweigh any potential benefits.


 

Principle of the day:

14.8 Obtain good feedback about forecast accuracy and the reasons why errors occurred

Advertisement
Advertisement
There have been mod_vvisit_countermod_vvisit_countermod_vvisit_countermod_vvisit_countermod_vvisit_countermod_vvisit_countermod_vvisit_countermod_vvisit_counter visits to this website since February 14, 1998.
© Copyright 1997-2009 by J. Scott Armstrong. All rights are reserved. Web Design by Zoe Communications Ltd.
This site is directed by J. Scott Armstrong and Kesten C. Green.

The Forecasting Principles site was sponsored by the Marketing Department of The Wharton School, University of Pennsylvania for the first nine years. The International Institute of Forecasters has been a sponsor since July 2006.