Sala I Martin
Sala I Martin
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted
digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about
JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
http://about.jstor.org/terms
American Economic Association is collaborating with JSTOR to digitize, preserve and extend access to The
American Economic Review
This content downloaded from 129.82.37.215 on Wed, 28 Sep 2016 22:57:11 UTC
All use subject to http://about.jstor.org/terms
I Just Ran Two Million Regressions
By XAVIER X. SALA-I-MARTIN *
Following the seminal work of Robert Barro An initial answer to this question was given
(1991 ), the recent empirical literature on eco- by Ross Levine and David Renelt (1992).'
nomic growth has identified a substantial num- They applied Edward Leamer's (1985)
ber of variables that are partially correlated extreme-bounds test to identify "robust" em-
with the rate of economic growth. The basic pirical relations in the economic growth
methodology consists of running cross- literature. In short, the extreme-bounds test
sectional regressions of the form works as follows. Imagine that there is a pool
of N variables that previously have been iden-
(1) Y = a + 1x, + 32X2 tified to be related to growth and one is inter-
ested in knowing whether variable z is
+.-.+Inxn +? "robust." One would estimate regressions of
the form
where y is the vector of rates of economic
growth, and x,, ... , x, are vectors of explan- (2) y = aj + pyjy + Pzjz + Ixjxj + s
atory variables, which vary across researchers
and across papers. Each paper typically reports where y is a vector of variables that always
a (possibly nonrandom) sample of the regres- appear in the regressions (in the Levine and
sions actually run by the researcher. Variables Renelt paper, these variables are the initial
like the initial level of income, the investment level of income, the investment rate, the sec-
rate, various measures of education, some pol- ondary school enrollment rate, and the rate
icy indicators, and many other variables have of population growth), z is the variable of
been found to be significantly correlated with interest, and xj E X is a vector of up to three
growth in regressions like ( 1 ). I have collected variables taken from the pool X of N vari-
around 60 variables which have been found to ables available. One needs to estimate this
be significant in at least one regression. regression or model for all the possible M
The problem faced by empirical growth combinations of xj E X. For each model j]
economists is that growth theories are not one finds an estimate, fzj r and a standard de-
explicit enough about what variables xj viation,
be- azj The lower extreme bound is de-
long in the "true" regression. That is, even fined to be the lowest value of zj -2ozj,
if it is known that the "true" model looks and the upper extreme bound is defined to be
like (1), one does not know exactly what the largest value of fzj + 2ozj. The extreme-
particular variables xj should be used. Ifbounds
one test for variable z says that if the
starts running regressions combining the lower extreme bound for z is negative and
various variables, variable xi will soon be the upper extreme bound is positive, then
found to be significant when the regression variable z is not robust. Note that this
includes variables x2 and x3, but it becomes amounts to saying that if one finds a single
nonsignificant when X4 iS included. Since the regression for which the sign of the coeffi-
"true" variables that should be included are cient f3z changes or becomes insignificant,
not known, one is left with the question: then the variable is not robust.
what are the variables that are really corre- Not surprisingly, Levine and Renelt's
lated with growth? conclusion is that very few (or no) vari-
This content downloaded from 129.82.37.215 on Wed, 28 Sep 2016 22:57:11 UTC
All use subject to http://about.jstor.org/terms
VOL. 87 NO. 2 RECENT EMPIRICAL GROWTH RESEARCH 179
ables are robust. One possible reason for A. Case 1: The Distribution of the Estimates
finding few or no robust variables is, of of f3 across Models Is Normal
course, that very few variables can be iden-
tified to be correlated systematically with In order to compute the cumulative distri-
growth. Hence, some researchers' reading bution function [CDF(0)], one needs to know
of the Levine and Renelt paper concluded the mean and the standard deviation of this
that nothing can be learned from this em- distribution. For each of the M models, com-
pirical growth literature because no vari- pute the (integrated) likelihood, Lj, the point
ables are robustly correlated with growth. estimate ,j, and the standard deviation u,j.
Another explanation, however, is that the With all these numbers one can construct the
test is too strong for any variable to pass it:
mean estimate of /, as the weighted average
if the distribution of the estimators of /, has
each of the M point estimates, I83z:
some positive and some negative support,
M
then one is bound to find one regression for
which the estimated coefficient changes (3) fz= E zj
1=1
signs if enough regressions are run. Thus,
giving the label of nonrobust to all variables where the weights, wzj are proportional to the
is all but guaranteed. (integrated) likelihoods
This content downloaded from 129.82.37.215 on Wed, 28 Sep 2016 22:57:11 UTC
All use subject to http://about.jstor.org/terms
180 AEA PAPERS AND PROCEEDINGS MAY 1997
This content downloaded from 129.82.37.215 on Wed, 28 Sep 2016 22:57:11 UTC
All use subject to http://about.jstor.org/terms
VOL. 87 NO. 2 RECENT EMPIRICAL GROWTH RESEARCH 181
do not report these results, I performed the TABLE 1-MAIN RESULTS OF REGRESSIONS
extreme-bounds test on the 59 tested vari- (DEPENDENT VARIABLE = GROWTH)
This content downloaded from 129.82.37.215 on Wed, 28 Sep 2016 22:57:11 UTC
All use subject to http://about.jstor.org/terms
182 AEA PAPERS AND PROCEEDINGS MAY 1997
It is interesting to note some of the variables My claim in this paper is that, if one is in-
that are not in the table (because they appear terested in knowing the coefficient of a par-
not to be important): no measure of govern- ticular variable in a growth regression, the
ment spending (including investment) appears picture emerging from the empirical growth
to affect growth in a significant way. The var- literature is not the pessimistic "nothing is
ious measures of financial sophistication, the robust" obtained with the extreme bound
inflation rate, and its variance do not appear to analysis. Instead, a substantial number of
matter much. (In fairness to the authors who variables can be found to be strongly related
proposed these variables, I should say that they to growth.
specifically say that they affect growth in non-
linear ways, and my analysis allowed these REFERENCES
variables to enter in a linear fashion only.)
Other variables that do not seem to matter in- Barro, Robert J. "Economic Growth in a Cross
clude various measures of scale effects (mea- Section of Countries." Quarterly Journal of
sured by total area and total labor force), Economics, May 1991, 106(2), pp. 407-
outward orientation, tariff restrictions, the 43.
black-market premium, and the recently pub- . "'Determinants of Democracy."
licized "ethno-linguistic fractionalization" Mimeo, Harvard University, July 1996.
(which is supposed to capture the degree to Barro, Robert J. and Lee, Jong-Wha. "Interna-
which there are internal fights among vanous tional Comparisons of Educational Attain-
ethnic groups).' ment." Journal of Monetary Economics,
December 1993, 32(3), pp. 363-94.
De Long, J. Bradford and Summers, Lawrence.
"'Equipment Investment and Economic
7 The data for this paper were taken from the World
Growth." Quarterly Journal of Economics,
Bank Research Department's Web page. May 1991, 106(2), pp. 445-502.
8 The data for Sachs and Warner ( 1995) were provided Hall, Robert and Jones, Charles. "The Produc-
by Andrew Warner; the data for Hall and Jones (1996) tivity of Nations." National Bureau of
were taken from Charles Jones's Web page.
Economic Research (Cambridge, MA)
9 See Sala-i-Martin ( 1996) for the complete list of vari-
ables, with their estimated coefficients and levels of Working Paper No. 5812, November
significance. 1996.
This content downloaded from 129.82.37.215 on Wed, 28 Sep 2016 22:57:11 UTC
All use subject to http://about.jstor.org/terms
VOL. 87 NO. 2 RECENT EMPIRICAL GROWTH RESEARCH 183
This content downloaded from 129.82.37.215 on Wed, 28 Sep 2016 22:57:11 UTC
All use subject to http://about.jstor.org/terms