Zipf Plots and The Size Distribution of Firms: Economics Letters

This document analyzes the size distribution of firms using a Zipf plot technique. The analysis finds that while the log-normal distribution fits the data reasonably well, there is less mass in the upper tail than expected, with fewer extremely large firms than predicted by the log-normal. Specifically, the largest firm, General Motors, has sales that are smaller than the 8th or 9th largest predicted by the log-normal. This suggests deviations from Gibrat's law of proportional growth and the assumption that growth rates are independent of firm size.

ELSEVIER Economics Letters 49 (1995) 453-457

Zipf plots and the size distribution of firms

Michael H.R. Stanley a, Sergey V. Buldyrev a, Shlomo Havlin a,b,
Rosario N. Mantegna a, Michael A. Salinger c,*, H. Eugene Stanley a
a Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, MA 02215, USA
b Department of Physics, Bar-Ilan University, Ramat-Gan, Israel
c School of Management, Boston University, 704 Commonwealth Avenue, Boston, MA 02215, USA
Received 14 November 1994; revised version received 9 March 1995; accepted 13 March 1995

Abstract

We use a Zipf plot to demonstrate that the upper tail of the size distribution of firms is too thin relative to the log
normal rather than too fat, as had previously been believed.

Keywords: Firm size; Zipf plot; Gibrat's Law

JEL classification: L11

This paper presents new evidence on the size distribution of firms. Like earlier studies, it
shows that the log-normal distribution fits the data well except for the upper tail. However, in
contrast to earlier studies, we find that there is too little mass in the upper tail, not too much.
We demonstrate this point with a statistical technique that has been used rarely in economics,
but is more common in physics.¹ The technique, known as a Zipf plot, is a plot of the log of
the rank vs. the log of the variable being analyzed.
Let \((x_1, \ldots, x_N)\) be a set of N observations on a random variable x for which the
cumulative distribution function is \(F(x)\), and suppose that the observations are ordered from
largest to smallest so that the index i is the rank of \(x_i\). The Zipf plot of the sample is the graph
of \(\ln x_i\) against \(\ln i\). Because of the ranking, \(i/N = 1 - F(x_i)\), so

\[ \ln i = \ln[1 - F(x_i)] + \ln N . \qquad (1) \]

Thus, the log of the rank is simply a transformation of the cumulative distribution function. It
accentuates the upper tail of the distribution and therefore makes it easier to detect deviations

* Corresponding author.
1 See Gell-Mann (1994, p. 93) for a discussion.

Elsevier Science S.A.

SSDI 0165-1765(95)00696-6
454 M.H.R. Stanley et al. / Economics Letters 49 (1995) 453-457

in the upper tail from the theoretical prediction of a particular distribution. Since there has
been interest in the upper tail of the size distribution of firms, the Zipf plot is particularly
useful for analyzing this question.
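
As an illustration of the construction described above, the following Python sketch builds the Zipf-plot coordinates for a sample. The function name `zipf_points` and the sales figures are hypothetical and are not taken from the paper; the sketch only assumes that every observation is strictly positive so that its logarithm exists.

```python
import math

def zipf_points(values):
    """Return (ln rank, ln value) pairs, with rank 1 given to the largest value."""
    ordered = sorted(values, reverse=True)  # largest first, so the index is the rank
    return [(math.log(rank), math.log(x)) for rank, x in enumerate(ordered, start=1)]

# Hypothetical sales figures in dollars (illustrative only, not Compustat data).
sample_sales = [5.0e6, 1.2e9, 3.4e7, 8.8e10, 2.1e8]
points = zipf_points(sample_sales)

for ln_rank, ln_sales in points:
    print(f"ln i = {ln_rank:.3f}   ln x_i = {ln_sales:.3f}")
```

Plotting the second coordinate against the first gives the Zipf plot; because the largest observations sit at the smallest ranks, the upper tail is spread out along the left of the logarithmic rank axis.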
The Zipf plot for the log-normal distribution is characterized by

\[ \ln i = \ln\left[ 1 - \Phi\left( \frac{\ln x_i - \mu}{\sigma} \right) \right] + \ln N , \qquad (2) \]

where \(\mu\) and \(\sigma\) are the mean and standard deviation of \(\ln x_i\), and \(\Phi\) is the standard normal
cumulative distribution function. Solving (2) for \(\ln x_i\) as a function of \(\ln i\) gives

\[ \ln x_i = \sigma \Phi^{-1}\left( 1 - \frac{e^{\ln i}}{N} \right) + \mu . \qquad (3) \]
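
A minimal sketch of how the theoretical curve in Eq. (3) might be evaluated, using the inverse normal CDF from Python's standard library. The function name `lognormal_zipf_curve` is hypothetical. The parameter values are the sample mean, standard deviation, and sample size reported later in the paper. Rank N is left out as a choice of this sketch, because the argument of the inverse CDF is then zero and the expression diverges.

```python
import math
from statistics import NormalDist

def lognormal_zipf_curve(mu, sigma, n):
    """Evaluate Eq. (3) for ranks 1..n-1 and return (ln i, predicted ln x_i) pairs."""
    std_normal = NormalDist()  # standard normal: mean 0, standard deviation 1
    curve = []
    for rank in range(1, n):  # rank n is skipped: the inverse CDF at 0 is unbounded
        ln_x = sigma * std_normal.inv_cdf(1.0 - rank / n) + mu
        curve.append((math.log(rank), ln_x))
    return curve

# Values reported in the paper for the log of 1993 sales.
curve = lognormal_zipf_curve(mu=17.76, sigma=2.72, n=4071)
print(f"predicted ln(sales) at rank 1: {curve[0][1]:.2f}")
```

The predicted log of sales at rank 1 can be compared with the value of 25.63 that the paper reports for the largest firm in the sample.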

The data for this study are the 1993 sales of 4071 manufacturing firms (SIC codes
2000-3999) on Compustat.² Fig. 1 shows a histogram of the log of sales with bin sizes equal to
\(\sqrt{2}\). The curve is the normal density function with mean and standard deviation equal to the
sample mean and standard deviation of the log of sales. The graph seems to suggest that the

[Figure 1: histogram of the number of firms against Sales (Dollars); the horizontal axis is logarithmic, running from 10^2 to 10^12.]

Fig. 1. Distribution of firm size. The circles are a histogram showing the number of firms having 1993 sales of X
dollars as a function of log X. The data are for the 4071 Compustat firms in SIC codes 2000-3999. The values of the
sales are binned in powers of \(\sqrt{2}\). The solid curve is a log-normal fit to the data using the mean of the log of sales
and the standard deviation of the log of sales as fitting parameters.

2 Compustat is not, of course, the entire population of firms. In principle, though, it is the entire population of
publicly traded firms. While we only report results here for 1993, we have done the analysis for 1975, 1979, 1980,
and 1984 and obtained qualitatively similar results.

[Figure 2: double logarithmic plot; horizontal axis: Rank; vertical axis: sales.]

Fig. 2. Zipf plot. The bottom curve is a Zipf plot (double logarithmic plot of sales vs. rank) for the same sample as
in Fig. 1. The top curve is a predicted Zipf plot obtained from the log-normal fit shown in Fig. 1.

distribution of the log of sales fits the log normal reasonably well. Fig. 2 shows the Zipf plot
along with the theoretical Zipf plot for the log normal. Like the histogram, the Zipf plot
suggests that the log normal fits the distribution of sales reasonably well. However, in contrast
to the histogram, the Zipf plot makes clear that the sales of the largest firms are smaller than
would be the case for a true log normal. The actual Zipf plot lies below the theoretical Zipf
plot for roughly the largest 100 firms.
With the aid of the Zipf plot, the deviations from the log normal can also be seen in Fig. 1.
First, the three points on the right lie slightly below the best fitting density function. The main
source of deviation is, however, that the upper tail should contain additional firms. The largest
firm in the sample, General Motors, has sales of $136 billion. The natural log of GM's sales is
25.63. The mean of the natural log of sales is 17.76 and the standard deviation is 2.72. Thus,
the natural log of GM's sales is 2.90 standard deviations above the mean. The probability that
an observation from a standard normal distribution exceeds 2.90 is 0.0019. Multiplying this
probability by the number of firms (other than GM) in the sample (4070) gives 7.73, which is
the expected number of firms with sales greater than $136 billion. If the distribution were log
normal, therefore, we would expect GM's level of sales to be the eighth or ninth largest, not
the first. All that would be needed for the size distribution of firms to be log normal would be
seven firms larger than GM!
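
The arithmetic in the preceding paragraph can be retraced with a short Python sketch. All inputs are the figures quoted in the text; the variable names are illustrative. Because the sketch carries unrounded intermediate values, its expected count comes out slightly below the 7.73 obtained in the text from the rounded tail probability 0.0019.

```python
import math
from statistics import NormalDist

gm_sales = 136e9          # sales of the largest firm, in dollars
mu, sigma = 17.76, 2.72   # mean and standard deviation of the natural log of sales
n_other = 4070            # firms in the sample other than GM

ln_gm = math.log(gm_sales)             # natural log of GM's sales
z = (ln_gm - mu) / sigma               # standard deviations above the mean
tail = 1.0 - NormalDist().cdf(z)       # probability that a standard normal exceeds z
expected_larger = n_other * tail       # expected number of firms larger than GM

print(f"ln(GM sales) = {ln_gm:.2f}")
print(f"z = {z:.2f}, tail probability = {tail:.4f}")
print(f"expected number of larger firms = {expected_larger:.2f}")
```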
This deviation from log normality is statistically significant. Under the null hypothesis of log
normality, the number of firms with sales greater than $136 billion has a binomial distribution
with p = 0.0019 and N = 4070. The variance is, therefore, 4070 × 0.0019 × 0.9981 = 7.72; and
the standard deviation is 2.78. Thus, the actual number of firms with sales greater than
$136 billion, which is 0, is 2.78 standard deviations less than the expected number, which is
7.73.³ The probability that none of the 4070 firms other than GM would have sales greater
than $136 billion is \((1 - 0.0019)^{4070} = 0.00043\), which is substantially below any conventional
standard for significance.⁴
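
The significance calculation can be checked with an exact binomial sum, sketched here in Python. The helper `binom_cdf` is a hypothetical name, and the sketch assumes that the p-values quoted in footnote 4 are lower-tail probabilities, that is, the probability of observing at most k firms above $136 billion under the null hypothesis.

```python
import math

def binom_cdf(k, n, p):
    """Probability of at most k successes in n independent trials with success probability p."""
    return sum(math.comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k + 1))

n, p = 4070, 0.0019                # values used in the text

mean = n * p                       # expected number of firms above $136 billion
std_dev = math.sqrt(n * p * (1 - p))
p_none = binom_cdf(0, n, p)        # no firm larger than GM (the observed outcome)
p_at_most_one = binom_cdf(1, n, p)
p_at_most_three = binom_cdf(3, n, p)

print(f"mean = {mean:.2f}, standard deviation = {std_dev:.2f}")
print(f"P(0) = {p_none:.5f}, P(<=1) = {p_at_most_one:.4f}, P(<=3) = {p_at_most_three:.3f}")
```

Under that reading, the probability of at most one larger firm falls below 0.01 and the probability of at most three is close to 0.051, in line with the figures given in footnote 4.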
These results are of interest because of their implications for the literature on the dynamics
of firm growth. Gibrat (1931) showed that if the distribution of growth rates is independent of
firm size, the static distribution of firm size would approach the log normal. In an early
empirical test using British data, Hart and Prais (1956) found evidence both that the log
normal fits the distribution of firm sizes reasonably well and that the growth rates of firms
were independent of initial size. They found, however, statistically significant deviations of the
distribution from the log normal by estimating the third and fourth moments of the
distribution. The distributions were 'somewhat skewed to the right and slightly leptokurtotic.'⁵
Quandt (1966) proposed four tests of the distribution of firm size. He was able to reject log
normality for the Fortune 500 in both 1955 and 1960 with each of the four tests he used.
Although he was able to reject every distribution he tested for the Fortune 500 with at least
one test for at least one of the years, the two Pareto distributions and the Champernowne
generally fit better than the log normal. In summarizing the literature, Hall (1987) wrote:
"The size distribution of firms conforms fairly well to the log normal, with possibly some
skewness to the right" (p. 584). Thus, the results here may suggest a qualitative change in the
size distribution of firms from the earlier time periods used in those studies.⁶

Acknowledgments

We have benefited from conversations with Glenn Loury, Jeff Miron, and Martha Schary
and from a referee's comments.

References

Gell-Mann, M., 1994, The quark and the jaguar (W.H. Freeman, New York).
Gibrat, R., 1931, Les inégalités économiques (Sirey, Paris).

3 Because p is small, N is large, and Np is moderate, the distribution of the number of firms with sales greater
than $136 billion is approximately Poisson. The fact that the standard deviation and the number of standard
deviations from the mean are (approximately) equal is due to the well-known result that the variance of a Poisson
distribution equals the mean.
4 Even if there were one firm with sales larger than $136 billion, log normality could be rejected at the 1% level.
Log normality could be nearly rejected at the 5% level if there were three firms with sales above $136 billion. (The
p-value for three firms is 0.051.)
5 The skewness of the log of sales in our sample is -4.01. The kurtosis is 176.7, which is 3.22 times the square of
the variance. This ratio is only slightly above the theoretical value of 3 for a normal distribution.
6 An alternative explanation is that the earlier studies by Simon and Bonini (1958) and Quandt (1966) used the
Fortune 500 as their samples.

Hall, B.H., 1987, The relationship between firm size and firm growth in the U.S. manufacturing sector, The
Journal of Industrial Economics 35, 583-606.
Hart, P.E. and S.J. Prais, 1956, The analysis of business concentration: A statistical approach, Journal of the Royal
Statistical Society, Series A, 119, 150-181.
Quandt, R., 1966, On the size distribution of firms, American Economic Review 56, 416-432.
Simon, H. and C.P. Bonini, 1958, The size distribution of business firms, American Economic Review 48, 607-617.
