Ch. 3 - Systems of Equations
3.1 Introduction
So far we have considered the estimation of models consisting of a single equation. There are many cases, however, in which such equations are not determined by themselves, but rather simultaneously with other equations. In these cases it makes more sense to estimate all the equations jointly. Consider, for example, that we want to estimate the household demand functions for gas and groceries. These decisions are normally made simultaneously given an expenditure constraint, so it stands to reason that the unobserved effects on the demand for each good must be related, i.e., the errors must be correlated across equations.
In this chapter we introduce a modeling framework for multiple equations that can be used for different applications, and then consider different estimation methods for different cases. Let there be m equations that we want to estimate, which we can write as
$$y_1 = X_1\beta_1 + \varepsilon_1$$
$$y_2 = X_2\beta_2 + \varepsilon_2$$
$$\vdots$$
$$y_m = X_m\beta_m + \varepsilon_m. \tag{3.1}$$
There are m equations and n observations to estimate each equation in (3.1). Each equation in this system has its own set of $K_i$ regressors, for $i = 1, \dots, m$, so the coefficient vector of each equation also has its own size ($K_i$). For the seemingly unrelated regressions (SUR) model, we assume strict exogeneity, i.e. $E[\varepsilon_i \mid X_1, X_2, \dots, X_m] = 0$ for $i = 1, \dots, m$, and homoskedasticity within each equation, i.e. $E[\varepsilon_i\varepsilon_i' \mid X_1, X_2, \dots, X_m] = \sigma_{ii} I_n$. The errors, although uncorrelated across observations by the assumption of homoskedasticity within each equation, are correlated across equations. This means that for observations t and s in equations i and j, respectively, $E[\varepsilon_{ti}\varepsilon_{sj} \mid X_1, \dots, X_m] = \sigma_{ij}$ if $t = s$ and $0$ otherwise.
Let the $m \times m$ covariance matrix of the disturbances for the $t$th observation be
$$\Omega = \begin{bmatrix} \sigma_{11} & \sigma_{12} & \cdots & \sigma_{1m} \\ \sigma_{21} & \sigma_{22} & \cdots & \sigma_{2m} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_{m1} & \sigma_{m2} & \cdots & \sigma_{mm} \end{bmatrix}. \tag{3.2}$$
Since we do not know $\Sigma$, we need to estimate it and use the FGLS estimator. Remember that $\Sigma = \Omega \otimes I_n$, so in reality all we need is an estimate of $\Omega$. To get such an estimate, all we need to do is run OLS equation by equation and use the vectors of residuals of each equation, $\hat\varepsilon_i$, to estimate each element of $\Omega$ as
$$\hat\sigma_{ij} = \frac{\hat\varepsilon_i'\hat\varepsilon_j}{n}. \tag{3.7}$$
Footnote 1: The Kronecker product between matrices $A$ and $B$, denoted $A \otimes B$, is defined such that each element in matrix $A$ is multiplied by the matrix $B$.
Footnote 2: The rule is that $(A \otimes B)^{-1} = A^{-1} \otimes B^{-1}$, so since $I_n^{-1} = I_n$, $\Sigma^{-1} = \Omega^{-1} \otimes I_n$.
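As a quick sanity check of these Kronecker-product rules, here is a small numpy sketch; the matrix entries and sizes are made up purely for illustration:

```python
import numpy as np

# Check that (A ⊗ B)^{-1} = A^{-1} ⊗ B^{-1}, so Sigma^{-1} = Omega^{-1} ⊗ I_n.
Omega = np.array([[2.0, 0.5],
                  [0.5, 1.0]])      # a 2x2 cross-equation covariance (illustrative)
I_n = np.eye(4)                     # pretend n = 4 observations

Sigma = np.kron(Omega, I_n)
lhs = np.linalg.inv(Sigma)
rhs = np.kron(np.linalg.inv(Omega), I_n)
print(np.allclose(lhs, rhs))        # True
```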
Equation (3.7) does not correct for the degrees of freedom lost because of the number
of regressors used. Notice that equations i and j have Ki and Kj regressors, respectively,
where Ki does not necessarily equal Kj . Two possibilities that are unbiased when i = j
are
$$\hat\sigma_{ij}^{*} = \frac{\hat\varepsilon_i'\hat\varepsilon_j}{\left[(n - K_i)(n - K_j)\right]^{1/2}} \tag{3.8a}$$
and
$$\hat\sigma_{ij}^{**} = \frac{\hat\varepsilon_i'\hat\varepsilon_j}{n - \max(K_i, K_j)}. \tag{3.8b}$$
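A small sketch of the three residual-based estimates of $\sigma_{ij}$ just discussed; the function name and arguments are hypothetical, and the inputs are assumed to be the OLS residual vectors of equations i and j:

```python
import numpy as np

def sigma_hat(e_i, e_j, K_i, K_j, correction=None):
    """Estimate sigma_ij from OLS residuals e_i, e_j (length-n vectors).

    correction=None   -> equation (3.7): divide by n
    correction="geom" -> equation (3.8a): divide by sqrt((n - K_i)(n - K_j))
    correction="max"  -> equation (3.8b): divide by n - max(K_i, K_j)
    """
    n = len(e_i)
    num = e_i @ e_j
    if correction == "geom":
        return num / np.sqrt((n - K_i) * (n - K_j))
    if correction == "max":
        return num / (n - max(K_i, K_j))
    return num / n
```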
Notice that FGLS does not require that $\hat\Sigma$ be unbiased, only consistent, which would be the case if we just used the estimates in equation (3.7). Using those estimates, then
$$\hat\Sigma = \hat\Omega \otimes I_n = \begin{bmatrix} \hat\sigma_{11} & \hat\sigma_{12} & \cdots & \hat\sigma_{1m} \\ \hat\sigma_{21} & \hat\sigma_{22} & \cdots & \hat\sigma_{2m} \\ \vdots & \vdots & \ddots & \vdots \\ \hat\sigma_{m1} & \hat\sigma_{m2} & \cdots & \hat\sigma_{mm} \end{bmatrix} \otimes I_n, \tag{3.9}$$
so the FGLS estimator takes the same form as the GLS estimator in equation (3.5), with $\hat\Sigma$ in place of $\Sigma$. For the maximum likelihood estimator, let $\sigma^{ij}$ be the $ij$th element of $\Omega^{-1}$. The first-order condition of the log-likelihood with respect to $\sigma^{ij}$ is
$$\frac{\partial \ln L}{\partial \sigma^{ij}} = 0 \;\Rightarrow\; \frac{n}{2}\sigma_{ij} - \frac{1}{2}\left(y_i - X_i\beta_i\right)'\left(y_j - X_j\beta_j\right) = 0,$$
so that $n\sigma_{ij} = \varepsilon_i'\varepsilon_j$.
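The following is a minimal numpy sketch of the (non-iterated) FGLS estimator for the SUR system, assuming the data are held as a list of response vectors `ys` and a matching list of regressor matrices `Xs`; these names and the data layout are placeholders, not from the text:

```python
import numpy as np

def sur_fgls(ys, Xs):
    """One-step FGLS for a SUR system with m equations and n observations each."""
    m, n = len(ys), len(ys[0])
    # Step 1: OLS equation by equation, keep the residuals.
    resid = []
    for y, X in zip(ys, Xs):
        b = np.linalg.lstsq(X, y, rcond=None)[0]
        resid.append(y - X @ b)
    E = np.column_stack(resid)                     # n x m matrix of residuals
    Omega_hat = (E.T @ E) / n                      # sigma_ij estimates as in (3.7)
    # Step 2: GLS on the stacked system with Sigma_hat = Omega_hat ⊗ I_n.
    X_big = np.zeros((m * n, sum(X.shape[1] for X in Xs)))
    col = 0
    for i, X in enumerate(Xs):
        X_big[i * n:(i + 1) * n, col:col + X.shape[1]] = X
        col += X.shape[1]
    y_big = np.concatenate(ys)
    Sigma_inv = np.kron(np.linalg.inv(Omega_hat), np.eye(n))
    XtSi = X_big.T @ Sigma_inv
    beta = np.linalg.solve(XtSi @ X_big, XtSi @ y_big)
    V = np.linalg.inv(XtSi @ X_big)                # estimated covariance of beta
    return beta, V, Omega_hat
```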
Iterated FGLS
Maybe we should have covered this in the previous chapter together with FGLS and heteroskedasticity; what I present now is also valid for iterated FGLS in that case. We have just mentioned that iterated FGLS converges to MLE, and this may be confusing because when deriving both estimators we came up with the same mathematical expression for both the coefficients and the covariances of the errors. The issue at hand is that, when not iterated, FGLS uses OLS residuals to estimate the $\hat\sigma_{ij}$. Even though they are consistent estimators, MLE provides better estimates of the covariances of the errors. However, iterated FGLS may converge to MLE, so let us explain how iterating FGLS works. It follows these steps (a numerical sketch appears after the list):
1. estimate each equation by OLS and obtain the residuals $\hat\varepsilon_i$;
2. use the residuals to compute the $\hat\sigma_{ij}$ as in equation (3.7);
3. use the $\hat\sigma_{ij}$ to form $\hat\Omega$ and estimate the FGLS model;
4. use the FGLS residuals to recompute the $\hat\sigma_{ij}$ and repeat steps 3 and 4 until the estimates converge.
Footnote 3: Let $A$ be an $m \times m$ matrix and $B$ an $n \times n$ matrix. Then $|A \otimes B| = |A|^n |B|^m$, and $(A \otimes B)^{-1} = A^{-1} \otimes B^{-1}$.
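A sketch of the iteration, under the same assumed data layout as above (a list `ys` of response vectors and a list `Xs` of regressor matrices); `gls_step` is a hypothetical helper that performs one GLS pass for a given $\Omega$:

```python
import numpy as np

def gls_step(ys, Xs, Omega):
    """One GLS pass on the stacked SUR system for a given Omega; returns the new
    coefficient vector and the Omega estimate implied by the resulting residuals."""
    m, n = len(ys), len(ys[0])
    X_big = np.zeros((m * n, sum(X.shape[1] for X in Xs)))
    col = 0
    for i, X in enumerate(Xs):
        X_big[i * n:(i + 1) * n, col:col + X.shape[1]] = X
        col += X.shape[1]
    y_big = np.concatenate(ys)
    Si = np.kron(np.linalg.inv(Omega), np.eye(n))
    beta = np.linalg.solve(X_big.T @ Si @ X_big, X_big.T @ Si @ y_big)
    resid = (y_big - X_big @ beta).reshape(m, n)   # row i = residuals of equation i
    return beta, (resid @ resid.T) / n

def iterated_fgls(ys, Xs, tol=1e-8, max_iter=100):
    # Steps 1-2: with Omega = I the GLS pass is just OLS equation by equation,
    # and its residuals give the first estimate of Omega.
    beta, Omega = gls_step(ys, Xs, np.eye(len(ys)))
    for _ in range(max_iter):                      # steps 3-4: FGLS, then update Omega
        beta, Omega_new = gls_step(ys, Xs, Omega)
        if np.max(np.abs(Omega_new - Omega)) < tol:
            return beta, Omega_new
        Omega = Omega_new
    return beta, Omega
```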
A particular case arises when we are using the same K variables to estimate each of the
m equations. In that case X1 = X2 = · · · = Xm = Z, so that
$$X = \begin{bmatrix} Z & 0 & \cdots & 0 \\ 0 & Z & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & Z \end{bmatrix} = I_m \otimes Z. \tag{3.14}$$
$$\hat\beta_{GLS} = \begin{bmatrix} (Z'Z)^{-1}Z' & 0 & \cdots & 0 \\ 0 & (Z'Z)^{-1}Z' & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & (Z'Z)^{-1}Z' \end{bmatrix} y = \begin{bmatrix} (Z'Z)^{-1}Z'y_1 \\ (Z'Z)^{-1}Z'y_2 \\ \vdots \\ (Z'Z)^{-1}Z'y_m \end{bmatrix} = \begin{bmatrix} \hat\beta_1 \\ \hat\beta_2 \\ \vdots \\ \hat\beta_m \end{bmatrix}.$$
Notice that even though the efficient estimates of the coefficients are the same as those obtained via OLS equation by equation, the covariance matrix of the coefficient estimates still has to account for the correlation across equations. Equation (3.6) simplifies to
$$V\!\left[\hat\beta_{GLS} \mid Z\right] = \Omega \otimes (Z'Z)^{-1}. \tag{3.16}$$
Footnote 4: Some additional rules of Kronecker products are useful here: $(A \otimes B)' = A' \otimes B'$ and $(A \otimes B)(C \otimes D) = AC \otimes BD$.
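A quick simulated check (illustrative numbers only) that with identical regressors the stacked GLS estimates coincide with equation-by-equation OLS; under (3.16) the SUR structure then matters only for the covariance matrix:

```python
import numpy as np

rng = np.random.default_rng(1)
n, K, m = 200, 3, 2
Z = rng.normal(size=(n, K))                        # same regressors in both equations
Omega = np.array([[1.0, 0.6],
                  [0.6, 1.5]])                     # cross-equation error covariance
E = rng.multivariate_normal(np.zeros(m), Omega, size=n)
Y = np.column_stack([Z @ rng.normal(size=K) + E[:, j] for j in range(m)])

# OLS equation by equation: (Z'Z)^{-1} Z'y_j for each j.
B_ols = np.linalg.solve(Z.T @ Z, Z.T @ Y)

# Stacked GLS with X = I_m ⊗ Z and Sigma = Omega ⊗ I_n.
X = np.kron(np.eye(m), Z)
y = Y.T.reshape(-1)                                # stack y_1, then y_2
Si = np.kron(np.linalg.inv(Omega), np.eye(n))
b_gls = np.linalg.solve(X.T @ Si @ X, X.T @ Si @ y)

print(np.allclose(b_gls, B_ols.T.reshape(-1)))     # True: identical point estimates
```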
We now consider how to set up tests of hypotheses (restrictions) about the population coefficients. We consider two types of tests: one based on an F (Wald-type) statistic, and another based on the likelihood ratio. As usual, whether you use one or the other depends on whether you want to estimate the restricted model or not. The Wald test is based on statistics calculated from the results of the unrestricted model, while the likelihood ratio test is based on statistics from the estimation of both the unrestricted and restricted models.
Wald Test
The above statistic needs $\Sigma$ for its estimation, which is unknown. Using the FGLS $\hat\Sigma$ from equation (3.9), and since the denominator converges to one, in large samples the statistic will behave the same as
$$\hat F = \frac{1}{J}\left(R\hat\beta_{FGLS} - q\right)'\left[R\,\hat V\!\left[\hat\beta_{FGLS} \mid X\right]R'\right]^{-1}\left(R\hat\beta_{FGLS} - q\right). \tag{3.18}$$
You can compare this to a critical F statistic with J degrees of freedom in the numerator and mn − K degrees of freedom in the denominator (see footnote 5). Because we are using $\hat\Sigma$, even with normally distributed errors, the F distribution is only valid approximately. In general, the statistic $F[J, k]$ converges to $J^{-1}\chi^2(J)$ as $k \to \infty$. So
$$J\hat F = \left(R\hat\beta_{FGLS} - q\right)'\left[R\,\hat V\!\left[\hat\beta_{FGLS} \mid X\right]R'\right]^{-1}\left(R\hat\beta_{FGLS} - q\right) \tag{3.19}$$
follows a $\chi^2(J)$ distribution. This is basically a Wald statistic that measures the distance between $R\hat\beta_{FGLS}$ and $q$. Both statistics are valid asymptotically, but (3.18) may perform better in small samples.
Footnote 5: $K$ here represents the total number of coefficients estimated in the system of equations.
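A minimal sketch of the statistic in (3.19), taking as given the FGLS estimates and their estimated covariance matrix; the argument names are placeholders:

```python
import numpy as np
from scipy import stats

def wald_test(R, q, beta_hat, V_hat):
    """Wald statistic for H0: R beta = q; asymptotically chi-squared with J dof."""
    d = R @ beta_hat - q
    W = d @ np.linalg.solve(R @ V_hat @ R.T, d)    # (Rb - q)'[R V R']^{-1}(Rb - q)
    J = R.shape[0]
    return W, J, stats.chi2.sf(W, J)               # statistic, dof, p-value
```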
Likelihood Ratio Test
The general procedure for a likelihood ratio test is to run both the unrestricted and restricted models using maximum likelihood, calculate the likelihood of each of the two, and form a test statistic based on the ratio of the likelihood of the restricted model to the likelihood of the unrestricted model. You can estimate the SUR model by maximum likelihood in any econometric software that maximizes the log-likelihood function in (3.12), under the assumption that the errors are normally distributed. Alternatively, you could run iterated FGLS on each model until convergence, but there is no guarantee that either estimation will actually converge.
In either case, the estimates of the covariance matrix in equation (3.2) are formed with the residuals of the corresponding model. Let the restricted model be identified by R and the unrestricted model be identified by U. The likelihood ratio statistic is then
$$\lambda_{LR} = n \ln\frac{\left|\hat\Omega_R\right|}{\left|\hat\Omega_U\right|} = n\left(\ln\left|\hat\Omega_R\right| - \ln\left|\hat\Omega_U\right|\right). \tag{3.20}$$
Test of Specification
Under the assumption of normality of the errors, we can perform a test of whether the seemingly unrelated equations estimation is appropriate or not. We would like to test the null hypothesis that OLS equation by equation is appropriate against the alternative that the seemingly unrelated equations model is the correct specification.
If OLS equation by equation holds, the covariance matrix for an observation would be a diagonal matrix with the variances of the disturbances of the respective equations on the diagonal. Let $\hat\Omega_O$ represent the estimate of this covariance matrix,
then
$$\hat\Omega_O = \begin{bmatrix} \hat\sigma_{11} & 0 & \cdots & 0 \\ 0 & \hat\sigma_{22} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \hat\sigma_{mm} \end{bmatrix}, \tag{3.21}$$
so its determinant equals the product of the diagonal elements of the matrix, and its log-determinant thus equals
$$\ln\left|\hat\Omega_O\right| = \sum_{i=1}^{m} \ln\left(\hat\varepsilon_i'\hat\varepsilon_i / n\right). \tag{3.22}$$
That is, the log-determinant is the sum of the natural logarithms of the OLS error-variance estimates of each equation.
The OLS equation by equation estimation is the restricted model. This is because we are restricting $\sigma_{ij}$ to be zero for every $i \neq j$ in equation (3.2). The unrestricted model is thus the SUR model. Letting U represent this model, the likelihood ratio test statistic for the specification test is
$$\lambda_{LR} = n\left(\ln\left|\hat\Omega_O\right| - \ln\left|\hat\Omega_U\right|\right) = n\left[\sum_{i=1}^{m}\ln\left(\hat\varepsilon_i'\hat\varepsilon_i/n\right) - \ln\left|\hat\Omega_U\right|\right], \tag{3.23}$$
and $\lambda_{LR} \xrightarrow{a} \chi^2\left[m(m-1)/2\right]$ (see footnote 6).
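A sketch of the specification test in (3.22)-(3.23). It assumes you already have the n x m matrices of OLS residuals (restricted model) and SUR/ML residuals (unrestricted model); the variable names are placeholders:

```python
import numpy as np
from scipy import stats

def lr_diagonality_test(E_ols, E_sur):
    """LR test of H0: Omega is diagonal (OLS equation by equation is adequate)."""
    n, m = E_ols.shape
    ln_det_O = np.sum(np.log(np.sum(E_ols**2, axis=0) / n))    # equation (3.22)
    _, ln_det_U = np.linalg.slogdet((E_sur.T @ E_sur) / n)     # ln|Omega_U|
    lam = n * (ln_det_O - ln_det_U)                            # equation (3.23)
    dof = m * (m - 1) // 2
    return lam, dof, stats.chi2.sf(lam, dof)
```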
3.3 Singular Systems
Many applications of the multivariate regression model have been in the context of systems of demand equations, either commodity demands or factor demands in studies of production. In principle each is basically an application of the SUR model we have just covered, but with additional constraints or characteristics that need to be accounted for. For example, the systems are generally constrained across equations, and in many of these models the covariance matrix of the disturbances $\Omega$ is singular. We consider two cases: the Cobb-Douglas and the translog cost functions.
Footnote 6: The only unrestricted parameters under OLS equation by equation in equation (3.2) are the diagonal elements of the matrix. There are $m$ diagonal elements, and thus $m \times m - m = m(m-1)$ off-diagonal elements. Since $\Omega$ is a symmetric matrix, i.e. $\sigma_{ij} = \sigma_{ji}$, there are thus $m(m-1)/2$ restrictions.
Profit maximization with an exogenously determined output price calls for the firm to
maximize output for a given cost level of C (or minimize costs for a given output Q).
The maximization problem has the following Lagrangean
$$\Lambda = \alpha_0 \prod_{i=1}^{M} x_i^{\alpha_i} + \lambda\left(C - p'x\right), \tag{3.25}$$
where $p$ is the vector of $M$ factor prices. The first-order conditions thus are
$$\frac{\partial \Lambda}{\partial x_i} = 0 \;\Rightarrow\; \frac{\alpha_i Q}{x_i} - \lambda p_i = 0 \;\Rightarrow\; p_i x_i = \frac{\alpha_i Q}{\lambda},$$
and
$$\frac{\partial \Lambda}{\partial \lambda} = 0 \;\Rightarrow\; C - p'x = 0 \;\Rightarrow\; C = p'x = \sum_{i=1}^{M} p_i x_i.$$
Solving this system of equations yields the factor demands, $x_i^*(Q, p)$, and $\lambda^*(Q, p)$. The total cost of production is
$$\sum_{i=1}^{M} p_i x_i^* = \sum_{i=1}^{M} \frac{\alpha_i Q}{\lambda^*},$$
which means that the cost share allocated to the $i$th factor is
$$s_i = \frac{p_i x_i^*}{\sum_{i=1}^{M} p_i x_i^*} = \frac{\alpha_i}{\sum_{i=1}^{M} \alpha_i} = \beta_i. \tag{3.26}$$
By construction, $\sum_{i=1}^{M} \beta_i = 1$ and $\sum_{i=1}^{M} s_i = 1$. The cost shares will also add up to 1 in the data, which implies that $\sum_{i=1}^{M} \varepsilon_i = 0$ at every data point (see footnote 7), making the system of equations singular. To further understand this point, disregard equation (3.27) and concentrate on the system of share equations represented by (3.28). Let $\varepsilon = [\varepsilon_1, \varepsilon_2, \cdots, \varepsilon_M]'$. Letting $i$ be a column of ones, what we have discussed means that $\varepsilon'i = \sum_{i=1}^{M}\varepsilon_i = 0$. This implies that $E[\varepsilon\varepsilon'i] = \Omega i = 0$, which means that $\Omega$ is singular, and thus non-invertible, so we cannot apply SUR to estimate the system of equations presented here.
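A short numerical illustration of the point: if the disturbances of the M share equations sum to zero at every observation, their estimated covariance matrix has rank M − 1 and cannot be inverted (the numbers below are simulated purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(2)
n, M = 500, 3
E = rng.normal(size=(n, M - 1))
E = np.column_stack([E, -E.sum(axis=1)])    # impose that the errors sum to 0 in each row
Omega_hat = (E.T @ E) / n
print(np.linalg.matrix_rank(Omega_hat))     # M - 1 = 2: Omega_hat is singular
```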
The solution is to drop one of the equations and impose the adding-up restriction $\beta_M = 1 - \beta_1 - \cdots - \beta_{M-1}$ on the cost function:
$$\ln C = \beta_0 + \beta_q \ln Q + \sum_{i=1}^{M-1} \beta_i \ln p_i + (1 - \beta_1 - \cdots - \beta_{M-1}) \ln p_M + \varepsilon_c;$$
$$\ln C - \ln p_M = \beta_0 + \beta_q \ln Q + \sum_{i=1}^{M-1} \beta_i (\ln p_i - \ln p_M) + \varepsilon_c;$$
$$\ln\frac{C}{p_M} = \beta_0 + \beta_q \ln Q + \sum_{i=1}^{M-1} \beta_i \ln\frac{p_i}{p_M} + \varepsilon_c.$$
Dropping the $M$th share equation, the system to estimate is
$$\ln\frac{C}{p_M} = \beta_0 + \beta_q \ln Q + \sum_{i=1}^{M-1} \beta_i \ln\frac{p_i}{p_M} + \varepsilon_c, \tag{3.29}$$
$$s_i = \beta_i + \varepsilon_i, \quad i = 1, \cdots, M-1. \tag{3.30}$$
Footnote 7: For example, for every firm we observe.
Specifying output with a Cobb-Douglas functional form restricts the elasticity of substitution between the factors to be equal to 1. Let output be, for now, defined by a general function $Q = f(x)$, where $x$ is the vector of factors. The solution of the cost minimization problem for a given output level $Q$ will give the factor demands $x_i^*(Q, p)$, where $p$ is the vector of factor prices, for $i = 1, \cdots, M$. The cost function would thus be
$$C(Q, p) = \sum_{i=1}^{M} p_i x_i^*(Q, p). \tag{3.31}$$
If we can assume that there are constant returns to scale, then $C(Q, p)/Q = c(p)$, where $c(p)$ is the per-unit (average) cost. We get the cost-minimizing factor demands by applying Shephard's lemma:
$$x_i^*(Q, p) = \frac{\partial C(Q, p)}{\partial p_i} = \frac{Q\,\partial c(p)}{\partial p_i}. \tag{3.32}$$
If instead we differentiate the cost function logarithmically, we obtain the cost-minimizing factor cost shares
$$s_i = \frac{\partial \ln C(Q, p)}{\partial \ln p_i} = \frac{p_i x_i}{C(Q, p)}. \tag{3.33}$$
With constant returns to scale $C(Q, p) = Q\,c(p)$, so $\ln C(Q, p) = \ln Q + \ln c(p)$, and
$$s_i = \frac{\partial \ln c(p)}{\partial \ln p_i}. \tag{3.34}$$
The purpose of many empirical studies is to determine the elasticities of factor substitution, $\theta_{ij}$, and the own-price elasticities of factor demand, $\eta_{ii}$. The elasticities of substitution are given by
$$\theta_{ij} = \frac{c\,\dfrac{\partial^2 c}{\partial p_i \partial p_j}}{\dfrac{\partial c}{\partial p_i}\dfrac{\partial c}{\partial p_j}}, \tag{3.35a}$$
and (3.35b) gives the corresponding own-price elasticities. So by suitably specifying the cost function and the cost shares we have an $M$ or $M+1$ equation model that we can use to estimate the quantities in equations (3.35a) and (3.35b).
Let $\beta_i$, for $i = 1, \cdots, M$, represent the first derivatives, and $\delta_{ij}$, for $i = 1, \cdots, M$ and $j = 1, \cdots, M$, represent the second derivatives. The function can be expressed as
$$\ln c(p) = \beta_0 + \sum_{i=1}^{M} \beta_i \ln p_i + \frac{1}{2}\sum_{i=1}^{M}\sum_{j=1}^{M} \delta_{ij} \ln p_i \ln p_j. \tag{3.37}$$
Converting from unit cost to total cost we have the translog cost function
$$\ln C = \beta_0 + \beta_Q \ln Q + \frac{1}{2}\delta_{QQ}(\ln Q)^2 + \sum_{i=1}^{M}\beta_i \ln p_i + \frac{1}{2}\sum_{i=1}^{M}\sum_{j=1}^{M}\delta_{ij}\ln p_i \ln p_j + \sum_{i=1}^{M}\delta_{Qi}\ln Q \ln p_i. \tag{3.38}$$
If all $\delta$ coefficients are zero, we would have the Cobb-Douglas cost function in equation (3.27). Applying Shephard's lemma, we get the cost shares
$$\frac{\partial \ln C}{\partial \ln p_i} = S_i = \beta_i + \delta_{Qi}\ln Q + \delta_{ii}\ln p_i + \sum_{j\neq i}\delta_{ij}\ln p_j. \tag{3.39}$$
Linear homogeneity of the cost function in prices requires
$$\sum_{i=1}^{M}\beta_i = 1, \tag{3.40a}$$
$$\sum_{i=1}^{M}\delta_{Qi} = 0, \tag{3.40b}$$
$$\sum_{i=1}^{M}\delta_{ij} = \sum_{j=1}^{M}\delta_{ij} = \sum_{i=1}^{M}\sum_{j=1}^{M}\delta_{ij} = 0. \tag{3.40c}$$
In addition, it should be clear from equation (3.36) that we also need to impose the constraint $\delta_{ij} = \delta_{ji}$, since $\partial^2 \ln c/(\partial \ln p_i\,\partial \ln p_j) = \partial^2 \ln c/(\partial \ln p_j\,\partial \ln p_i)$.
To estimate the parameters we can use SUR estimation, imposing the restrictions in equations (3.40a) to (3.40c) and the symmetry restriction $\delta_{ij} = \delta_{ji}$, and solving the problem of singularity of the disturbance covariance matrix of the share equations by dropping one of the equations and adapting the translog cost function accordingly. Estimation must now be done via maximum likelihood (iterated FGLS) to ensure invariance of the parameter estimates with respect to the choice of the omitted share equation.
For the translog cost function we can see if there are scale economies or not with a simple measure. Let SCE be
$$SCE = 1 - \frac{\partial \ln C}{\partial \ln Q}. \tag{3.41}$$
For positive SCE numbers there are economies of scale, for negative numbers there are diseconomies of scale, and when SCE = 0 we have constant returns to scale. Furthermore, the translog cost function has easy ways to estimate the elasticities of factor substitution:
$$\theta_{ij} = \frac{\delta_{ij} + s_i s_j}{s_i s_j}, \tag{3.42a}$$
$$\theta_{ii} = \frac{\delta_{ii} + s_i(s_i - 1)}{s_i^2}. \tag{3.42b}$$
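A small sketch that turns estimated translog coefficients into the elasticities in (3.42a)-(3.42b); `delta` is assumed to be the symmetric M x M matrix of estimated δ's and `s` the vector of fitted cost shares at the evaluation point, both placeholders for your own estimates:

```python
import numpy as np

def substitution_elasticities(delta, s):
    """Elasticities of factor substitution from translog estimates, (3.42a)-(3.42b)."""
    M = len(s)
    theta = np.empty((M, M))
    for i in range(M):
        for j in range(M):
            if i == j:
                theta[i, i] = (delta[i, i] + s[i] * (s[i] - 1)) / s[i] ** 2
            else:
                theta[i, j] = (delta[i, j] + s[i] * s[j]) / (s[i] * s[j])
    return theta
```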
The assignment for Seemingly Unrelated Regression is based on Christensen and Greene (1976). The idea is to estimate the same models that they estimate there, test the different restrictions of the models, and test whether there are economies of scale present in the industry with the results of each estimated model. Their study uses a translog cost function system to analyze the electrical power sector. They consider three factors (inputs): labor, L, capital, K, and fuel, F. Using the factor letters as subindexes for the different parameters, the translog cost function can be written as
$$\ln C = \beta_0 + \beta_Q \ln Q + \delta_{QQ}\frac{(\ln Q)^2}{2} + \beta_L \ln p_L + \beta_K \ln p_K + \beta_F \ln p_F + \delta_{LL}\frac{(\ln p_L)^2}{2} + \delta_{KK}\frac{(\ln p_K)^2}{2} + \delta_{FF}\frac{(\ln p_F)^2}{2} + \delta_{QL}\ln Q\ln p_L + \delta_{QK}\ln Q\ln p_K + \delta_{QF}\ln Q\ln p_F + \delta_{LK}\ln p_L\ln p_K + \delta_{LF}\ln p_L\ln p_F + \delta_{KF}\ln p_K\ln p_F + \varepsilon_C. \tag{3.43}$$
$$\beta_L + \beta_K + \beta_F = 1. \tag{3.45a}$$
Dropping the fuel share equation and using the restrictions above to adjust the model accordingly, the model then is
$$\ln\frac{C}{p_F} = \beta_0 + \beta_Q \ln Q + \delta_{QQ}\frac{(\ln Q)^2}{2} + \beta_L \ln\frac{p_L}{p_F} + \beta_K \ln\frac{p_K}{p_F} + \delta_{LL}\frac{\left[\ln(p_L/p_F)\right]^2}{2} + \delta_{KK}\frac{\left[\ln(p_K/p_F)\right]^2}{2} + \delta_{QL}\ln Q\ln\frac{p_L}{p_F} + \delta_{QK}\ln Q\ln\frac{p_K}{p_F} + \delta_{LK}\ln\frac{p_L}{p_F}\ln\frac{p_K}{p_F} + \varepsilon_C, \tag{3.46}$$
$$s_L = \beta_L + \delta_{LL}\ln\frac{p_L}{p_F} + \delta_{QL}\ln Q + \delta_{LK}\ln\frac{p_K}{p_F} + \varepsilon_L, \tag{3.47a}$$
$$s_K = \beta_K + \delta_{KK}\ln\frac{p_K}{p_F} + \delta_{QK}\ln Q + \delta_{LK}\ln\frac{p_L}{p_F} + \varepsilon_K. \tag{3.47b}$$
This is Model A in the paper. To estimate this model you use SUR, but you need to impose the cross-equation restrictions to ensure that the coefficients that are supposed to be the same in the different equations are actually the same, e.g. $\delta_{LK}$ has to be the same in all three equations.
The assignment is to estimate the models (A through F) and to test whether the restrictions in models B, C, D, E, and F hold by using likelihood ratio tests. You are also responsible for proving in your write-up that, by removing $s_F$ from the model and using the restrictions, the modified model is the one expressed in equations (3.46) to (3.47b).
3.5 Simultaneous Equations Models
We now consider another case where several equations are better estimated as a system of equations. Simultaneous equations models (SEM) are developed because many times
when applying economic theory we have models that rely on several equations to be
solved together. For example, a simple market equilibrium needs three equations to be
solved together to get the equilibrium price and quantity: the market demand equation,
the market supply equation, and the market clearing condition.
These models will have variables that will be solved for in the model, the endogenous
variables, and variables that are needed to estimate the endogenous variables but that
the model does not need to solve for, i.e. the exogenous variables. If we express the
model as derived from economic theory, the model will be in its structural form. As we
transform the model to express the endogenous variables in terms of the exogenous ones,
we then have the reduced form equations.
To understand this concept consider the following structural equations for a market
equilibrium,
$$q_d = \alpha_0 + \alpha_1 p + \alpha_2 z + \varepsilon_d \quad \text{(demand)}, \tag{3.48a}$$
$$q_s = \beta_0 + \beta_1 p + \beta_2 z + \varepsilon_s \quad \text{(supply)}, \tag{3.48b}$$
$$q_d = q_s = q \quad \text{(equilibrium)}, \tag{3.48c}$$
where q is the quantity of the good in the market, p is the price, and z is the price of a
related good that we are assuming affects both the supply and the demand (an example
could be oil that is needed for the production of electricity and that also affects the
consumption of other goods in different ways). This model solves for both price and
quantity together, so the only exogenous variable in this model is z. Solving for q and
p in terms of z we get the reduced form of the model
$$q = \frac{\alpha_1\beta_0 - \alpha_0\beta_1}{\alpha_1 - \beta_1} + \frac{\alpha_1\beta_2 - \alpha_2\beta_1}{\alpha_1 - \beta_1}z + \frac{\alpha_1\varepsilon_s - \beta_1\varepsilon_d}{\alpha_1 - \beta_1} = \pi_{11} + \pi_{21}z + \nu_q, \tag{3.49a}$$
$$p = \frac{\beta_0 - \alpha_0}{\alpha_1 - \beta_1} + \frac{\beta_2 - \alpha_2}{\alpha_1 - \beta_1}z + \frac{\varepsilon_s - \varepsilon_d}{\alpha_1 - \beta_1} = \pi_{12} + \pi_{22}z + \nu_p. \tag{3.49b}$$
Our purpose with these models is to estimate the reduced form of the model and then calculate the estimates of the structural equations from the estimates of the parameters of the reduced-form equations. This poses a problem, since the number of parameters in the reduced-form equations is usually smaller than the number of parameters in the structural equations. Notice that there are only 4 parameters to estimate in equations (3.49a) and (3.49b), but there are 6 parameters in equations (3.48a) and (3.48b), so it is impossible to identify the structural parameters.
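A quick numerical check of the mapping from structural to reduced-form parameters in (3.49a)-(3.49b), using made-up structural values:

```python
# Illustrative structural parameters (made up): demand q = a0 + a1*p + a2*z + e_d,
# supply q = b0 + b1*p + b2*z + e_s, with a1 < 0 < b1.
a0, a1, a2 = 10.0, -1.2, 0.5
b0, b1, b2 = 2.0, 0.8, -0.3

pi11 = (a1 * b0 - a0 * b1) / (a1 - b1)   # intercept of the q equation
pi21 = (a1 * b2 - a2 * b1) / (a1 - b1)   # slope on z in the q equation
pi12 = (b0 - a0) / (a1 - b1)             # intercept of the p equation
pi22 = (b2 - a2) / (a1 - b1)             # slope on z in the p equation

# Four reduced-form parameters, six structural ones: the structural model cannot
# be recovered from (pi11, pi21, pi12, pi22) alone.
print(pi11, pi21, pi12, pi22)
```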
In this section we consider a general setup for these models, how to estimate the
coefficients of the reduced form equations, and the identification issues of the structural
parameters.
Let y represent the endogenous variables, x represent the exogenous ones, and i represent
an observation with i = 1, 2, · · · , n. The general structural form would be
$$\gamma_{11}y_{i1} + \gamma_{21}y_{i2} + \cdots + \gamma_{M1}y_{iM} + \beta_{11}x_{i1} + \beta_{21}x_{i2} + \cdots + \beta_{K1}x_{iK} = \varepsilon_{i1},$$
$$\gamma_{12}y_{i1} + \gamma_{22}y_{i2} + \cdots + \gamma_{M2}y_{iM} + \beta_{12}x_{i1} + \beta_{22}x_{i2} + \cdots + \beta_{K2}x_{iK} = \varepsilon_{i2},$$
$$\vdots$$
$$\gamma_{1M}y_{i1} + \gamma_{2M}y_{i2} + \cdots + \gamma_{MM}y_{iM} + \beta_{1M}x_{i1} + \beta_{2M}x_{i2} + \cdots + \beta_{KM}x_{iK} = \varepsilon_{iM}.$$
Since this is a linear system of equations, in order to be able to find the solutions for the
M endogenous variables, there must be M equations. This makes the system complete.
We can write the system in matrix notation
$$\begin{bmatrix} y_1 & y_2 & \cdots & y_M \end{bmatrix}_i \begin{bmatrix} \gamma_{11} & \gamma_{12} & \cdots & \gamma_{1M} \\ \gamma_{21} & \gamma_{22} & \cdots & \gamma_{2M} \\ \vdots & \vdots & \ddots & \vdots \\ \gamma_{M1} & \gamma_{M2} & \cdots & \gamma_{MM} \end{bmatrix} + \begin{bmatrix} x_1 & x_2 & \cdots & x_K \end{bmatrix}_i \begin{bmatrix} \beta_{11} & \beta_{12} & \cdots & \beta_{1M} \\ \beta_{21} & \beta_{22} & \cdots & \beta_{2M} \\ \vdots & \vdots & \ddots & \vdots \\ \beta_{K1} & \beta_{K2} & \cdots & \beta_{KM} \end{bmatrix} = \begin{bmatrix} \varepsilon_1 & \varepsilon_2 & \cdots & \varepsilon_M \end{bmatrix}_i,$$
or, more compactly,
$$y_i'\Gamma + x_i'B = \varepsilon_i'. \tag{3.50}$$
Looking at the matrices of the parameters, Γ and B, we can see that each column
is the vector of the coefficients of a particular equation, and each row is the vector of
the coefficients for a given variable across equations. Now, to express the endogenous
variables in terms of the exogenous variables, we transform equation (3.50) to get the reduced form
$$y_i' = -x_i'B\Gamma^{-1} + \varepsilon_i'\Gamma^{-1} = x_i'\Pi + \nu_i'. \tag{3.51}$$
For this solution to exist, the model must satisfy the completeness condition for
simultaneous equations systems: Γ must be nonsingular.
The structural disturbances are assumed to be uncorrelated across observations, $E[\varepsilon_i\varepsilon_j' \mid x_i, x_j] = 0$ for $i \neq j$, and the covariance matrix of the structural disturbances, $\Sigma$, is related to the covariance matrix of the reduced-form disturbances, $\Omega$, by $\Sigma = \Gamma'\Omega\Gamma$.
3.5.2 Identification
Earlier I mentioned that even though we estimate the reduced form system of equations, the purpose really is to measure the parameters of the structural model. The problem that arises is that one reduced form can represent different structural models (economic theories). When more than one theory is consistent with the same reduced form, the theories are said to be observationally equivalent. The problem this causes is that without more information about the theory, and thus the structural form, we cannot estimate the structural parameters from the parameters estimated in the reduced form. It is this additional information that allows us to identify the model. The additional information about the theory comes in the following forms:
Normalization: Because we have a dependent variable for each equation, we can normalize each structural form equation so that the coefficient on its dependent variable equals 1.
Identities: In some models, variable definitions or equilibrium conditions imply that all
the coefficients of a particular equation are known.
Exclusions: Omitting variables from equations places zeros on B and Γ.
Linear Restrictions: If we know that the structural parameters follow some restrictions, these restrictions will help too in ruling out false structures.
Restrictions on the Disturbance Covariance Matrix: Knowing whether the structural disturbances are correlated or uncorrelated with each other.
Order Condition: Notice that in the structural form each equation may include endogenous variables as explanatory variables. The order condition is that in
each equation the number of exogenous variables of the whole system that are not
included in the equation is at least as large as the number of endogenous variables
included as explanatory variables in the equation. The idea is that the exogenous
variables that are left out are going to be used as instruments for the endogenous
variables in the equation, therefore the equation must be at least just-identified for
us to be able to estimate it. If you have more endogenous variables in the equation
than exogenous variables that are not included, then the equation is not identified.
If you have fewer endogenous variables in the equation than excluded exogenous variables, you then have an over-identified equation. This is equivalent to what
we saw when we talked about endogeneity and the IV and 2SLS estimators. The
order condition is only a necessary condition, i.e. it is necessary for the system
to be identified that each equation satisfies the order condition, but it is not a
sufficient condition.
Rank Condition: The rank condition states that for each equation, each of the variables excluded from the equation must appear in at least one other equation (no zero columns), and at least one of the variables excluded from the equation must appear in each of the other equations (no zero rows). This is equivalent to saying that the matrix of the coefficients, in the other equations, of the variables excluded from a given equation must have full row rank. This is a sufficient condition for identification.
To illustrate how to test for these two conditions, consider a system of three equations with three endogenous variables, $y_1$, $y_2$, and $y_3$, and three exogenous variables, $x_1$, $x_2$, and $x_3$. To test whether the order and rank conditions are met, we build a matrix of coefficients, with the rows containing the equations and the columns the variables:

        y1     y2     y3     x1     x2     x3
Eq. 1    1      0      0    β21      0    β41
Eq. 2    0      1    γ32    β12    β32      0
Eq. 3  γ13      0      1    β23      0    β43
Let us start with equation 1. To test the order condition, the number of endogenous explanatory variables in the equation has to be less than or equal to the number of excluded exogenous variables. We see that there are no coefficients on other endogenous variables (no γs), and there is one zero in the exogenous variables (one β is missing), so equation 1 is over-identified. We now need to test the rank condition for this equation.
To do this we check which columns have zeros in equation 1 ($y_2$, $y_3$, and $x_2$) and form a matrix with the coefficients in those columns from the other equations:
$$\begin{bmatrix} 1 & \gamma_{32} & \beta_{32} \\ 0 & 1 & 0 \end{bmatrix}.$$
First, no rows are all zeros, and second, the two rows are linearly independent (there is no multiple of the first row that equals the second), so the matrix has full row rank (2), and the equation is identified.
Let us now consider equation 2. The number of endogenous variables in the equation is equal to the number of excluded exogenous variables, 1, so the equation is just-identified and satisfies the order condition. The relevant matrix to test the rank condition is
$$\begin{bmatrix} 1 & \beta_{41} \\ \gamma_{13} & \beta_{43} \end{bmatrix}.$$
As long as the coefficients in one row are not proportional to the other row, the matrix
will have full row rank, 2, and be identified. If there is a multiple that can transform
row 1 into row 2, then the matrix will not have full row rank, and the equation would
not be identified.
Finally, consider equation 3. It includes one endogenous explanatory variable ($y_1$) and excludes one exogenous variable ($x_2$), so the order condition is satisfied with equality. The variables excluded from equation 3 are $y_2$ and $x_2$, and the matrix of their coefficients in the other equations is
$$\begin{bmatrix} 0 & 0 \\ 1 & \beta_{32} \end{bmatrix}.$$
We can see that this matrix does not have full row rank, since all we need to do is multiply the first column by $\beta_{32}$ to obtain two identical columns. The rank of the matrix is 1 (it also has a zero row), and equation 3 is not identified.
The case of equation 3 clearly illustrates how satisfying the order condition is not
sufficient to ensure identification. Equation 3 is just-identified according to the order
condition, but fails to be identified because it fails to satisfy the rank condition.
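For a numerical version of these checks, one can plug arbitrary nonzero values into the unknown coefficients and compute matrix ranks; the particular numbers below are placeholders:

```python
import numpy as np

# Placeholder values for the structural coefficients that appear in the submatrices.
g32, b32, g13, b41, b43 = 0.7, 1.3, -0.4, 2.0, 0.9

# Equation 1: coefficients on its excluded variables (y2, y3, x2) in equations 2 and 3.
M1 = np.array([[1.0, g32, b32],
               [0.0, 1.0, 0.0]])
print(np.linalg.matrix_rank(M1))    # 2: full row rank, equation 1 is identified

# Equation 2: coefficients on its excluded variables (y1, x3) in equations 1 and 3.
M2 = np.array([[1.0, b41],
               [g13, b43]])
print(np.linalg.matrix_rank(M2))    # 2 unless the rows happen to be proportional

# Equation 3: coefficients on its excluded variables (y2, x2) in equations 1 and 2.
M3 = np.array([[0.0, 0.0],
               [1.0, b32]])
print(np.linalg.matrix_rank(M3))    # 1: the rank condition fails
```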
For estimation purposes, let Xj represent the exogenous variables in equation j, and Yj
the endogenous variables in the right hand side of equation j. We can then write the
general model to be estimated as:
$$y_j = X_j\delta_j + Y_j\gamma_j + \varepsilon_j = Z_j\beta_j + \varepsilon_j, \tag{3.52}$$
where $Z_j = (X_j\ Y_j)$ and $\beta_j = (\delta_j'\ \gamma_j')'$. Just like in SUR estimation, we can estimate the system of equations either equation by equation, using limited information estimators, or estimate all equations simultaneously, using full information estimators. The major difference between these estimators and the SUR estimators is that we now have to account for the endogeneity of $Y_j$.
When you estimate the models equation by equation, since there are endogenous variables, OLS will provide a biased and inconsistent estimate, so the consistent estimator is 2SLS (which translates into the IV estimator in the just-identified case) for each equation:
$$\hat\beta_{j,2SLS} = \left(\hat Z_j'Z_j\right)^{-1}\hat Z_j'y_j = \left[Z_j'X(X'X)^{-1}X'Z_j\right]^{-1}Z_j'X(X'X)^{-1}X'y_j, \tag{3.53}$$
where X = (Xj X−j ) is the matrix of all exogenous variables in the system. The
asymptotic variance estimate is
$$\hat V\!\left[\hat\beta_{j,2SLS}\right] = \hat\sigma_{jj}\left(\hat Z_j'\hat Z_j\right)^{-1} = \hat\sigma_{jj}\left[Z_j'X(X'X)^{-1}X'Z_j\right]^{-1}, \tag{3.54}$$
where
$$\hat\sigma_{jj} = \frac{\left(y_j - Z_j\hat\beta_j\right)'\left(y_j - Z_j\hat\beta_j\right)}{n},$$
which uses the original variables $Z_j$, not the predicted ones $\hat Z_j$.
Note the role of the order condition for identification. It requires that the number of exogenous variables that appear elsewhere in the model be at least as large as the number of endogenous variables that appear in the equation. This is because we are predicting $Z_j = (X_j\ Y_j)$ using $X = (X_j\ X_{-j})$, which means that there must be at least as many variables in $X_{-j}$ as in $Y_j$, which is the order condition.
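A minimal numpy sketch of equation-by-equation 2SLS as in (3.53)-(3.54); `Z_j` is assumed to collect the included regressors $(X_j\ Y_j)$ of equation j and `X` all exogenous variables of the system:

```python
import numpy as np

def tsls(y_j, Z_j, X):
    """2SLS for one equation: instruments are all exogenous variables in X."""
    Z_hat = X @ np.linalg.solve(X.T @ X, X.T @ Z_j)   # first-stage fitted values of Z_j
    beta = np.linalg.solve(Z_hat.T @ Z_j, Z_hat.T @ y_j)
    e = y_j - Z_j @ beta                              # residuals use the original Z_j
    sigma_jj = (e @ e) / len(y_j)
    V = sigma_jj * np.linalg.inv(Z_hat.T @ Z_j)       # (3.54), since Zhat'Zhat = Zhat'Z_j
    return beta, V
```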
We are now going to estimate the coefficients using all the equations together. Let us
formulate the whole system as
$$\begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_M \end{bmatrix} = \begin{bmatrix} Z_1 & 0 & \cdots & 0 \\ 0 & Z_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & Z_M \end{bmatrix}\begin{bmatrix} \beta_1 \\ \beta_2 \\ \vdots \\ \beta_M \end{bmatrix} + \begin{bmatrix} \varepsilon_1 \\ \varepsilon_2 \\ \vdots \\ \varepsilon_M \end{bmatrix}, \tag{3.55a}$$
or
$$y = Z\beta + \varepsilon, \tag{3.55b}$$
where $E[\varepsilon \mid X] = 0$ and $E[\varepsilon\varepsilon' \mid X] = \Sigma = \Omega \otimes I_n$, i.e. homoskedastic within each equation.
The OLS estimator $\hat\beta = (Z'Z)^{-1}Z'y$ is equation-by-equation OLS and is inconsistent. But even if it were consistent, we know from SUR that it would be inefficient compared to an estimator that uses the cross-equation correlations of the disturbances. For the first issue, inconsistency, we need an IV-based estimator; for the second issue, inefficiency, we use a GLS approach. The three-stage least squares (3SLS) estimator combines the two, in three stages:
1st Stage: Estimate the reduced form in equation (3.51) by OLS (equation by equation) and compute $\hat Y_j$ for each equation. Notice that this is similar to the first stage of 2SLS.
2nd Stage: Compute $\hat\beta_{j,2SLS}$ for each equation by running OLS on each equation, replacing $Y_j$ by $\hat Y_j$ from stage 1. Then $\hat\Omega$ can be formed with
$$\hat\sigma_{ij} = \frac{\left(y_i - Z_i\hat\beta_{i,2SLS}\right)'\left(y_j - Z_j\hat\beta_{j,2SLS}\right)}{n}.$$
3rd Stage: Run FGLS of $y$ on $\hat Z = (X\ \hat Y)$ to get the 3SLS estimator
$$\hat\beta_{3SLS} = \left[\hat Z'\left(\hat\Omega \otimes I_n\right)^{-1}\hat Z\right]^{-1}\hat Z'\left(\hat\Omega \otimes I_n\right)^{-1}y. \tag{3.56}$$
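The three stages translate into a short numpy sketch; as before, `ys` and `Zs` are assumed lists with one entry per equation, and `X` holds all exogenous variables of the system:

```python
import numpy as np

def three_sls(ys, Zs, X):
    """3SLS as in equation (3.56): 2SLS residuals give Omega, then FGLS on Z_hat."""
    m, n = len(ys), len(ys[0])
    P = X @ np.linalg.inv(X.T @ X) @ X.T               # projection onto the instruments
    # Stages 1-2: 2SLS equation by equation, keep residuals to estimate Omega.
    resid = []
    for y, Z in zip(ys, Zs):
        b = np.linalg.solve(Z.T @ P @ Z, Z.T @ P @ y)
        resid.append(y - Z @ b)
    E = np.column_stack(resid)
    Omega = (E.T @ E) / n
    # Stage 3: FGLS on the stacked system, replacing each Z_j by its projection P Z_j.
    Zhat = np.zeros((m * n, sum(Z.shape[1] for Z in Zs)))
    col = 0
    for i, Z in enumerate(Zs):
        Zhat[i * n:(i + 1) * n, col:col + Z.shape[1]] = P @ Z
        col += Z.shape[1]
    y_big = np.concatenate(ys)
    Si = np.kron(np.linalg.inv(Omega), np.eye(n))
    beta = np.linalg.solve(Zhat.T @ Si @ Zhat, Zhat.T @ Si @ y_big)
    return beta, Omega
```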
In Stata, the command for 3SLS is reg3. This command is only valid for homoskedastic errors, as we have assumed in the presentation above, which of course is problematic when heteroskedasticity is present.
The assignment this time is to estimate the model discussed in Cameron and Trivedi (2009, section 6.6). Estimate each equation independently using 2SLS (remember that the instruments for each equation are the exogenous variables in the other equation that are not included in that equation), and test for endogeneity, the validity of the instruments, and the presence of weak instruments, as you did in the endogeneity assignment. Then estimate the whole system using 3SLS (basically do the same as they do in the book). 3SLS is supposed to be more efficient than 2SLS equation by equation, so make sure you compare the results of both estimations in that sense. Remember that although the purpose of the assignment is to do 3SLS, the write-up has to be a report on the estimation of both equations: you must analyze what the estimated coefficients mean and whether the coefficients differ between the two models. Finally, check whether the equations in the following model are identified:
$$\begin{bmatrix} y_1 & y_2 & y_3 & y_4 \end{bmatrix}\begin{bmatrix} 1 & \gamma_{12} & 0 & 0 \\ \gamma_{21} & 1 & \gamma_{23} & \gamma_{24} \\ 0 & \gamma_{32} & 1 & \gamma_{34} \\ \gamma_{41} & \gamma_{42} & 0 & 1 \end{bmatrix} + \begin{bmatrix} x_1 & x_2 & x_3 & x_4 & x_5 \end{bmatrix}\begin{bmatrix} 0 & \beta_{12} & \beta_{13} & \beta_{14} \\ \beta_{21} & 1 & 0 & \beta_{24} \\ \beta_{31} & \beta_{32} & \beta_{33} & 0 \\ 0 & 0 & \beta_{43} & \beta_{44} \\ 0 & \beta_{52} & 0 & 0 \end{bmatrix} = \begin{bmatrix} \varepsilon_1 & \varepsilon_2 & \varepsilon_3 & \varepsilon_4 \end{bmatrix}$$
Footnote 10: For more information on cmp go to http://ideas.repec.org/c/boc/bocode/s456882.html.
Footnote 11: See Greene (2012, p. 319) for what a recursive system is.
Bibliography
Greene, William H., Econometric Analysis, 7th ed., Upper Saddle River, NJ: Prentice Hall, 2012.