Assignment 1
Assignment 1
The assignment is due by wednesday (30th july), 11 PM IST. The answers must be uploaded
in PDF format with the following filename structure:
NameLastnameAssignment1.pdf
You will have only two attempts to upload the file. Each answer should be as brief as possible.
Long answers will receive negative marking. Any form of cheating or copying will lead to a
zero grade.
Late Submissions: Assignments can be submitted late on Canvas until the available date. How-
ever, appropriate deductions will apply to all late submissions.
Note: Attach the .do file for relevant questions in the appendix of your PDF submission.
Q.1 Ref,Watson and Stone, chap-12 page no-464
That is, how much does a woman’s labor supply fall when she has an additional child? In this
exercise, you will estimate this effect using data for married women from the 1980 U.S. Census.
The data are available on the text website, http://www.pearsonglobaleditions.com, in the file
Fertility and described in the file Fertility Description. The data set contains information
on married women aged 21–35 with two or more children.
a. Regress weeksworked on the indicator variable morekids, using OLS. On average, do women
with more than two children work less than women with two children? How much less?
b. Explain why the OLS regression estimated in (a) is inappropriate for estimating the causal
effect of fertility (morekids) on labor supply (weeksworked).
c. The data set contains the variable samesex, which is equal to 1 if the first two children are
of the same sex (boy–boy or girl–girl) and equal to 0 otherwise. Are couples whose first two
children are of the same sex more likely to have a third child? Is the effect large? Is it
statistically significant?
d. Explain why samesex is a valid instrument for the IV regression of weeksworked on morekids.
e. Is samesex a weak instrument?
f. Estimate the IV regression of weeksworked on morekids, using samesex as an instrument.
How large is the fertility effect on labor supply?
g. Do the results change when you include the variables age, black, hispan, and othrace in
the labor supply regression (treating these variables as exogenous)? Explain why or why not.
Q.2 Use the data in CARD.RAW for this exercise.
(a) In Table 15.1, the difference between the IV and OLS estimates of the return to education
are economically important. Obtain the reduced form residuals, û, from equation (15.32).
(See Table 15.1 for the other variables to include in the regression.) Use these to test whether
educ is exogenous; that is, determine if the difference between OLS and IV is statistically
significant.
(b) Estimate the equation by 2SLS, adding nearc2 as an instrument. Does the coefficient on
educ change much?
(c) Test the single overidentifying restriction from part (a).
1
ECON:313-Assignment 1 2
(ii) The variable ciuti, in thousands of dollars, is the change in college tuition facing students
from age 17 to age 18.
• Show that educ and ciuti are essentially uncorrelated
• What does this say about ciuti as a possible IV for educ in a simple regression analysis?
(iii) Extend the simple regression model:
β̂ − β = (X ′ X)−1 X ′ u
2
ECON:313-Assignment 1 3
Yi = β0 + β1 Xi + ui .
y = X 1β
e + ẽ
1
and
y = X 1β
b + X 2β
1
b + ê.
2
Let R12 and R22 be the R-squared from the two regressions. Show that R22 ≥ R12 . Is there
a case (explain) when there is equality R22 = R12 ?
3
ECON:313-Assignment 1 4
Q.7
In the regression model Y = β0 + β1 Xt + ut , Suppose cov(X, u) ̸= 0.
i. Show that the OLS estimate for β1 is inconsistent.
ii. If Z is an instrumental variable for X1 obtain the instrumental variable (IV) estimator
of β1 . Show that this estimator is consistent. If Z is not exogenous, then show that the IV
estimator of β1 has an asymptotic bias.
iii. If X and Z are both positively correlated with u and Corr(Z, X) > 0, how much corre-
lation would have to exist between X and u for the OLS estimator to have more asymptotic
bias than the IV estimator?
(Key concept 12.3) A set of m instruments Z1i , . . . , Zmi must satisfy the following two
conditions to be valid:
(a) Instrument Relevance
• In general, let X̂1i
∗ be the predicted value of X from the population regression of
1i
X1i on the instruments (Z’s) and the included exogenous regressors (W ’s), and let
“1” denote the constant regressor that takes on the value 1 for all observations.
Then
∗ ∗
(X̂1i , . . . , X̂ki , W1i , . . . , Wri , 1)
are not perfectly multicollinear.
• If there is only one X, then for the previous condition to hold, at least one Z must
have a nonzero coefficient in the population regression of X on the Z’s and the W ’s.
(b) Instrument Exogeneity
The instruments are uncorrelated with the error term; that is,
corr(Z1i , ui ) = 0, . . . , corr(Zmi , ui ) = 0.
The variables and errors in the IV regression model satisfy the following:
1. E(ui | W1i , . . . , Wri ) = 0;
2. (X1i , . . . , Xki , W1i , . . . , Wri , Z1i , . . . , Zmi , Yi ) are i.i.d. draws from their joint distribution;
3. Large outliers are unlikely: The X’s, W ’s, Z’s, and Y have nonzero finite fourth mo-
ments; and
4. The two conditions for a valid instrument in Key Concept 12.3 hold.