Econometrics

Practice Questions
DS 301: Basic Econometrics

1. (a). Consider the following regression model:
Model A: 𝑌 = 𝛽1 + 𝛽2 𝑋2 + 𝛽3 𝑋3 + 𝑢
Model B: 𝑋2 = 𝛼1 + 𝛼2 𝑋3 + 𝑟
Model C: 𝑌 = 𝜆1 + 𝜆2 𝑟̂ + 𝑣
Here, 𝑟̂ is the residuals obtained from model B.
Based on the above models, prove the followings:
i. 𝛽̂1 = 𝜆̂1 − 𝜆̂2 𝛼̂1
ii. 𝛽̂2 = 𝜆̂2
iii. 𝛽̂3 = −𝜆2 𝛼̂2
(b). A researcher obtained the following results based on 526 observations:

Model B: 𝑋2 = 13.6 − 0.06𝑋3
Model C: 𝑌 = 5.9 + 0.64𝑟̂
Find the values of the parameters of model A based on the above estimated models – B and C.
2. Consider the following regression model.

𝑌 = 𝛽1 + 𝛽2 𝑋2 + 𝛽3 𝑋3 + 𝛽4 𝑋4 + 𝑢
Where
𝑌 is regressand, 𝑋′𝑠 are regressors, 𝑢 is stochastic disturbance term, and 𝛽′𝑠 are population parameters.
a. Consider that the null hypothesis 𝛽2 = 0. What does this null hypothesis mean? How will you test
this hypothesis? Describe the procedure.
b. Consider the null hypothesis 𝛽2 = 𝛽3 = 0. What does this null hypothesis mean? How will you test
c. Form the null hypothesis of joint significance of the model or the overall significance of the model
and explain the meaning of that hypothesis. Describe the processing of testing your null
hypothesis.
d. A researcher has formed the following hypothesis:
𝐻0 : 𝛽2 + 𝛽3 + 𝛽4 = 1
Suggest a test statistics of testing the above hypothesis and describe the process of testing that
hypothesis.
e. Consider the null hypothesis 𝛽2 = 𝛽3 . What does this null hypothesis mean? How will you test this
hypothesis? Describe the procedure.
f. Consider the null hypothesis 2𝛽2 = 𝛽3 . What does this null hypothesis mean? How will you test
3. A researcher obtained the following regression results based on 64 observations (see example 7.1):
̂ = 263.6416 − 0.0056 𝑃𝐺𝑁𝑃 − 2.2316 𝐹𝐿𝑅
𝐶𝑀
𝑆𝑒 = (11.5932) (0.0019) (0.2099)
𝑅2 = 0.7077
Where 𝐶𝑀 refers to child mortality per 1000 live birth, 𝑃𝐺𝑁𝑃 refers to per capita GNP and 𝐹𝐿𝑅 is the
female literacy rate
i. Do the regression coefficients satisfy priori signs? Interpret the coefficients of regression
results using suitable incremental rate of the explanatory variables.
ii. Compute the impact of a unit increase in per capita GNP as well as a one percentage point
increase in female literacy rate on child mortality and interpret your finding.
iii. Test the significance of individual coefficients using suitable level of significance.
iv. Do you think all the slope coefficients are simultaneously equal to zero? How do you test
it? Show the detail calculation and draw your conclusion.
v. Compute the adjusted 𝑅2 and interpret it.
4. The following exponential function exhibit the relationship between output (𝑌) and two factors –
labor (𝑋2) and capital (𝑋3 ) of several firms in a country:
𝛽 𝛽
𝑌𝑖 = 𝛽0 𝑋2𝑖2 𝑋3𝑖3 𝑒 𝑢𝑖
a. Transform the above model into a linear regression model.
b. How do you estimate the parameters of the transformed linear regression model?
c. Describe the process of testing the null hypothesis 𝐻0 : 𝛽2 + 𝛽3 = 1.
d. Based on the information of 50 manufacturing firms, a researcher finds the following results.
̂
𝑙𝑛 𝑌 = 3.8876 + 0.4683 𝑙𝑛𝑋2 + 0.5213 𝑙𝑛𝑋3
𝑆𝑒 = (0.3962) (0.0989) (0.0969) 𝑅2 = 0.9642
i. What is the output elasticity of labor? What does it mean? Is it statistically significant at
5 percent level of significance? Show your calculation.
ii. What is the output elasticity of capital? What does it mean? Is it statistically significant
at 5 percent level of significance? Show your calculation.
iii. Test the overall significance of the model.
iv. Does the economy exhibit constant returns to scale? Show your calculation.
[Note: 𝑐𝑜𝑣(𝛽̂2 , 𝛽̂3 ) = −0.0092]
5. A researcher estimated the Cobb-Douglas production function for an economy using 20 observations:
(a) 𝑙𝑛̂𝐺𝐷𝑃 = −1.6524 + 0.3397 ln 𝐿𝑎𝑏𝑜𝑟 + 0.8460 ln 𝐶𝑎𝑝𝑖𝑡𝑎𝑙

𝑡 = (−2.7259) (1.8295) (9.0625)
2
𝑅𝑈𝑅 = 0.0.9951, 𝑅𝑆𝑆𝑈𝑅 = 0.0136
Here,ln 𝐺𝐷𝑃= log of Gross Domestic Product.
(b) ̂
𝑙𝑛 (
𝐺𝐷𝑃
) = −0.4947 + 1.1053 ln (
𝐶𝑎𝑝𝑖𝑡𝑎𝑙
)
𝐿𝑎𝑏𝑜𝑟 𝐿𝑎𝑏𝑜𝑟
𝑡 = (−4.0612) (28.1056)
𝑅𝑅2 = 0.9777, 𝑅𝑆𝑆𝑅 = 0.0166
i. Interpret the results of both models.

ii. What is the underlying assumption/restriction of estimating model (b) rather of estimating
model (a)? Is the restriction valid? How do you know it? Show your calculation.
6. A researcher estimated the following regression results based on 11 observations of quantity demanded
(Y) and price of coffee (X) (see example 7.2).
(a) ̂𝑌 = 2.6911 − 0.4795 𝑋
𝑆𝑒 = (0.1216) (0.1140) 𝑟 2 = 0.6628
(b) ̂
𝑙𝑛 𝑌 = 0.7774 − 0.2530 𝑙𝑛𝑋
𝑆𝑒 = (0.0152) (0.0494) 𝑟 2 = 0.7448
i. Interpret the results of model (a) and model (b).

ii. Are the coefficients of both models statistically significantly different from zero? Show your
calculation and draw your conclusion.
iii. The researcher concluded that since 𝑟 2 of model (b) is higher than that of model (a), so model
(b) better fits the data than model (a). Do you agree with the researcher? Why or why not?
Explain.
7. From the data of 46 states in the United States for 1992, a researcher obtained the following regression
results:
̂𝑙𝑛 𝐶 = 4.30 − 1.34 ln 𝑃 + 0.17 ln 𝑌

𝑆𝑒 = (0.91) (0.32) (0.20) 𝑅̅2 = 0.27
Here, 𝐶= cigarette consumption, packs per year; 𝑃 = real price per pack; and 𝑌 = real disposable income
per capita.
i. What is the elasticity of demand for cigarettes with respect to price? Is it statistically
significant? If so, is it statistically significantly different from 1? Show your calculation.
ii. What is the income elasticity of demand for cigarettes? Is it statistically significant? Show your
calculation.
iii. Retrieve 𝑅2 .
8. From a sample of 209 firms, an Econometrician obtained the following regression results:
Table: Determinant of log of salaries of CEOs
Explanatory variables Coefficient Se
Log (sales – annual firm sales) 0.280 0.035
roe (Return on equity in percent) 0.0174 0.0041
ros (Return on firm’s stock) 0.00024 0.00054
Constant 4.32 0.32
i. Interpret the preceding regression taking into account any prior expectations that you may
have about the signs of the various coefficients.
ii. Which of the coefficients are individually statistically significant at the 5 percent level?
iii. What is the overall significance of the regression? Which test do you use? And why?
iv. Can you interpret the coefficients of roe and ros as elasticity coefficient? Why or why not?
9. Show the relationship between F and 𝑅2 .

10. Show the relationship between 𝑅 2 and 𝑅̅2 .
11. Using the cross-sectional data for 64 countries on child mortality (𝐶𝑀) under age five in a year per 1000
live births, per capita GNP (𝑃𝐺𝑁𝑃) in a particular year, female literacy rate (𝐹𝐿𝑅), and total fertility
rate (𝑇𝐹𝑅), a research was trying to analyze child mortality. The author proposed the following
regression models to find the determinants of child mortality.
Model A : 𝐶𝑀𝑖 = 𝛽1 + 𝛽2 𝑃𝐺𝑁𝑃𝑖 + 𝛽3 𝐹𝐿𝑅𝑖 + 𝑢𝑖

Model B : 𝐶𝑀𝑖 = 𝛽1 + 𝛽2 𝑃𝐺𝑁𝑃𝑖 + 𝛽3 𝐹𝐿𝑅𝑖 + 𝛽4 𝑇𝐹𝑅𝑖 + 𝑢𝑖
In the above models, 𝑢𝑖 is the stochastic disturbance term satisfying the standard assumptions of
Ordinary Least Square Method.
The estimated OLS results are presented in the following table.

Model A Model B
Coefficient /se Coefficient /se
Model A Model B
Coefficient /se Coefficient /se
Per capita GNP -0.006*** -0.006***
(0.002) (0.002)
Female literacy rate, percent -2.232*** -1.768***
(0.210) (0.248)
Total fertility rate, the average number of children born to a
12.869***
woman
(4.191)
Constant 263.642*** 168.307***
(11.593) (32.892)
𝑅2 0.708 0.747
Adjusted 𝑅2 0.698 0.735
Note: *** p<0.01, ** p<0.05, * p<0.1
(a) How would you interpret the coefficient of TFR? A priori, would you expect a positive or negative
relationship between CM and TFR? Justify your answer.
(b) Have the coefficient values of PGNP and FR changed between the two equations? If so, what may
be the reason(s) for such a change? Is the observed difference statistically significant? Which test
do you use and why?
(c) How would you choose between models A and B? Which statistical test would you use to answer
this question? Show the necessary calculations.
(d) We have not given the standard error of the coefficient of TFR. Can you find it out? (Hint: Recall
the relationship between the t and F distributions.)
12. A franchise management in American was trying to estimate the effect of advertising on sales or total
revenue and was proposing to estimate the following regression model:
Model A: 𝑆 = 𝛽1 + 𝛽2 𝑃 + 𝛽3 𝐴 + 𝑢
Here, 𝑆 represents sales or revenue measured in 1000 dollar as unit, 𝑃 is per unit price, and 𝐴 is
advertising cost measured in 1000 dollar as unit.
Later on, the management presumed that advertising may have diminishing return and therefore,
postulated the following form of the model.
Model B: 𝑆 = 𝛽1 + 𝛽2 𝑃 + 𝛽3 𝐴 + 𝛽4 𝐴2 + 𝑢
a. Explain how would you test the hypothesis that advertising has no effect on sales in model A and
model B.
b. What will be priori sign of 𝛽3 and 𝛽4 in model B if the assumption of diminishing returns to
advertising become true? Describe the process of testing the hypothesis that for 40000 dollar, the
return to advertising reaches at its optimal level.
c. Using OLS method, the economist find the following results:
𝑆̂ = 104.81 − 6.582𝑃 + 3.36𝐴 − 0.0268𝐴2
𝑠𝑒 = (3.74) (1.582) (0.42) (0.0159)
𝑛 = 78, 𝑅𝑆𝑆 = 2592.301
i. Interpret the regression result.
ii. Find the optimal level of advertising.
iii. Do you think that advertising has no effect on sales? How do you know that? Show your
calculation. [𝑅𝑆𝑆𝑅 = 20907.331]
iv. Suppose that the franchise management, based on experience in other cities, thinks
that the optimal level of advertising in problem c(ii) is too high, and that the optimal level
of advertising is actually about $40,000. How would you test this hypothesis? Show your
calculation and draw your conclusion. [𝑐𝑜𝑣(𝛽3 , 𝛽4 ) = −0.0064]
13. Define dummy variable. Discuss its nature and usages in social science researches. What cautionary
measures should be taken during incorporating dummy variables in regression model?
14. A macroeconomist was studying a time series data for the period 1990-2015. S/he was trying to know
whether there is structural differences in savings-income relationship in Bangladesh. S/he considers
2002 as the structural break year.
The restricted estimated regression model is
To analyze the structural differences s/he used the dummy variable technique and estimated the
following unrestricted regression result.
Where,
𝑌𝑡 is the savings at time 𝑡,
𝑋𝑡 is the income at time 𝑡,
𝐷𝑡 is the dummy variable containing two values: 1 if the data come from 2002-2015 and 0
otherwise. 𝑡 is time.
i. Estimate the regression line for the periods 1990-2001 and 2002-2015.
ii. Do you think the intercept differential is significantly different from zero? Show your
calculation.
iii. Do you think the slope differential is significantly different from zero? Show your
calculation.
iv. Do you think both intercept and slope differentials are simultaneously significantly
different from zero? How do you know it? Describe the process and draw your conclusion.
v. What was the alternative way of analyzing structural differences of savings-income
relationship? What were the limitations of that approach?
15. A student of DDS collected data on red roses, the dozen of red roses sold quarterly, in various flower
markets in Dhaka city. The student aims to estimate a demand function for red roses. S/he primarily
decided to estimate the following two regression models:
Model A: 𝑌𝑡 = 𝛼1 + 𝛼2 𝑋2𝑡 + 𝛼3 𝑋3𝑡 + 𝛼4 𝑋4𝑡 + 𝛼5 𝑋5𝑡 + 𝑢𝑡

Model B: 𝑙𝑛𝑌𝑡 = 𝛽1 + 𝛽2 𝑙𝑛𝑋2𝑡 + 𝛽3 𝑙𝑛𝑋3𝑡 + 𝛽4 𝑙𝑛𝑋4𝑡 + 𝛽5 𝑙𝑛𝑋5𝑡 + 𝑢𝑡
Where
𝑌 is the quantity of red roses sold, dozens.
𝑋2 is the average wholesale price of red roses.
𝑋3 is the average wholesale price of white roses.
𝑋4 is the average weekly family disposable income.
𝑋5 is the trend variable taking values of 1,2,and so on.
𝑙𝑛 natural log
Based on the collected data she obtained the following regression results:
Model A:
Source SS df MS Number of obs = 16

F(4, 11) = 13.89
Model 52249136.4 4 13062284.1 Prob > F = 0.0003
Residual 10347219.6 11 940656.327 R-squared = 0.8347
Adj R-squared = 0.7746
Total 62596356 15 4173090.4 Root MSE = 969.87
Y Coef. Std. Err. t P>|t| [95% Conf. Interval]
X2 -2227.704 920.4657 -2.42 0.034 -4253.636 -201.773

X3 1251.141 1157.021 1.08 0.303 -1295.444 3797.726
X4 6.282986 30.62166 0.21 0.841 -61.11482 73.6808
X5 -197.3999 101.5612 -1.94 0.078 -420.9347 26.13482
_cons 10816.04 5988.348 1.81 0.098 -2364.223 23996.31
Model B:
Source SS df MS Number of obs = 16
F(4, 11) = 9.63
Model 1.09893508 4 .27473377 Prob > F = 0.0013
Residual .313663766 11 .028514888 R-squared = 0.7780
Adj R-squared = 0.6972
Total 1.41259884 15 .094173256 Root MSE = .16886
lnY Coef. Std. Err. t P>|t| [95% Conf. Interval]
lnX2 -1.273554 .5266486 -2.42 0.034 -2.432699 -.1144078

lnX3 .937304 .6591908 1.42 0.183 -.5135652 2.388173
lnX4 1.712984 1.200845 1.43 0.181 -.9300571 4.356025
lnX5 -.1815972 .1278933 -1.42 0.183 -.4630885 .0998942
_cons .6267849 6.148268 0.10 0.921 -12.90546 14.15903
(a) Shortly discuss, what are the fundamental differences between model A and model B?
(b) Interpret the both regression results. Do the results concur with the a priori expectations about the
signs of the parameters? Discuss.
(c) List which regression coefficients are significant in model A and model B at 5 percent level of
significance.
(d) The student concludes that as model A has higher 𝑅2 value than model B, model A is better than
model B. Do you agree with her? Explain your argument.
16. Recently ABC Company has recruited a manager. The company sells different kinds of breads in the
markets of different cities of the country. The company has taken initiatives to expand the
understanding of the product through newspaper advertising in the hope of higher sales. The company
has shared the monthly sales (SALES), a price index for all products sold in a given month (PRICE),
and monthly advertising expenditure (ADVERT). The SALES and ADVERT are measured in thousand
($1000). There are 75 observations in the dataset. The manager has planned to estimate the following
regression models and to report the findings to the management.
Model A: SALESi = β0 + β1 PRICEi + β2 ADVERTi + 𝑢𝑖
Model B: SALESi = β0 + β1 PRICEi + β2 ADVERTi + 𝛽3 ADVERTi2 + 𝑣𝑖

Based on the given data, the manager has obtained the following regression results:
Model A Model B
Variable Coefficient SE
PRICE β1 -7.908 1.096 -7.640 1.046
ADVERT β2 1.863 0.683 12.151 3.556
ADVERT 2 β3 - - -2.768 0.941
β0 118.914 6.352 109.719 6.799
Summary Statistics of the Model
Number of observations (N) 75 75
Model Sum of Square 1,396.539 1,583.397
Residual Sum of Square 1,718.943 1,532.084
F 29.248 24.459
Note: The average sale is equal to; the average price is 5.6872; and the average advertising is $1844,
the standard deviations of the variables are 6.488537, 0.518432, and 0.8316769 respectively.
a) Interpret the results in model A and Model B.

b) Construct 95% confidence interval for all of the parameters in model A and Model B.
c) Test the overall significance of model A and model B.
d) The management is planning to reduce the price by 40 cents and increase the monthly advertising
expenditure to $800 as a strategy to boost the sales of the company. Compute the following based on
the results in model A:
i. The expected change in monthly sales.
ii. The 95% confidence interval of the expected change in monthly sales. [Note: 𝑐𝑜𝑣(β̂1 , β̂2 ) =
−0.01974215]
iii. Do you think that the expected change in sales is statistically different from $4000? How do
you know that? Show your calculation and comment on your result.
e) An adviser of the company claims that dropping the price by 20 cents will be more effective for
increasing sales revenue than increasing advertising expenditure by $500. Considering the results in
model A, will you agree or disagree with the advisor? Why or why not? [Note: test the null
hypothesis: 𝐻0 : − 0.2𝛽1 = 0.5𝛽2 against the alternative hypothesis 𝐻1 : −0.2𝛽1 > 0.5𝛽2 ]
f) What is the underlying assumption of incorporating the square of advertising expenditure as an
additional explanatory variable in model B? Is the assumption valid? Why or why not? Show your
calculation. [Note: diminishing returns to advertising expenditure]
g) Find the marginal effect of another unit of advertising ($1000) on monthly sales at the following
advertising level:
i. $500
ii. $2000
iii. $3000
h) Find the level of advertising that maximizes the net sales.
i) Predict the expected sales in model A and Model B for the following values of price and advertising
(PRICE, ADVERT).
i. (5.5, $1500)
ii. (4, $2500)
j) Compute adjusted 𝑅2 (𝑅̅2 ) for model A and model B and comment on ‘which model fits the data well’.
k) Estimate the price elasticity of sales and advertising elasticity of sales in model A and model B.
17. A young researcher was trying to estimate the following the regression models:
Model A: 𝑙𝑛𝑌𝑖 = 𝛼1 + 𝛼2 𝐾𝑊𝑊𝑖 + 𝛼3 𝐼𝑄𝑖 + 𝛼4 𝐸𝐷𝑈𝐶𝑖 + 𝛼5 𝑙𝑛𝑇𝑖 + 𝑢𝑖
Model B: 𝑙𝑛𝑌𝑖 = 𝛽1 + 𝛽2 𝐾𝑊𝑊𝑖 + 𝛽3 𝐼𝑄𝑖 + 𝛽4 𝐸𝐷𝑈𝐶𝑖 + 𝛽5 𝑙𝑛𝑇𝑖 + 𝛽6 𝐵𝐿𝐴𝐶𝐾 +∈𝑖
Model C: 𝑙𝑛𝑌𝑖 = 𝜆1 + 𝜆2 𝐾𝑊𝑊𝑖 + 𝜆3 𝐼𝑄𝑖 + 𝜆4 𝐸𝐷𝑈𝐶𝑖 + 𝜆5 𝑙𝑛𝑇𝑖 + 𝜆7 𝐾𝑊𝑊 ∗ 𝐵𝐿𝐴𝐶𝐾 + 𝜆8 𝐼𝑄 ∗ 𝐵𝐿𝐴𝐶𝐾 + 𝜆9 𝐸𝐷𝑈𝐶 ∗ 𝐵𝐿𝐴𝐶𝐾 +
𝜆10 𝑙𝑛𝑇 ∗ 𝐵𝐿𝐴𝐶𝐾 + 𝜀𝑖
Model B: 𝑙𝑛𝑌𝑖 = 𝛾1 + 𝛾2 𝐾𝑊𝑊𝑖 + 𝛾3 𝐼𝑄𝑖 + 𝛾4 𝐸𝐷𝑈𝐶𝑖 + 𝛾5 𝑙𝑛𝑇𝑖 + 𝛾6 𝐵𝐿𝐴𝐶𝐾 + 𝛾7 𝐾𝑊𝑊 ∗ 𝐵𝐿𝐴𝐶𝐾 + 𝛾8 𝐼𝑄 ∗ 𝐵𝐿𝐴𝐶𝐾 + 𝛾9 𝐸𝐷𝑈𝐶 ∗
𝐵𝐿𝐴𝐶𝐾 + 𝛾10 𝑙𝑛𝑇 ∗ 𝐵𝐿𝐴𝐶𝐾 + 𝑒𝑖
Here 𝑙𝑛𝑌𝑖 is log of monthly earnings, 𝐾𝑊𝑊 is the knowledge of world work score, IQ is IQ score, EDUC
is years of schooling, 𝑙𝑛𝑇 is log of tenure and BLACK is a binary race variable containing 1 value if race is black
or 0 otherwise.
Model B assumes that the wage equations of black and non-black labor have different intercept only
(hence, the race dummy is just added) and model C assumes that wage equations for black and non-
black labor have same intercept but have different slopes (the race dummy is omitted but the
interaction of race variable with the explanatory variables are added). In model D, the researcher
added a race dummy variable and a set of interactive terms under the presumption that race has both
intercept differential and slope differentials. Therefore, in this setting, model A assumes that race has
neither intercept differential nor slope differentials. The results of the model is reported in the
following table:
Table 1: Determinants of monthly earnings

Explanatory variable Model A Model B Model C Model D
coef/se coef/se coef/se coef/se
Knowledge of world work score [KWW] 0.009 0.008 0.011 0.011
(0.002) (0.002) (0.002) (0.002)
IQ score [IQ] 0.004 0.003 0.003 0.003
(0.001) (0.001) (0.001) (0.001)
Years of education [EDUC] 0.033 0.034 0.035 0.035
(0.007) (0.007) (0.007) (0.007)
Log of tenure [lnT] 0.078 0.076 0.059 0.060
(0.014) (0.014) (0.015) (0.015)
Race (=1 if black) -0.129 0.167
(0.041) (0.305)
KWWblack [KWW*BLACK] -0.017 -0.017
(0.005) (0.006)
IQblack [IQ*BLACK] 0.005 0.005
(0.003) (0.003)
Educblack [EDUC*BLACK] -0.016 -0.024
(0.019) (0.023)
lnTblack [lnT*BLACK] 0.095 0.090
(0.038) (0.040)
Constant 5.462 5.599 5.589 5.567
(0.097) (0.106) (0.104) (0.112)
Explained Sum of Square (ESS) 30.385 31.823 34.644 34.687
Residual Sum of Square (RSS) 135.271 133.834 131.012 130.970
Number of observations 935 935 935 935
Mean VIF 1.30 1.32 13.61 22.84
Homoscedasticity [𝑃𝑟𝑜𝑏 > 𝜒12 ] 0.07 0.10 0.18 0.19
No omitted variable [𝑃𝑟𝑜𝑏 > 𝐹] 0.55 0.93 0.80 0.56
a) Interpret the results in model A.

b) Construct 95% confidence interval for the coefficient of KWW variable in model A.
c) Test that 𝐻0 : 𝛼2 = 0
d) Test that IQ and EDUC have equal effect on log of monthly earnings [note: use model A].
[𝑐𝑜𝑣(𝛼̂3 , 𝛼̂4 ) = 0]
e) Test that EDUC has 10 times higher effect on log of monthly earnings compare to the effect of IQ
[note: use model A]. [𝑐𝑜𝑣(𝛼̂3 , 𝛼̂4 ) = 0]
f) Test that 𝐻0 : 𝛼2 + 𝛼3 + 𝛼4 = 0.05 in model A.
g) Test that 𝐻0 : 5𝛼2 − 𝛼3 + 2𝛼4 = 0.5 in model A.
h) Test the significance of the coefficient of race variable in model B.
i) Find the mean wage equation for black labor and non-black labor from model B, C, and D.
j) Find the significant and insignificant coefficients in model A, B, C and D.
k) Construct and test the hypothesis and the wage equations of black and non-black labor have same
slopes but different intercept only. [Note: use model B].
l) Construct and test the hypothesis that only slope coefficients of wage equations of both black and
non-black labor are same.
m) Construct and test the hypothesis that both black and non-black labor have similar wage equation.
n) Test the hypothesis that 𝐻0 : 𝛾2 + 𝛾7 = 0 from model D. [𝑐𝑜𝑣(𝛾̂2 , 𝛾̂7 ) = 0]
o) At 1% significance level, calculate the join significance of the regression in model all models.
p) Do the models suffer from multicollinearity, heteroscedasticity, and omitted variable bias? Give
your arguments.
18. In a study of turnover in the labor market, James F. Ragan, Jr., aimed to estimate the following
regression model for the U.S. economy for the period of 1950–I to 1979–IV:
𝑙𝑛𝑄𝑡 = 𝛼1 + 𝛼2 𝑙𝑛𝐶𝑌𝐶𝑡 + 𝛼3 𝑙𝑛𝑌𝑁𝐺𝑡 + 𝛼4 𝑙𝑛𝐸𝑀𝑃𝑡 + 𝛼5 𝑙𝑛𝑊𝑂𝑀𝑡 + 𝛼6 𝑇𝐼𝑀𝐸 + 𝑢𝑡
Where 𝑄 = quit rate in manufacturing industry, defined as number of people leaving jobs voluntarily
per 100 employees; 𝐶𝑌𝐶 = an instrumental or proxy variable for adult male unemployment rate; 𝑌𝑁𝐺 =
percentage of employees younger than 25; 𝐸𝑀𝑃 = 𝑁𝑡−1 /𝑁𝑡−4 = ratio of manufacturing employment in
quarter (t - 1) to that in quarter (t - 4); 𝑊𝑂𝑀 = percentage of women employees; and 𝑇𝐼𝑀𝐸 = time trend
(1950–I = 1).
The estimated results are reported in the following table:
Table: Determinants of quit rate in manufacturing industry in US economy
Explanatory variables Coefficient t-value
Constant 4.47 4.28
lnCYC -0.34 -5.31
lnYNG 1.22 3.64
lnEMP 1.20 3.10
lnWOM 0.80 1.10
TIME -0.0055 -3.09
Model’s statistics: 𝑅̅ 2 = 0.5370
(i) Shortly describe the factors determining the quit rate in manufacturing industry in US
economy based on the results reported in the above table.
(ii) Find the standard errors of the regression coefficients from the given data.
(iii) Test the overall significance of the above regression result.
19. (a). Marc Nerlove has estimated the following cost function for electricity generation
𝛼 𝛼 𝛼
𝑌𝑖 = 𝐴𝑋𝛽 𝑃1 1 𝑃2 2 𝑃3 3 𝑒 𝑢𝑖
where Y = total cost of production, X = output in kilowatt hours, 𝑃1 = price of labor input, 𝑃2 = price of
capital input, 𝑃3 = price of fuel, and u = disturbance term
By imposing a special restriction, the author transformed the above model as follows:
𝑌𝑖 𝑃1 𝛼1 𝑃2 𝛼2
= 𝐴𝑋𝛽 ( ) ( ) 𝑒 𝑢𝑖
𝑃3 𝑃3 𝑃3
i. What was the special restriction? Explain the meaning of the restriction.
ii. Explain the process of testing whether the restriction is valid or not.
(b). On the basis of a sample of 29 medium-sized firms, and after logarithmic transformation, Nerlove
obtained the following regression results.
̂ 𝑖 = −4.93 + 0.94𝑙𝑛𝑋𝑖 + 0.31𝑙𝑛𝑃1 − 0.26𝑙𝑛𝑃2 + 0.44𝑙𝑛𝑃3

𝑙𝑛𝑌
Se = (1.96) (0.11) (0.23) (0.29) (0.07)
RSS=0.336
̂ 𝑌 𝑃1 𝑃2
ln ( ) = −6.55 + 0.91𝑙𝑛𝑋 + 0.51𝑙𝑛 ( ) + 0.09𝑙𝑛 ( )
𝑃3 𝑖 𝑃3 𝑃3
Se= (0.16) (0.11) (0.19) (0.16)
RSS=0.364
i. Do the above results confirm the assumption/restriction made by the author? Show your
calculation.
ii. Test the hypothesis 𝛽 = 1.

Econometrics

Uploaded by

Copyright:

Available Formats

Econometrics

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Econometrics

Uploaded by

Copyright:

Available Formats

Practice Questions

DS 301: Basic Econometrics

(b). A researcher obtained the following results based on 526 observations:

2. Consider the following regression model.

(a) 𝑙𝑛̂𝐺𝐷𝑃 = −1.6524 + 0.3397 ln 𝐿𝑎𝑏𝑜𝑟 + 0.8460 ln 𝐶𝑎𝑝𝑖𝑡𝑎𝑙

i. Interpret the results of both models.

i. Interpret the results of model (a) and model (b).

̂𝑙𝑛 𝐶 = 4.30 − 1.34 ln 𝑃 + 0.17 ln 𝑌

9. Show the relationship between F and 𝑅2 .

Model A : 𝐶𝑀𝑖 = 𝛽1 + 𝛽2 𝑃𝐺𝑁𝑃𝑖 + 𝛽3 𝐹𝐿𝑅𝑖 + 𝑢𝑖

The estimated OLS results are presented in the following table.

Model A: 𝑌𝑡 = 𝛼1 + 𝛼2 𝑋2𝑡 + 𝛼3 𝑋3𝑡 + 𝛼4 𝑋4𝑡 + 𝛼5 𝑋5𝑡 + 𝑢𝑡

Source SS df MS Number of obs = 16

Y Coef. Std. Err. t P>|t| [95% Conf. Interval]

X2 -2227.704 920.4657 -2.42 0.034 -4253.636 -201.773

lnY Coef. Std. Err. t P>|t| [95% Conf. Interval]

lnX2 -1.273554 .5266486 -2.42 0.034 -2.432699 -.1144078

Model A: SALESi = β0 + β1 PRICEi + β2 ADVERTi + 𝑢𝑖

Model B: SALESi = β0 + β1 PRICEi + β2 ADVERTi + 𝛽3 ADVERTi2 + 𝑣𝑖

a) Interpret the results in model A and Model B.

Table 1: Determinants of monthly earnings

a) Interpret the results in model A.

𝑙𝑛𝑄𝑡 = 𝛼1 + 𝛼2 𝑙𝑛𝐶𝑌𝐶𝑡 + 𝛼3 𝑙𝑛𝑌𝑁𝐺𝑡 + 𝛼4 𝑙𝑛𝐸𝑀𝑃𝑡 + 𝛼5 𝑙𝑛𝑊𝑂𝑀𝑡 + 𝛼6 𝑇𝐼𝑀𝐸 + 𝑢𝑡

The estimated results are reported in the following table:

Table: Determinants of quit rate in manufacturing industry in US economy

Explanatory variables Coefficient t-value

Constant 4.47 4.28

lnCYC -0.34 -5.31

lnYNG 1.22 3.64

lnEMP 1.20 3.10

lnWOM 0.80 1.10

TIME -0.0055 -3.09

Model’s statistics: 𝑅̅ 2 = 0.5370

̂ 𝑖 = −4.93 + 0.94𝑙𝑛𝑋𝑖 + 0.31𝑙𝑛𝑃1 − 0.26𝑙𝑛𝑃2 + 0.44𝑙𝑛𝑃3

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.