0% found this document useful (0 votes)
20 views

Tutorial Session 11 - Heteroscedasticity Solution

Uploaded by

lucastone325
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views

Tutorial Session 11 - Heteroscedasticity Solution

Uploaded by

lucastone325
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Econ 314: Quantitative Economics

Tutorial 10- Heteroscedasticity


Please print and attempt prior to the tutorial session.

1. What is heteroscedasticity? Why is it problematic? Explain in your own words.

Heteroscedasticity occurs when the error term does not have constant variance. It is a violation
of multiple linear regression M6. By violating assumption M6, heteroscedasticity prevents OLS
estimates from being BLUE because it results in standard errors that are not minimum variance
among the class of unbiased estimators.

2. What are the null and alternative hypotheses for testing for the presence of heteroscedasticity?
Why? Explain.

The null hypothesis for testing for the presence of heteroscedasticity is that the error term has
constant variance (or 𝐻𝐻0 : 𝑉𝑉𝑉𝑉𝑉𝑉(𝜀𝜀𝑖𝑖2 ) = 𝜎𝜎 2 ). The alternative hypothesis is that the error term has
non-constant variance (or 𝐻𝐻0 : 𝑉𝑉𝑉𝑉𝑉𝑉(𝜀𝜀𝑖𝑖2 ) ≠ 𝜎𝜎 2 ). The reason for this is that if we can reject that
null hypothesis that the error term has constant variance, then we can conclude that it has non-
constant variance, meaning that our heteroscedasticity is present in our data.

3. Suppose you are interested in explaining variation in Body Mass Index in a nationally-
representative sample of 12,486 men and women. You estimate the sample regression
function (with D =1 if female and 0 otherwise) as follows (and with standard errors in
parentheses):

�𝑖𝑖 = 26.37 − 5.61 𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝑖𝑖 + 0.48 𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖


𝐵𝐵𝐵𝐵𝐵𝐵

(4.62) (1.05) (0.13)

a) Which variable do you suspect could be responsible for heteroscedasticity in this model?
Why? Explain.

It seems most reasonable that Age is likely responsible for heteroskedasticity if present in
this model. The hypothesis would be that as people get older the variance of BMI
increases.

b) How would you use Weighted Least Squares to account for heteroskedasticity if it is
present? Explain.

To use Weighted Least Squares you need to know the exact form of heteroscedasticity.
Assume the form of heteroscedasticity is ℎ(𝑥𝑥) = 𝜎𝜎 2 𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 . As age increases so does the
variance of the error term increase proportionally i.e. it is linearly related. Because we
have assumed the exact form of heteroscedasticity, we can use weighted least squares.
Estimate the model

𝐵𝐵𝐵𝐵𝐵𝐵𝑖𝑖 1 𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝐹𝑒𝑒𝑖𝑖 𝐴𝐴𝑔𝑔𝑔𝑔𝑖𝑖 𝜀𝜀𝑖𝑖


= 𝛽𝛽0 + 𝛽𝛽1 + 𝛽𝛽2 +
�𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 �𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 �𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 �𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 �𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖

The variance of the error term of the weighted least squares model is now homoscedastic.

Page 1 of 3
4. Suppose you are interested in explaining variation in Vacation Spending in a sample of
10 000 high-school and university graduates. You estimate the sample regression function
(with D =1 if university graduate and 0 otherwise) (standard errors in parentheses)

𝑉𝑉𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎�
𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑖𝑖 = 109.82 + 0.12 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑖𝑖 + 317.39 𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈 𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝑖𝑖

(134.62) (0.05) (91.43)


a) Which variable do you suspect could be responsible for heteroscedasticity in this
model? Why? Explain.

It seems most reasonable that Income is likely responsible for heteroscedasticity if


present in this model. The hypothesis would be that as people’s income increases the
variance of Vacation Spending increases.

b) Suppose you know that heteroscedasticity takes the form ℎ(𝑥𝑥) = 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑖𝑖2 ∙ 𝜎𝜎 2 . How
would you use Weighted Least Squares to correct for the heteroscedasticity? Explain.

To use Weighted Least Squares you need to know the exact form of heteroscedasticity.
In this case the form of heteroscedasticity is ℎ(𝑥𝑥) = 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑖𝑖2 ∙ 𝜎𝜎 2 . So the variance of
the error term increases to the square of income. Because we have assumed the exact
form of heteroskedasticity, we can use weighted least squares. Estimate the model

𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉 𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑖𝑖 1 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑒𝑒𝑖𝑖 𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈𝑈. 𝐺𝐺𝑟𝑟𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑒𝑒𝑖𝑖 𝜀𝜀𝑖𝑖


= 𝛽𝛽0 + 𝛽𝛽1 + 𝛽𝛽2 +
𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑖𝑖 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑖𝑖 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑖𝑖 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑖𝑖 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝑖𝑖

The reason that weighted least squares works is that the variance of the error term is
now homoscedastic.

5. You are interested in estimating the money demand function in several countries. You obtain
cross-sectional data for 39 countries for the year 2015, and estimate the following model:

Source | SS df MS Number of obs = 39


-------------+------------------------------ F( 2, 36) = 1123.44
Model | 45.8892902 2 15.2964301 Prob > F = 0.0000
Residual | .476548269 36 .013615665 R-squared = 0.9897
-------------+------------------------------ Adj R-squared = 0.9888
Total | 46.3658385 38 1.22015364 Root MSE = .11669

------------------------------------------------------------------------------
lnmon | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
gdp | 1.227289 .0273449 44.88 0.000 1.171776 1.282802
cpi | .0429665 .0316949 1.36 0.184 -.0213776 .1073105
_cons | -2.27753 .3034308 -7.51 0.000 -2.893527 -1.661532
------------------------------------------------------------------------------
Variables: lnmon the natural log of the broad money stock per capita (in US $)
gdp GDP per capita (in US $)
cpi consumer inflation rate (% change)

Page 2 of 3
You also generated the following graph:

1000
800
600
Squared
400
200
0

2 4 6 8
gdp

a) Identify the specific type of model estimated above.


Log-lin/log-linear model
b) You use the scatter plot above to see whether a suspected problem with your regression
results might exist.
i. Which violation of the assumptions of the Classical Linear Regression Model did you
suspect might affect your results? Describe the nature of the problem briefly.
Heteroscedasticity :the residuals have non-constant variance
ii. Explain whether the scatter plot confirms your suspicions.
Yes. As the value of GDP increases, the values of the squared residuals increases.
iii. List two reasons why this problem could occur in your dataset.
Nature of data: mainly cross-sectional data (e.g. because of scale effect);
Outliers
Misspecification.

c) To remedy for the potential problem, you attempt the following transformation:
. g gdpsqrt=sqrt(gdp)
. g lnmont=lnmon/gdpsqrt
. g gdpt=gdp/gdpsqrt
. g cpit=cpi/gdpsqrt
. g xt=1/gdpsqrt
and you regress the transformed variables lnmont on gdpt, cpit and xt.
What is the assumption that you have made regarding the nature of the pattern/structure
of the problem suspected?
The error variance is proportional to X (gdp).
Page 3 of 3

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy