LEC11
MULTICOLLINEARITY
Chapter 8
MULTICOLLINEARITY: NATURE
One of the assumptions of the CLRM was that there is no exact linear
relationship between the independent variables of a regression model.
Informally, no exact linear relationship, or no collinearity, means that a
variable, say X2, cannot be expressed as an exact linear function of
another variable, say X3. Consider the model

Y = β1 + β2 X2 + β3 X3 + e

β2 is the impact of X2 on Y holding all other factors constant. If X2 is
related to X3, then β2 also captures the impact of changes in X3. In other
words, interpretation of the parameters becomes difficult.
When the explanatory variables are very highly correlated with each
other (correlation coefficients either very close to 1 or to -1) then the
problem of multicollinearity occurs.
MULTICOLLINEARITY: NATURE
An exact linear relationship is said to exist if the following condition is
satisfied:

λ2 X2 + λ3 X3 = 0

where λ2 and λ3 are constants and not all of them are zero simultaneously.

Consider the data:

X2: 1 2 3 4 5 6
X3: 2 4 6 8 10 12

In our case the equation λ2 X2 + λ3 X3 = 0 can be satisfied for non-zero
values of both λ2 and λ3. We have X3 = 2 X2, that is, λ2 = -2 and λ3 = 1.
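As a quick check, here is a minimal Python sketch (numpy is assumed to be available; the data are those of the example above) that verifies the condition λ2 X2 + λ3 X3 = 0 with λ2 = -2 and λ3 = 1:

import numpy as np

# Data from the example above: X3 = 2 * X2 for every observation.
X2 = np.array([1, 2, 3, 4, 5, 6], dtype=float)
X3 = np.array([2, 4, 6, 8, 10, 12], dtype=float)

# With lambda2 = -2 and lambda3 = 1, the exact-collinearity condition
# lambda2*X2 + lambda3*X3 = 0 holds for every observation.
lam2, lam3 = -2.0, 1.0
print(lam2 * X2 + lam3 * X3)   # [0. 0. 0. 0. 0. 0.]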
EXACT LINEAR RELATIONSHIP: CONSEQUENCE
In the case of perfect multicollinearity, we cannot get the inverse of XᵀX
because the matrix becomes singular, so the OLS estimator
β̂ = (XᵀX)⁻¹ Xᵀy cannot be computed.
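The following minimal Python sketch (numpy assumed; the design matrix is built from the example data and is purely illustrative) shows that a perfectly collinear regressor makes XᵀX singular, so its inverse does not exist:

import numpy as np

X2 = np.array([1, 2, 3, 4, 5, 6], dtype=float)
X3 = 2 * X2                                      # perfectly collinear with X2
X = np.column_stack([np.ones_like(X2), X2, X3])  # design matrix with intercept

XtX = X.T @ X
print(np.linalg.det(XtX))    # 0.0: X'X is singular

try:
    np.linalg.inv(XtX)       # OLS needs (X'X)^-1, which does not exist here
except np.linalg.LinAlgError as err:
    print("Cannot invert X'X:", err)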
NEAR-EXACT LINEAR RELATIONSHIP
A near-exact linear relationship is said to exist if the following
condition is satisfied:

λ2 X2 + λ3 X3 + v = 0

where λ2 and λ3 are constants, not all of them zero simultaneously, and v is
a random error term.

In our example, let X3* = 3 X2 + v, where v takes the values 4, 1, 3, 1, 0:

X2    X3 (= 3 X2)    X3* (= X3 + v)
3     9              13 (+4)
5     15             16 (+1)
6     18             21 (+3)
8     24             25 (+1)
10    30             30 (+0)

X3* is not an exact linear combination of X2, but the two variables are very
highly correlated (correlation coefficient = 0.9959).

MULTICOLLINEARITY: NATURE
Points to remember
Multicollinearity, as we have defined it, refers only to linear relationships
among the explanatory variables; it does not rule out nonlinear relationships
among them.
THEORETICAL CONSEQUENCES OF MULTICOLLINEARITY
When multicollinearity is severe, the F test will in most cases reject the
null hypothesis that the partial slope coefficients are jointly
(simultaneously) equal to zero, but individual t tests will show that none
or very few of the partial slope coefficients are statistically different
from zero.
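A small simulated illustration of this symptom, sketched in Python with statsmodels (the sample size, noise level, and coefficient values are assumptions chosen only for illustration):

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 50

# Two regressors that are almost perfectly collinear.
x2 = rng.normal(size=n)
x3 = x2 + rng.normal(scale=0.01, size=n)
y = 1.0 + 0.5 * x2 + 0.5 * x3 + rng.normal(size=n)

X = sm.add_constant(np.column_stack([x2, x3]))
res = sm.OLS(y, X).fit()

print(res.rsquared)               # high R-squared
print(res.fvalue, res.f_pvalue)   # overall F test: strongly significant
print(res.pvalues[1:])            # individual t tests: typically insignificant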
Subsidiary, or auxiliary regressions.
One way of finding out which X variable is highly collinear with the other
X variables is to run a set of auxiliary regressions, regressing each X in
turn on the remaining Xs:

Regress X2 on the remaining Xs
Regress X3 on the remaining Xs
...
Regress Xk on the remaining Xs

From each auxiliary regression obtain the R² and test its significance with
the following F statistic:
F = [R² / (k − 1)] / [(1 − R²) / (n − k)]
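A minimal Python sketch of one auxiliary regression, assuming numpy and statsmodels are available; the helper name auxiliary_r2_and_F is hypothetical, and the F statistic follows the formula above:

import numpy as np
import statsmodels.api as sm

def auxiliary_r2_and_F(X, j):
    # Regress column j of X on the remaining columns (plus an intercept)
    # and return the auxiliary R^2 together with
    # F = [R^2/(k - 1)] / [(1 - R^2)/(n - k)],
    # where n = number of observations, k = number of estimated parameters.
    n = X.shape[0]
    target = X[:, j]
    others = np.delete(X, j, axis=1)
    res = sm.OLS(target, sm.add_constant(others)).fit()
    r2 = res.rsquared
    k = others.shape[1] + 1          # slopes plus the intercept
    F = (r2 / (k - 1)) / ((1 - r2) / (n - k))
    return r2, F

# Example: with X holding the regressors column by column,
# auxiliary_r2_and_F(X, 0) gives R^2 and F for the first regressor.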
DETECTION OF MULTICOLLINEARITY: VARIANCE INFLATION FACTOR
Run the auxiliary regression and plug the value of R² into the following
formula. The variance inflation factor is:

VIF = 1 / (1 − R²)
Variance inflation factors range from 1 upwards. The numerical value of the
VIF tells us by how much the variance of a coefficient is inflated relative
to the case of no multicollinearity. For example, a VIF of 1.9 tells you
that the variance of a particular coefficient is 90% bigger than what you
would expect if there were no multicollinearity.
A rule of thumb for interpreting the variance inflation factor:
1 = not correlated.
Between 1 and 5 = moderately correlated.
Greater than 5 = highly correlated.
The higher the VIF, the less reliable the regression results are going to
be. In general, a VIF above 10 indicates high correlation and is cause for
concern; some authors suggest a more conservative threshold of 2.5.
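In practice the VIFs can be computed directly; for example, statsmodels provides variance_inflation_factor. A short Python sketch with simulated, purely illustrative data:

import numpy as np
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(1)
n = 100
x2 = rng.normal(size=n)
x3 = 0.9 * x2 + rng.normal(scale=0.5, size=n)    # correlated with x2
X = sm.add_constant(np.column_stack([x2, x3]))

# variance_inflation_factor regresses column j on all the other columns
# and returns 1 / (1 - R_j^2).
for j in (1, 2):
    print("VIF for regressor", j, "=", round(variance_inflation_factor(X, j), 2))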
REMEDIAL MEASURES
Dropping a Variable(s) from the Model
Faced with severe multicollinearity, the simplest solution might seem to be
to drop one or more of the collinear variables.
But this remedy can be worse than the disease (multicollinearity). When
formulating an economic model, we base the model on some theoretical
considerations.
Suppose that we are modelling the demand for chicken which, theoretically,
depends among other things on disposable income, the price of chicken, the
price of beef, and the price of goat. Following economic theory, we expect
all three prices to have some effect on the demand for chicken, since the
three meat products are to some extent competing products.
Suppose that, because of collinearity, we fail to separate the influence of
the prices of beef and goat on the quantity of chicken demanded. Dropping
those variables from the model, however, will lead to what is known as
model specification error.
The best practical advice is not to drop a variable from an economically
viable model just because the collinearity problem is serious.
REMEDIAL MEASURES
PRIOR INFORMATION ABOUT SOME PARAMETERS
Sometimes a particular phenomenon, such as a demand function, is
investigated time and again.
From prior studies it is possible that we can have some knowledge of the
values of one or more parameters. This knowledge can be profitably used in
the current sample.
Consider the following demand function, where quantity demanded depends on
price and income, price and income are collinear in our sample, and earlier
research has found an income coefficient of 0.9 in a similar regression.
We can use this information by imposing the prior value: subtract
0.9 × income from the dependent variable and regress the result on the
remaining regressors, estimating only the remaining parameters.
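A minimal Python sketch of this idea, assuming statsmodels and using simulated, purely illustrative data and variable names (quantity, price, income); the prior value 0.9 is taken from the example above:

import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
n = 80

# Illustrative data: price and income are collinear in the sample.
income = rng.normal(loc=50, scale=5, size=n)
price = 0.2 * income + rng.normal(scale=0.5, size=n)
quantity = 10 - 2.0 * price + 0.9 * income + rng.normal(size=n)

# Impose the prior estimate (income coefficient = 0.9): move its contribution
# to the left-hand side and estimate only the remaining parameters.
y_star = quantity - 0.9 * income
res = sm.OLS(y_star, sm.add_constant(price)).fit()
print(res.params)   # intercept and price coefficient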
Ridge regression