Lecture 1 - Introduction and Background
Lecture 1 - Introduction and Background
Lecture 1 - Introduction and Background
Introduction
Identify the scope of research
The flow chart of econometrics
Type of Data
The Data Structure
Size of Data
The model specification
Descriptive Statistic
Data plotting
Data transformation
Data cleaning
Inference
Hypothesis
Estimation
3
Interpretation
AA017004 ECONOMETRIC THEORY AND METHODS
INTRODUCTION
The central purpose of methods and procedures in the research project proposal is to provide the plan, and its
description, of how the objectives of the study will achieved.
It is the what, why, and how of the research project: step-by-step specification and description of what will be
done, how it will be done, and why it will be done in the specified manner.
How it will be done includes the order or sequence of steps in the process. It includes specification of the
quantitative estimation techniques, analytical methods, data that will be collected, and how the data will be
obtained and processed.
Methods and procedures also include how each of these relate to the objectives of the research, how empirical
estimates will be tested and analyzed. Model development and design, justification of model mathematical form,
how results will be interpreted, and so on.
From a more general perspective, the methods and procedures delineate the approach for testing the hypotheses
of the study.
Recall that hypotheses may be qualitative as well as quantitative and that hypotheses are not always stated
explicitly, particularly in economic research.
The methods and procedures specify the means through which the various hypotheses are tested (examined)
The appropriateness of the methods and procedures depends on the problem and the objectives.
This is yet another reminder of the importance of the problem identification and specification in the research
process.
Methods and procedures directly address the objectives, but the research objectives are derived from the
research problem.
Consequently, the research methods and procedures are driven by the problem and objectives, not the other way
around.
10
Continuous
Discrete
Ordinal
Likert scale
Nominal
Dichotomous
Count/Non-negative
11
12
Time series
yt = α + β1X1t + β2X2t + εt t = 1, 2, …, T
Panel data
yit = α + β1X1it + β2X2it + εit i = 1, 2, …, N and t = 1, 2, …, T
13
14
Reability and validity of the data - selection bias, accuracy, consistency responses by respondents
Source of data
Data discrepancy
Variable and proxy used
Data measurement
15
16
Mean
Minimum
Maximum
Skewness
Kurtosis
Distribution : Jarque-Bera test
Number of observation
17
18
Standardizing scale
Natural logarithm : Continuous data
Demean : Continuous data
Normalization : discrete or continuous data
Data smoothing (time series)
Seasonal adjustment
Denoise – Kalman Filter, wavelet, Hodrick-Prescott
19
Outlier (cross-section)
Outlier will effect the estimation
Outlier detection: DFIT, Cook’s D, scatter plot
Missing value
Interpolation
High frequency to low frequency
Low frequency to high frequency
Unbalanced panel
20
21
Choose the estimator that can compatible with our data structure, size
Calculate the coefficient to investigate the relationship, sign and size
23
Identify which independent variable (X) have a statistically significant relationship on dependent variable (Y)
1 unit increase of X1, y will increase 0.5 unit
1 unit increase of X2, y will decrease 0.3 unit
If both (IV and DV) use log form, we can say 1 % increase of X1, y will increase 0.5 %, that indicate the elasticity
between X and Y.
24
Identify which independent variable (X) have a statistically significant relationship on dependent variable (Y)
Depend on the sign and size.
Normalization:
X1 = 0.5/1-(0.2+0.4)
25
26
Issue
Problem statement
Research question
Research objective
Hypothesis
Model specification
Estimation
27
28
Interest rate H1
H2
Exchange rate Inflation
H3
Consumption
Hypothesis
Ho: Higher interest rate can not reduce the inflation
H1: Higher interest rate can reduce the inflation
Model specification
𝐼𝑁𝐹𝑖 = α +𝛽1 𝐼𝑅i +𝛽2 𝐸𝑅i +𝛽3 𝐶 i + 𝜀𝑖Ƹ i = 1, 2, …, N
Estimation
Ordinary least square
Two stage least square
Diagnostic Test
Heteroscedasticity, autocorrelation, multicollinearity, misspecification
30
𝑖 = α - 0.5𝑙𝑛𝐼𝑅
𝑙𝑛𝐼𝑁𝐹 𝑖 + 0.3𝑙𝑛𝐸𝑅
𝑖 + 0.7𝑙𝑛𝐶
𝑖 + 𝜀𝑖𝑡
Ƹ
p-value (0.00) (0.00) (0.00)
Identify which independent variable (X) have a statistically significant relationship on dependent variable (Y)
1% increase in IR, INF will decrease 0.5%
1% increase in ER, INF will increase 0.3%
1% increase in C, INF will increase 0.7%
31
32
33
35
37
38
10. T>N Heterogenous I(0) & I(1) Yes (Westerlund) Pooled Mean Group (PMG) Short run relationship
Dynamic Panel Mean Group (MG) Long run relationship
Cointegration
11. T>N Non-stationary Panel I(0) & I(1) Yes (Westerlund) Common Correlated Mean Short run relationship
with Cross-Sectional Group (CCMG) Long run relationship 39
Dependency
AA017004 ECONOMETRIC THEORY AND METHODS
CORRELATION
Cov(X , Y)
Corr(X , Y)
StdDev(X) StdDev(Y)
Correlation measures the strength of the linear relationship between two variables.
Strength
not the slope
Linear
misses nonlinearities completely
Two
shows only “shadows” of multidimensional relationships
AA017004 ECONOMETRIC THEORY AND METHODS 41
A correlation of +1 would
arise only if all of the
points lined up perfectly.
Customers can pay extra for expedited service (shorter leadtime at moderate extra cost), or for express service
(shortest leadtime at highest cost)
Those who chose to save money and wait longer ended up (slightly) regretting their choice.
Most customers who chose rapid service weren’t given access to order tracking.
They didn’t need it, and were still happy with their fast deliveries.
Causality (also referred to as causation, or cause and effect) is influence by which one event, process, state,
or object (a cause) contributes to the production of another event, process, state, or object (an effect) where the
cause is partly responsible for the effect, and the effect is partly dependent on the cause.
Unidirectional : ‘x’ being a cause of ‘y” but “y” is not being a cause of “x” (x y) (y→ x)(exogenous)
Bidirectional : ‘x’ being a cause of ‘y” and “y” is being a cause of “x” (x ↔ y) (endogenous)
/
Neutral : ‘x’ is not being a cause of ‘y” and “y” is not being a cause of “x” (x ↔ y) (neutral)
/
Causality can be used for time series (Granger causality based on VAR or VECM) and panel data (Granger
causality, Dumitrescu-Hurlin causality).
SPSS
Eviews
STATA
MATLAB
GAUSS
RATS
Oxmetrix
57