ACF
ACF
ACF
ARIMA Modelling
Introduction
Define what stationarity is and why it is so
important to Econometrics
Describe the Autocorrelation coefficient
and its relationship to stationarity
Evaluate the Q-statistic
Describe the components of an
Autoregressive Integrated Moving Average
Model (ARIMA model)
Stationarity
A strictly stationary process is one where
the distribution of its values remains the
same as time proceeds, implying that the
probability lies in a particular interval is the
same now as at any point in the past or
the future.
However we tend to use the criteria
relating to a weakly stationary process to
determine if a series is stationary or not.
Stationary Series
E ( yt )
2
E ( yt )
E ( yt1 )( yt 2 ) t 2t1 , t1 , t 2
Stationary Series
Non-stationary Series
Stationary Data
It is important to determine if our data is
stationary before the regression. This can
be done in a number of ways:
- plotting the data
- assessing the autocorrelation function
- Using a specific test on the
significance of the autocorrelation
coefficients.
- Specific tests to be covered later.
0
variance
(Yt Y )(Yt k Y )
2
(Yt Y )
Correlogram
The sample correlogram is the plot of the ACF
against k.
As the ACF lies between -1 and +1, the
correlogram also lies between these values.
It can be used to determine stationarity, if the
ACF falls immediately from 1 to 0, then equals
about 0 thereafter, the series is stationary.
If the ACF declines gradually from 1 to 0 over a
prolonged period of time, then it is not stationary.
Q statistic
m
Q n k
k 1
n sample size
m lag length
Ljung-Box Statistic
This statistic is the same as the Q statistic in
large samples, but has better properties in
small samples.
LB n(n 2) (
k 1
2
k
(n k )
Partial ACF
The Partial Autocorrelation Function
(PACF) is similar to the ACF, however it
measures correlation between
observations that are k time periods apart,
after controlling for correlations at
intermediate lags.
This can also be used to produce a partial
correlogram, which is used in Box-Jenkins
methodology (covered later).
Q-statistic Example
The following information, from a specific
variable can be used to determine if a time
series is stationary or not.
k 1
2
k
n 60
0.32
Q-statistic
Q 60 * 0.32 19.2
2
(4) 9.488
19.2 9.488 reject H 0
The series is not stationary as the ACFs
are jointly significantly different to 0.
Autoregressive Process
An AR process involves the inclusion of
lagged dependent variables.
An AR(1) process involves a single lag, an
AR(p) model involves p lags.
AR(1) processes are often referred to as
the random walk, or driftless random walk
if we exclude the constant.
AR Process
yt 1 yt 1 2 yt 1 .... p yt p ut
p
yt i yt i ut
i 1
p
yt i Li yt ut
i 1
Li lag operator
MA process
yt ut 1ut 1 2ut 2 ... q ut q
q
yt i ut i ut
i 1
q
yt i Li ut ut
i 1
yt ( L)ut
Where : ( L) 1 1L 2 L2 ... q Lq
MA process
To estimate moving average processes,
involves interpreting the coefficients and tstatistics in the usual way
It is possible to have a model with lags on
the 1st but not 2nd, then 3rd lags. This
produces the problem of how to determine
the optimal number of lags.
MA process
The MA process has the following properties relating to
its mean and variance:
-
E ( yt )
var( yt )
2
(1 1
2
2
2
2
... q )
Example of an MA Process
y t 0.7 0.8 yt 1 0.3ut 1
(0.1) (0.2)
2
(0.1)
Example
In the previous slide we have estimated a
model using an AR(1) process and MA(1)
process or ARMA(1,1) model, with a lag
on the MA part to pick up any inertia in
adjustment in output.
The t-statistics are interpreted in the same
way, in this case only one MA lag was
significant.
Conclusion
Before conducting a regression, we need to
consider whether the variables are stationary or
not.
The ACF and correlogram is one way of
determining if a series is stationary, as is the Qstatistic
An AR(p) process involves the use of p lags of
the dependent variable as explanatory variables
A MA(q) process involves the use of q lags of
the error term.