Lecture 15: Introduction To Box-Jenkins Methodology of Forecasting
Lecture 15: Introduction To Box-Jenkins Methodology of Forecasting
Forecasting
1
Box-Jenkins Methodology
The Box-Jenkins methodology of forecasting is
different from most methods because it does not
assume any particular pattern in the historical data
of the series to be forecast.
3
Definition
4
Example
5
Implementing the model Building Strategy
• Step 1: Model Identification
The first step in model identification is to determine whether
the series is stationary— that is, whether the time series
appears to vary about a fixed level. It is useful to look at a plot
of the series along with the sample autocorrelation function.
6
Differencing Required
• If the series is not stationary, it can often be
converted to a stationary series by differencing.
That is, the original series is replaced by a series
of differences. An ARMA model is then specified
for the differenced series. In effect, the analyst is
modeling changes rather than levels.
Once a stationary
series has been
obtained, the analyst
must identify the
form of the model to
be used by
comparing the
autocorrelations and
partial 7
Partial Auto Correlations
8
Model Identification
• Identifying the model form is
accomplished by comparing the
autocorrelations and partial
autocorrelations computed from the data
to the theoretical autocorrelations and
partial autocorrelations for the various
ARIMA models.
Theoretical Behavior for AR(1) Model
10
Theoretical Behavior for AR(2) Model
12
Theoretical Behavior for MA(2) Model
16
Principal of Parsimony
• All things being equal, simple models are
preferred to complex models. This is known as
the principle of parsimony.
• With a limited amount of data, it is relatively easy to
find a model with a large number of parameters that
fits the data well.
• However, forecasts from such a model are likely to be poor
because much of the variation in the data is due to the
random error that is being modeled.
• The goal is to develop the simplest model that provides an
adequate description of the major features of the data.
17
Example : Model Identification
Neither pattern appears
to die out in a declining
manner at low lags.
Lynn decided to fit both
ARIMA(1, 1, 0) and
ARIMA(0, 1, 1) models to
the Transportation
Index.
18
Step 2: Model Estimation
Once a tentative model has been selected, the
parameters for that model must be estimated.
• The parameters in ARIMA models are estimated by
minimizing the sum of squares of the fitting errors.
These least squares estimates must, in general, be
obtained using a nonlinear least squares procedure.
• A nonlinear least squares procedure is simply an
algorithm that finds the minimum of the sum of
squared errors function.
• Once the least squares estimates and their standard
errors are determined, t values can be constructed
and interpreted in the usual way.
• Parameters that are judged significantly different
from zero are retained in the fitted model; 19
Residual Mean Square Error
21
22
Example: Estimation Results
23
Step 3: Model Checking
• Before using the model for forecasting, it must be checked
for adequacy. Basically, a model is adequate if the
residuals cannot be used to improve the forecasts. That is,
the residuals should be random.
25
Example: Model Diagnostics
ARIMA(1, 1, 0): Model for Transportation Index
In general, for a given confidence level, the longer the forecast lead time, the
larger the prediction interval. This is sensible, since the uncertainty is expected
to be greater for a forecast of a distant value than it is for a forecast of, say, the
next observation. Calculating forecasts and prediction intervals is tedious and
best left to the computer. Computer programs that fit ARIMA models generate
forecasts and prediction intervals at the analyst’s request.
2. As more data become available, the same ARIMA model can be used to
generate
revised forecasts from another time origin.
3. If the pattern of the series appears to be changing over time, the new data
may be
used to re-estimate the model parameters or, if necessary, to develop an entirely
new model.
28
Monitor the Forecast Errors
• It is a good idea to monitor the forecast errors. If
the most recent errors tend to be consistently
larger than the previous errors, it may be time to
reevaluate the model. At this point, another
iteration of the model-building strategy may be
required. The same holds true if the recent
forecast errors tend to be consistently positive
(underpredicting) or negative (overpredicting).
29
Example: Forecasting with the Model
• Lynn opted for the ARIMA(1, 1, 0) model on the
basis of its slightly better fit. She checks the
forecast for period 66 for this model as follows:
30