Chapter2.Data Pattern and Techniques Selection
Chapter2.Data Pattern and Techniques Selection
Data Patterns
Horizontal
When there is no trend in the data pattern, we deal
with horizontal data pattern.
Forecast Variable
Mean
Time
2.1. Data Pattern
Trend
the long-term component that represents the growth
or decline in the time series over an extended period
of time.
Trend Yt Trend
Yt
t t
Yt Trend Yt
Trend
t t
2.1. Data Pattern
Cyclical
• Observations
exhibit rises and
falls that are not
of a fixed period
• Occurs with
business and
economic
expansions and
contractions.
• Lasts longer
than 1 year.
• Correlated with
business cycles.
2.1. Data Pattern
Seasonal Pattern
A predictable
and repetitive
movement
observed around
a trend line
within a period
of 1 year or less.
2.1. Data Pattern
Seasonal Pattern
2.1. Data Pattern
The monthly housing sales show strong seasonality within each year, as
well as some strong cyclic behaviour with a period of about 6–10 years.
There is no apparent trend in the data over this period
2.1. Data Pattern
The US treasury bill contracts show results from the Chicago market for 100 consecutive
trading days in 1981.
There is no seasonality, but an obvious downward trend.
Possibly, if we had a much longer series, we would see that this downward trend is
actually part of a long cycle, but when viewed over only 100 days it appears to be a
trend.
2.1. Data Pattern
The daily change in the Google closing stock price has no trend, seasonality or
cyclic behaviour.
There are random fluctuations which do not appear to be very predictable, and
no strong patterns that would help with developing a forecasting model.
2.1. Data Pattern
Autocorrelated Pattern
• Data in one period are related to their values in the
previous period.
• Generally, if there is a high positive autocorrelation,
the value in the month of June, for example, is
positively related to the values in the month of
May.
• This pattern is more fully discussed when we talk
about the Box–Jenkins methodology.
2.1. Data Pattern
Autocorrelated Pattern
2.1. Data Pattern
Autocorrelated Pattern
2.1. Data Pattern
Autocorrelated Pattern
Covariance and correlation: measure extent of linear
relationship between two variables (y and x).
Autocovariance and autocorrelation: measure linear
relationship between lagged values of a time series
y.
We measure the relationship between:
yt and yt−1
yt and yt−2
yt and yt−3
etc.
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Artwork by @allison_horst
2.1. Data Pattern
Autocorrelated Pattern
The correlogram or autocorrelation function is a graph of the
autocorrelations for various lags of a time series.
2.1. Data Pattern
Autocorrelated Pattern
Autocorrelation coefficients for different time lags for a
variable can be used to answer the following questions about
a time series:
1. Are the data random?
2. Do the data have a trend (nonstationary)?
3. Are the data seasonal?
4. Are the data stationary?
2.1. Data Pattern
Autocorrelated Pattern
1. Are the data random?
• If a series is random, the autocorrelations between Yt and
Yt-k for any time lag k are close to zero.
• The successive values of a time series are not related to
each other
2.1. Data Pattern
Autocorrelated Pattern
2. Do the data have a trend (nonstationary)?
• If a series has a trend, successive observations are highly
correlated.
• rk typically are significantly different from zero for the first several
time lags
• rk gradually drop toward zero as the number of lags k increases
2.1. Data Pattern
Autocorrelated Pattern
3. Are the data seasonal?
• If a series is seasonal, a pattern related to the calendar
repeats itself over a particular interval of time (usually a
year).
• Observations in the same position for different seasonal
periods tend to be related.
• If quarterly data with a seasonal pattern are analyzed, a significant
autocorrelation coefficient will appear at time lag 4.
• If monthly data are analyzed, a significant autocorrelation
coefficient will appear at time lag 12.
2.1. Data Pattern
Autocorrelated Pattern
Trend and seasonality in ACF plots
When data have a trend, the autocorrelations for
small lags tend to be large and positive.
When data are seasonal, the autocorrelations will
be larger at the seasonal lags (i.e., at multiples of
the seasonal frequency)
When data are trended and seasonal, you see a
combination of these effects.
2.1. Data Pattern
Autocorrelated
Pattern
When data are seasonal,
the autocorrelations will be
larger at the seasonal lags
(i.e., at multiples of the
seasonal frequency)
• r4 higher than for the other lags. This is due to the seasonal pattern in the
data: the peaks tend to be 4 quarters apart and the troughs tend to be 2
quarters apart.
• r2 is more negative than for the other lags because troughs tend to be 2
quarters behind peaks
2.1. Data Pattern
Autocorrelated
Pattern
When data have a
trend, the
autocorrelations for
small lags tend to be
large and positive.
2.1. Data Pattern
Autocorrelated
Pattern
2.1. Data Pattern
Which is which?
2.1. Data Pattern
Autocorrelated Pattern
4. Are the data stationary?
• A stationary time series is one whose mean and variance remain
constant over time.
• Consequently, a stationary series that varies about a fixed level
(no growth or decline) over time.
Autocorrelated Pattern
White noise data is uncorrelated across time with zero
mean and constant variance.
Autocorrelated Pattern
2.1. Data Pattern
Autocorrelated Pattern
Monthly total number of pigs slaughtered in the state of
Victoria, Australia, from January 2014 through December
2018 (Source: Australian Bureau of Statistics.)
Error in Forecasting
et Yt Yˆt
• Measures the average error that can be expected
over time.
• The average error concept has some problems with it.
The positive and negative values cancel each other
out and the mean is very likely to be close to zero.
2.2. Measures of Accuracy in Forecasting
• MAPE, MPE is scale independent but is only sensible if yt >> 0 for all i,
and y has a natural zero.
• MPE is used to determine whether a forecasting method is biased
(consistently forecasting low or high).
If the forecasting approach is unbiased, MPE ~ 0.
If the result is a large negative percentage, the forecasting
method is consistently overestimating.
If the result is a large positive percentage, the forecasting method
is consistently underestimating.
2.2. Measures of Accuracy in Forecasting
• Qualitative Models
• Technological Approaches
2.3. Technique Selection
Quantitative Models
• Also known as statistical models.
• Quantitative Models
• An example of a quantitative model is shown below:
Yt 1 0 1Yt 2Yt 1
Y t 1 = Sales one time period into the future
Yt = Sales in the current period
Yt 1 = Sales in the last period
2.3. Technique Selection
Qualitative Models
• Non-statistical or judgment models
• Expert opinion
• Executive opinion
• Focus groups
• Delphi method
2.3. Technique Selection
Technological Approach
• Combines quantitative and qualitative methods.
• Level of detail
Pattern of data
Type of model
Examples:
the number of breakdowns per week on an assembly line having a
uniform production rate,
the unit sales of a product or service in the maturation stage of its life
cycle,
Examples:
changing income to per capita income
Examples:
• moving averages,
Examples
the demands for electronic components, which increased with the
advent of the computer
Examples
the sales revenues of consumer goods,
Examples:
salaries,
production costs,
prices
2.3. Technique Selection
Examples:
salaries,
production costs,
Prices
Simple Regression,
Growth Curves,
Exponential Models,
30
25
20
15
Cumulative Error
10 Model A
5 Model B
0 Model C
-5 Model D
-10
-15
-20
-25
Time
2.4. Model Evaluation
Actual Change
Y
Actual Change
400
300
200
Forecast Change
100
0
-300 -200 -100 0 100 200 300 400
-100
-200
-300
-400
Chapter Summary
• Data Patterns
• Forecasting Methodologies
• Technique Selection
• Model Evaluation