0% found this document useful (0 votes)

21 views

New Microsoft Word Document4

Uploaded by

ADIL NAVEED

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

New Microsoft Word Document4

Uploaded by

ADIL NAVEED

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

To identify two linear time series models for

the Chinese quarterly GDP data (one with

logs and one without logs) using the Box-
Jenkins methodology, follow these steps.
We'll use Python's stats models library for
ARIMA modeling and other libraries for
data handling and visualization.

### Step-by-Step Process

1. Load and inspect the data.

2. *Make the data stationary. *
3. *Identify appropriate ARIMA models
using ACF and PACF plots. *
4. *Fit the ARIMA models. *
5. *Validate the models using diagnostics. *

### Python Code Implementation

First, ensure you have the required libraries

installed:
bash
pip install pandas numpy stats models
matplotlib seaborne

Now, proceed with the Python code to

perform the analysis:

python
import pandas as pd
import numpy as np
import matplotlib. pyplot as plt
from statsmodels. graphics. tsaplots import
plot_acf, plot_pacf
from statsmodels.tsa.arima.model import
ARIMA
from statsmodels.tsa.stattools import ad
fuller

# Load the data

df = pd.read_csv('GDPChina.csv',
index_col='Date', parse_dates=True,
decimal='.')
gdp = df['GDP']

# Function to perform the Dickey-Fuller test

def adf_test(series):
result = adfuller(series)
print('ADF Statistic:', result[0])
print('p-value:', result[1])
print('Critical Values:', result[4])
return result[1] <= 0.05

# Original series
print("ADF test for original series:")
adf_test(gdp)

# Differencing the series to make it

stationary
gdp_diff = gdp.diff().dropna()
print("\nADF test for differenced series:")
adf_test(gdp_diff)

# Plot ACF and PACF for differenced series

fig, ax = plt.subplots(2, 1, figsize=(12, 8))
plot_acf(gdp_diff, lags=40, ax=ax[0])
plot_pacf(gdp_diff, lags=40, ax=ax[1])
plt.show()

# Fit ARIMA model without logs

model = ARIMA(gdp, order=(1,1,1))
model_fit = model.fit()
print(model_fit.summary())

# Log transformation
gdp_log = np.log(gdp)
# Differencing the log-transformed series to
make it stationary
gdp_log_diff = gdp_log.diff().dropna()
print("\nADF test for log-differenced
series:")
adf_test(gdp_log_diff)

# Plot ACF and PACF for log-differenced

series
fig, ax = plt.subplots(2, 1, figsize=(12, 8))
plot_acf(gdp_log_diff, lags=40, ax=ax[0])
plot_pacf(gdp_log_diff, lags=40, ax=ax[1])
plt.show()

# Fit ARIMA model with logs

model_log = ARIMA(gdp_log, order=(1,1,1))
model_log_fit = model_log.fit()
print(model_log_fit.summary())

# Diagnostic plots for models

def diagnostic_plots(residuals, title):
fig, ax = plt.subplots(1, 2, figsize=(12, 6))
residuals.plot(ax=ax[0], title=f'Residuals
of {title}')
plot_acf(residuals, lags=40, ax=ax[1])
plt.show()

print("\nDiagnostics for ARIMA(1,1,1)

without logs:")
diagnostic_plots(model_fit.resid,
"ARIMA(1,1,1) without logs")
print("\nDiagnostics for ARIMA(1,1,1) with
logs:")
diagnostic_plots(model_log_fit.resid,
"ARIMA(1,1,1) with logs")

### Explanation:

1. Load Data: The data is loaded using

pandas.
2. *ADF Test*: The Augmented Dickey-
Fuller (ADF) test checks for stationarity. If
the p-value > 0.05, the series is non-
stationary and requires differencing.
3. *Differencing*: Apply differencing to
make the series stationary.
4. *ACF and PACF Plots*: These plots help
identify the values of \ (p \) and \ (q \) for
the ARIMA model.
5. *ARIMA Models*: Fit the ARIMA (1,1,1)
model for both the original and log-
transformed series.
6. *Diagnostics*: Check residuals to validate
the models.

### Results:

- Without logs: ARIMA (1,1,1) model is

identified for the differenced series.
- *With logs*: Log transformation followed
by differencing, and ARIMA (1,1,1) model is
identified.

These steps and code outline the process to

fit and validate ARIMA models for the
Chinese quarterly GDP data using the Box-
Jenkins methodology. Adjust model orders
based on the ACF/PACF plots and
diagnostics if necessary.

For the models identified in the

previous step, leave for
example the last 24 real values
to compare the two models in
terms of forecasting (out of
sample forecasting exercise).
What is the best model and
why is this your choice?

To compare the forecasting

performance of the identified
models, we'll proceed with an
out-of-sample forecasting
exercise. We'll leave out the
last 24 observations from the
dataset, fit both models on the
training data (the data
excluding the last 24
observations), and then
compare the forecasted values
with the actual values.

Here's the detailed step-by-

step process and the Python
code for performing this
analysis:

### Step-by-Step Process

1. *Split the data into training
and test sets. *
2. *Fit the identified models on
the training set. *
3. *Generate forecasts for the
test period using both models.
*
4. *Compare the forecasted
values with the actual values. *
5. *Evaluate model
performance using appropriate
metrics (e.g., RMSE, MAE). *

### Python Code

Implementation

python
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from
statsmodels.graphics.tsaplots
import plot_acf, plot_pacf
from
statsmodels.tsa.arima.model
import ARIMA
from sklearn.metrics import
mean_squared_error,
mean_absolute_error

# Load the data

df =
pd.read_csv('GDPChina.csv',
index_col='Date',
parse_dates=True, decimal='.')
gdp = df['GDP']

# Split the data into training

and test sets
train_size = len(gdp) - 24
train, test = gdp[:train_size],
gdp[train_size:]
# Fit ARIMA model without logs
on training data
model = ARIMA(train,
order=(1,1,1))
model_fit = model.fit()

# Forecast for the test period

forecast =
model_fit.forecast(steps=24)
forecast = forecast[:24] #
Ensure forecast length matches
test set
# Log transformation
gdp_log = np.log(gdp)
train_log, test_log =
gdp_log[:train_size],
gdp_log[train_size:]

# Fit ARIMA model with logs on

log-transformed training data
model_log = ARIMA(train_log,
order=(1,1,1))
model_log_fit = model_log.fit()
# Forecast for the test period
using the log-transformed
model
forecast_log =
model_log_fit.forecast(steps=2
4)
forecast_log = forecast_log[:24]
# Ensure forecast length
matches test set
forecast_log =
np.exp(forecast_log) # Inverse
log transformation
# Evaluate model performance
rmse_no_log =
np.sqrt(mean_squared_error(t
est, forecast))
mae_no_log =
mean_absolute_error(test,
forecast)
rmse_log =
np.sqrt(mean_squared_error(t
est, forecast_log))
mae_log =
mean_absolute_error(test,
forecast_log)

print("ARIMA(1,1,1) without
logs:")
print(f"RMSE: {rmse_no_log}")
print(f"MAE: {mae_no_log}")

print("\nARIMA(1,1,1) with
logs:")
print(f"RMSE: {rmse_log}")
print(f"MAE: {mae_log}")

# Plot the forecasts vs actual

values
plt.figure(figsize=(12, 6))
plt.plot(test.index, test,
label='Actual')
plt.plot(test.index, forecast,
label='Forecast without logs')
plt.plot(test.index,
forecast_log, label='Forecast
with logs')
plt.legend()
plt.title('Forecast vs Actuals')
plt.show()

### Explanation:

1. Data Splitting: The dataset

is split into training and test
sets, leaving the last 24
observations for testing.
2. *Model Fitting*: Both ARIMA
(1,1,1) models (one without
logs and one with logs) are
fitted on the training set.
3. *Forecasting*: Forecasts for
the test period are generated
from both models. For the log-
transformed model, the
forecasts are transformed back
using the exponential function.
4. *Performance Evaluation*:
The forecasts are compared
with the actual values using
RMSE (Root Mean Squared
Error) and MAE (Mean Absolute
Error).
5. *Plotting*: A plot is
generated to visualize the
actual values against the
forecasts from both models.

### Results and Conclusion:

- *Without logs*:
- RMSE: \ (\text {calculated
RMSE value} \)
- MAE: \ (\text {calculated
MAE value} \)

- *With logs*:
- RMSE: \ (\text {calculated
RMSE value} \)
- MAE: \ (\text {calculated
MAE value} \)
The model with the lower
RMSE and MAE is considered
better.

### Choosing the Best Model:

- The *ARIMA (1,1,1) model

with logs* tends to perform
better in many cases because
log transformation stabilizes
the variance, leading to better
model performance.
- However, the final choice
depends on the specific RMSE
and MAE values obtained. The
model with the lower values
indicates better forecasting
accuracy.

Evaluate the RMSE and MAE

values from the output to
determine which model
provides the best forecasts for
the Chinese quarterly GDP
data.

Img 20180810 0002
No ratings yet
Img 20180810 0002
7 pages
Time+Series+Forecasting Monograph
100% (4)
Time+Series+Forecasting Monograph
58 pages
Time+Series+Forecasting Monograph
No ratings yet
Time+Series+Forecasting Monograph
58 pages
Time Series Analysis Project - CAC 40 - 2018
No ratings yet
Time Series Analysis Project - CAC 40 - 2018
33 pages
Rectangular Tank Design - Roarks
100% (4)
Rectangular Tank Design - Roarks
3 pages
Time Arima 002
No ratings yet
Time Arima 002
11 pages
Mini Project Based On Time Series Forecasting Methods: Data Used
No ratings yet
Mini Project Based On Time Series Forecasting Methods: Data Used
14 pages
Dav 4
No ratings yet
Dav 4
6 pages
Assignment 1 Supplementary
No ratings yet
Assignment 1 Supplementary
5 pages
TIME - ChatGPT Manual 001
No ratings yet
TIME - ChatGPT Manual 001
7 pages
Business analytis C4
No ratings yet
Business analytis C4
10 pages
Time Series Plot of Revenue
No ratings yet
Time Series Plot of Revenue
9 pages
course content
No ratings yet
course content
28 pages
Empirical Finance8
No ratings yet
Empirical Finance8
11 pages
Assignment 4,5 - Scott Denotter
No ratings yet
Assignment 4,5 - Scott Denotter
8 pages
20204038
No ratings yet
20204038
11 pages
Tutorial 9 - Solutions
No ratings yet
Tutorial 9 - Solutions
21 pages
Gdpforecast.r: Rehanshu Vij 2020-12-10
No ratings yet
Gdpforecast.r: Rehanshu Vij 2020-12-10
10 pages
Time Series Analysis
No ratings yet
Time Series Analysis
5 pages
Expt. 12 Forecasting 214
No ratings yet
Expt. 12 Forecasting 214
12 pages
Activity 5 (Time Series) - Rudinas
No ratings yet
Activity 5 (Time Series) - Rudinas
7 pages
et
No ratings yet
et
3 pages
Time Series Analysis of HDFCBANK Stock by Pavan
No ratings yet
Time Series Analysis of HDFCBANK Stock by Pavan
10 pages
lecture_18_build_arima (1)
No ratings yet
lecture_18_build_arima (1)
22 pages
26 Ads Expt9
No ratings yet
26 Ads Expt9
7 pages
Prac TS
No ratings yet
Prac TS
18 pages
ARIMA Procedure Ebook
No ratings yet
ARIMA Procedure Ebook
110 pages
Modules
No ratings yet
Modules
12 pages
TS Arima
No ratings yet
TS Arima
2 pages
Econometrics in MATLAB: ARMAX, Pseudo Ex-Post Forecasting, GARCH and EGARCH, Implied Volatility
No ratings yet
Econometrics in MATLAB: ARMAX, Pseudo Ex-Post Forecasting, GARCH and EGARCH, Implied Volatility
18 pages
RATS Programming Manual
No ratings yet
RATS Programming Manual
255 pages
ARIMA Model Python Example - Time Series Forecasting
No ratings yet
ARIMA Model Python Example - Time Series Forecasting
11 pages
Mitigation and climate change
No ratings yet
Mitigation and climate change
14 pages
Chapter 12 Part 2 - Arima Model Estimation - 2023
No ratings yet
Chapter 12 Part 2 - Arima Model Estimation - 2023
15 pages
Ibd Manual
No ratings yet
Ibd Manual
12 pages
DMPR 4
No ratings yet
DMPR 4
7 pages
CF Steps
No ratings yet
CF Steps
5 pages
STA651 Practical Test 1
No ratings yet
STA651 Practical Test 1
5 pages
TIme Series Week 5
No ratings yet
TIme Series Week 5
6 pages
Time_series_analysis__1718649022
No ratings yet
Time_series_analysis__1718649022
5 pages
Time Series Analysis R
100% (3)
Time Series Analysis R
340 pages
Table of Contents
No ratings yet
Table of Contents
27 pages
Assignment
No ratings yet
Assignment
9 pages
Class Notes
No ratings yet
Class Notes
6 pages
Final Assessment ECON1061 - Sem 1 2024
No ratings yet
Final Assessment ECON1061 - Sem 1 2024
4 pages
Business Forecast Vishay Sood
No ratings yet
Business Forecast Vishay Sood
8 pages
The Box-Jenkins Practical
No ratings yet
The Box-Jenkins Practical
9 pages
MIS410-Chapter7
No ratings yet
MIS410-Chapter7
49 pages
Practicals Data
No ratings yet
Practicals Data
26 pages
GDP Forecasting Using Time Series Analysis
No ratings yet
GDP Forecasting Using Time Series Analysis
15 pages
Arima Model
No ratings yet
Arima Model
4 pages
ForecastingIndividualassignment MohammadMujtaba 12020063
No ratings yet
ForecastingIndividualassignment MohammadMujtaba 12020063
20 pages
LAB9_report
No ratings yet
LAB9_report
6 pages
Project 6 - Time Series PDF
No ratings yet
Project 6 - Time Series PDF
21 pages
Time Series Analysis: Example: Stationary ARIMA
No ratings yet
Time Series Analysis: Example: Stationary ARIMA
25 pages
Handout 2020 Part1 PDF
No ratings yet
Handout 2020 Part1 PDF
82 pages
Certificate
No ratings yet
Certificate
33 pages
Analysis of ARIMA and GARCH Model
No ratings yet
Analysis of ARIMA and GARCH Model
14 pages
Assignment 1
No ratings yet
Assignment 1
10 pages
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
LN JXBH 4 Aw BHwot Fo Ku 2 U
No ratings yet
LN JXBH 4 Aw BHwot Fo Ku 2 U
6 pages
DDP Lab Manuals Physics
No ratings yet
DDP Lab Manuals Physics
111 pages
Hawkeye Prox Switch
No ratings yet
Hawkeye Prox Switch
1 page
Optimising Cost of 24x7 Quality Power - Energy Connect
No ratings yet
Optimising Cost of 24x7 Quality Power - Energy Connect
36 pages
Quadrature Encoding in A Rotary Encoder
No ratings yet
Quadrature Encoding in A Rotary Encoder
6 pages
ENT - F.E.S.S - LARYNGEAL INSTRUMENTS PDF
No ratings yet
ENT - F.E.S.S - LARYNGEAL INSTRUMENTS PDF
30 pages
Nitrogen Compounds - Optical Isomerism: AS Organic Chemistry: Alkenes
No ratings yet
Nitrogen Compounds - Optical Isomerism: AS Organic Chemistry: Alkenes
3 pages
Solutions PDF
No ratings yet
Solutions PDF
33 pages
Pengaruh Pembelajaran Project Based Learning Terhadap Keterampilan Psikomotorik Dan Hasil Belajar Praktek Proyek Work
No ratings yet
Pengaruh Pembelajaran Project Based Learning Terhadap Keterampilan Psikomotorik Dan Hasil Belajar Praktek Proyek Work
10 pages
General + Comments + Formatting: Clean ABAP
No ratings yet
General + Comments + Formatting: Clean ABAP
4 pages
RC - Nulec N Series - V 1.0
No ratings yet
RC - Nulec N Series - V 1.0
15 pages
HANA Tables ColumnStore Merges TokenOwners Internal
No ratings yet
HANA Tables ColumnStore Merges TokenOwners Internal
8 pages
Mechanochemical and Size Reduction Machines
No ratings yet
Mechanochemical and Size Reduction Machines
22 pages
TDS Manual
No ratings yet
TDS Manual
116 pages
Chapter 67 The T = Tan Θ/2 Substitution: EXERCISE 274 Page 750
No ratings yet
Chapter 67 The T = Tan Θ/2 Substitution: EXERCISE 274 Page 750
7 pages
Prediction Test Problems of Basic Physics Physics Education Study Program (Billingual Class)
No ratings yet
Prediction Test Problems of Basic Physics Physics Education Study Program (Billingual Class)
4 pages
Uat Sarp
No ratings yet
Uat Sarp
17 pages
Portable Digital Vibrometer PDV-100: High Resolution Digital Velocity Measurement Portable Robust Lightweight
No ratings yet
Portable Digital Vibrometer PDV-100: High Resolution Digital Velocity Measurement Portable Robust Lightweight
4 pages
VBA Basics
No ratings yet
VBA Basics
33 pages
Firebird 1.5 Error Codes: From MSG - Gbak, Release Sources
No ratings yet
Firebird 1.5 Error Codes: From MSG - Gbak, Release Sources
26 pages
Inverting Schmitt Trigger
No ratings yet
Inverting Schmitt Trigger
3 pages
Xillybus Getting Started Linux
No ratings yet
Xillybus Getting Started Linux
24 pages
Teaching Music in Elementary Grades: 4 Activity
No ratings yet
Teaching Music in Elementary Grades: 4 Activity
10 pages
Lecture 1
No ratings yet
Lecture 1
22 pages
RPF Answer Key
No ratings yet
RPF Answer Key
43 pages
3 Math5Q1Week7
No ratings yet
3 Math5Q1Week7
25 pages
PTC (Posistorr) For Motor Starters: Resistors/Thermistors
No ratings yet
PTC (Posistorr) For Motor Starters: Resistors/Thermistors
2 pages
Position Sensorless Control Without Phase Shifter For
No ratings yet
Position Sensorless Control Without Phase Shifter For
13 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

New Microsoft Word Document4

Uploaded by

New Microsoft Word Document4

Uploaded by

To identify two linear time series models for

the Chinese quarterly GDP data (one with

### Step-by-Step Process

1. Load and inspect the data.

### Python Code Implementation

First, ensure you have the required libraries

Now, proceed with the Python code to

# Load the data

# Function to perform the Dickey-Fuller test

# Differencing the series to make it

# Plot ACF and PACF for differenced series

# Fit ARIMA model without logs

# Plot ACF and PACF for log-differenced

# Fit ARIMA model with logs

# Diagnostic plots for models

print("\nDiagnostics for ARIMA(1,1,1)

1. Load Data: The data is loaded using

- Without logs: ARIMA (1,1,1) model is

These steps and code outline the process to

For the models identified in the

To compare the forecasting

Here's the detailed step-by-

### Step-by-Step Process

### Python Code

# Load the data

# Split the data into training

# Forecast for the test period

# Fit ARIMA model with logs on

# Plot the forecasts vs actual

1. Data Splitting: The dataset

### Results and Conclusion:

### Choosing the Best Model:

- The *ARIMA (1,1,1) model

Evaluate the RMSE and MAE

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

New Microsoft Word Document4

Uploaded by

New Microsoft Word Document4

Uploaded by

To identify two linear time series models for

the Chinese quarterly GDP data (one with

### Step-by-Step Process

1. *Load and inspect the data. *

### Python Code Implementation

First, ensure you have the required libraries

Now, proceed with the Python code to

# Load the data

# Function to perform the Dickey-Fuller test

# Differencing the series to make it

# Plot ACF and PACF for differenced series

# Fit ARIMA model without logs

# Plot ACF and PACF for log-differenced

# Fit ARIMA model with logs

# Diagnostic plots for models

print("\nDiagnostics for ARIMA(1,1,1)

1. *Load Data*: The data is loaded using

- *Without logs*: ARIMA (1,1,1) model is

These steps and code outline the process to

For the models identified in the

To compare the forecasting

Here's the detailed step-by-

### Step-by-Step Process

### Python Code

# Load the data

# Split the data into training

# Forecast for the test period

# Fit ARIMA model with logs on

# Plot the forecasts vs actual

1. *Data Splitting*: The dataset

### Results and Conclusion:

### Choosing the Best Model:

- The *ARIMA (1,1,1) model

Evaluate the RMSE and MAE

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

1. Load and inspect the data.

1. Load Data: The data is loaded using

- Without logs: ARIMA (1,1,1) model is

1. Data Splitting: The dataset