Lecture N0. 6
Held on: Wednesday, August 9, 2000
Notes
Prepared by: Abhishek Bapna & Aditya Gandhi
Review
Input Sequence of observations
X1,X2.....Xn
Objective - To forecast Xn+1, Xn+2
Step 1
Signal Extraction
Fit Model
Find parameter estimates that minimize say least squares
i.e. minimize
Step 2
Obtain "residuals"
Fit a time series model to AR(p) model
Autoregression process of order p
Exogenous (from outside) Indogenous (from inside)
Assume that the signal has been filtered out (been subtracted/ has been detrended)
This means that
i.e if
then g(t) should be zero or else the estimate for f(t) is wrong.
Why do we need to estimate
From AR(p) and assuming linear trends for f(t)
if all parameters are estimated
En is already available from Xi - f(ti)
Similarly using Xn+1 we can find the value of Xn+2
As the n+2th terms forecast is based on a previous forecast other than data thus possibility of inaccuracy is higher.
This can be understood as a spread, which increases as we move further away
Forecasting strategy:
How do we estimate value of ?'s?
For AR(p)
Where n is structured
To fine the E's we can use least squares of E's i.e. mininise
Or minimize
This estimates almost exactly the product moment correlation
Choice of p-
Diagnostic
Autocorrelation function (ACF)
The correlation coefficient r relates the two random quantities Y and Z and is given as
rk = correlation between E and E lagged by k
i.e we consider observations (E1, Ek+1), (E2, Ek+2) etc.
say E's have some process
for ex.- for sales the data is more likely to be periodic with similar trends every 12 months.
Typical ACF for periodic/ seasonal series
This somewhat resembles a dampened sinusoid
One can write a program to find ACF.
Standard packages also are available.
Base on when the value of ACF dies or shrinks to 0 can help in deciding p.
The expression for E can also look like this -
Note: we may not be using all the terms and just some part of the data.
This can also be interpreted as a variable time gap situation.
This is known as the Parasimonious choice of p
AIC (Akaike Information Criteria)
This incorporates a factor for p also in it and our minimisation function changes to
More the p's better fit possible but can lead to junk parameters as p goes up
Thus there is an optimal value of p