Observing Xi at a time ti ( i = 1….n )
Model : Xi = f(ti) + Îi à ( signal + noise )
Problem : We need to forecast X for t* however Îi ( noise ) has no deterministic component.
Estimate problem :
What is f?
So we assume a parameterized model:
ie. f(t) = a + bt and so Îi = Xi – f(t)
There are 2 ways to estimate a and b :
1. Least squares method:
minimize the square of the Î (noise or error) to get a value of a and b.
a,b = arg min å[ Xi - fa,b(ti) ]2 ( i=1..n)
2. Least absolute deviations:
a,b = arg min å | Xi - fa,b(ti) | ( i=1..n)
à forecast at t* is fa,b(ti *)
It is also good to find out a forecast of Î , like this if we know Îi then we know Îi +1 .
Forecast at tn+1 à
fa,b(tn+1)
+ În+1
If we know Î from (1….n) then we can forecast for În+1.
Îi + g(ti) + hi ( ie. g should be captured in f where xi = f(ti) +Îi )
so Xi = f(ti) + g(ti) + h
We can assume În+1to be a function of ( Î1….În).
So we come to the …….
În+1 = F1În + F2În-1 + FpÎn-(p-1) + NOISE
This is the AR(p) model….
So in general :
Îi+1
= F1Îi
+ F2Îi-1
+ FpÎi-(p-1)
+ NOISE
For example :
AR(1) à Îi+1= FÎi + hi+1
If the above is correct then à Xi = f(ti) + Îi
What is F
?
If Î is positively correlated then :
Large Îi à large Îi+1
Small Îi à small Îi+1
and the above is vice versa for a negative correlation!
F > 0 for a positive correlation ( low frequency ( red noise))
F < 0 for a negative correlation ( high frequency (blue noise))
F = 0 is for white noise.
Definition of Correlation :
r = å ( yi – y)(zi – z) -1 < r < 1
(1/nå
(yi – y)2)1/2 (1/nå
(yi – y)2)1/2
Estimate of F in AR(1) is the PMR between (Î1 … În-1) and (Î2… În)
ie. subsequent pairs of observations
Assume åÎi = 0 ( i=1..n-1) & åÎi = 0 (i=1..n)
Then F estimated by :
F = å Îi Îi +1( i = 1..n-1)
(å (Îi 2)1/2 (åÎi 2)1/2 ( i = 1 .. n)