Page 357 - Statistics for Environmental Engineers
P. 357

L1592_Frame_C41  Page 367  Tuesday, December 18, 2001  3:24 PM









                       Why Autocorrelation Distorts the Variance Estimates
                       Suppose that the system generating the data has the true underlying relation η = β 0  + β 1 x, where x could
                       be any independent variable, including time as in a times series of data. We observe n values: y 1  = η +
                       e 1 ,…, y i−2  = η + e i−2 , y i−1  = η + e i−1 , y i  = η + e i ,…, y n  = η + e n . The usual assumption is that the residuals
                       (e i ) are independent, meaning that the value of e i  is not related to e i−1 , e i−2 , etc. Let us examine what
                       happens when this is not true.
                        Suppose that the residuals (e i ), instead of being random and independent, are correlated in a simple
                       way that is described by e i  = ρ e i−1  + a i , in which the errors (a i ) are independent and normally distributed
                                           2
                       with constant variance  σ .  The strength of the autocorrelation is indicated by the  autocorrelation
                       coefficient (ρ), which ranges from −1 to +1. If ρ = 0, the e i  are independent. If ρ is positive, successive
                       values of e i  are similar to each other and:

                                                        e i =  ρe i−1 +  a i
                                                       e i−1 =  ρe i−2 +  a i−1
                                                       e i−2 =  ρe i−3 +  a i−2


                       and so on. By recursive substitution we can show that:
                                                  (
                                             e i =  ρρe i−2 + a i−1 ) + a i =  ρ e i−2 + ρa i−1 +  a i
                                                                   2
                       and

                                                       3
                                                              2
                                                  e i =  ρ e i−3 +  ρ a i−2 +  ρa i−1 +  a i
                       This shows that the process is “remembering” past conditions to some extent, and the strength of this
                       memory is reflected in the value of ρ.
                        Reversing the order of the terms and continuing the recursive substitution gives:

                                              e i =  a i +  ρa i−1 +  ρ a i−2 + ρ a i−3 +  ⋅⋅⋅ρ a i−n
                                                                   3
                                                                           n
                                                            2
                       The expected values of a i , a i−1 ,… are zero and so is the expected value of e i . The variance of e i  and the
                       variance of a i , however, are not the same. The variance of e i  is the sum of the variances of each term:
                                                      (
                                                                 (
                                    σ e =  Var a i + ρ Var a i−1 ) + ρ Var a i−2 ) + ⋅⋅⋅ +  ρ Var a i−n ) + ⋅⋅⋅
                                            ()
                                                                                (
                                      2
                                                             4
                                                                            2n
                                                  2
                                                       2
                       By definition, the a’s are independent so σ a  = Var(a i ) = Var(a i−1 ) = … = Var(a i−n ). Therefore, the variance
                       of e i  is:
                                                      2
                                                               4
                                                σ e =  σ a 1 +(  ρ +  ρ +  … +  ρ +  … )
                                                 2
                                                                       2n
                                                           2
                       For positive correlation (ρ > 0), the power series converges and:
                                                                          1
                                                               2n
                                                        4
                                               ( 1 +  ρ +  ρ +  … +  ρ +  … ) =  --------------
                                                    2
                                                                        1 ρ–  2
                       Thus:
                                                                 1
                                                        σ e =  σ a --------------
                                                          2
                                                               2
                                                               1 ρ–  2
                       © 2002 By CRC Press LLC
   352   353   354   355   356   357   358   359   360   361   362