Page 188 - Computational Statistics Handbook with MATLAB
P. 188

Chapter 5: Exploratory Data Analysis                            175


                                                              (
                                                   (
                                                           –
                                                            1
                                                 Θ T ) =  Φ [ F T )]
                                                     1
                                                                 1
                                                              (
                                                   (
                                                            1
                                                           –
                                                 Θ T 2 ) =  Φ [ F T 2 )]                   (5.18)
                                                                        ,
                                                   (
                                                                     ,
                                                 Θ T i ) =  T i ;  i =  3 … d  ,
                             where  Φ  – 1   is the inverse of the standard normal cumulative distribution
                             function and   is a function defined below (see Equations 5.19 and 5.20). We
                                         F
                             see from Equation 5.18, that we will be changing only the first two rows of T.
                              We now describe the transformation of Equation 5.18 in more detail, work-
                             ing  only  with T 1  and T 2  . First,  we  note  that T 1  can  be  written  as
                                                                 *
                                                           *
                                                                α
                                                          α
                                                              ,
                                                           ,
                                                                     ,
                                                   T 1 =  ( z 1 … z j ,  … z n α *  , )
                                    as
                             and T 2
                                                           *
                                                                 *
                                                                β
                                                          β
                                                              ,
                                                                    ,
                                                   T =  ( z ,  … z ,  … z β n *  . )
                                                          1
                                                     2
                                                                j
                                       α *     β *
                             Recall that z j   and z j   would be coordinates of the j-th observation projected
                             onto the plane spanned by  α β,(  *  *  . )
                                                                                     γ
                              Next, we define a rotation about the origin through the angle   as follows
                                                   ˜ 1 t()  1 t()  2 t()  γ
                                                   z j  =  z j  cos  γ +  z j  sin
                                                                                           (5.19)
                                                   ˜ 2 t()  2 t()  1 t()
                                                   z j  =  z j  cos γ –  z j  sin  γ,
                                               ⁄
                                                 ,
                                            ,
                                        ,
                                           ⁄
                                                     ⁄
                             where  γ =  0 π 4 π 8 3π 8   and  z j 1 t()   represents the j-th element of  T 1   at
                             the t-th iteration of the process. We now apply the following transformation
                             to the rotated points,
                                              – 
                                                                          – 
                                                                           1 rz j ) –
                                                      –
                                                                  (
                                     (
                                     1 t + 1)  Φ   (  ˜ 1 t() ) 0.5   2 t + 1)  Φ   ( ˜ 2 t()  0.5   , 
                                              1 rz j
                                    z j   =     ------------------------------   z j  =  ------------------------------  (5.20)
                                                    n                         n     
                                     1 t()
                                    (
                             where rz j )   represents the rank (position in the ordered list) of z j 1 t()  .
                                                                                      ˜
                                     ˜
                              This transformation replaces each rotated observation by its normal score
                             in the projection. With this procedure, we are deflating the projection index
                             by making the data more normal. It is evident in the procedure given below,
                             that this is an iterative process. Friedman [1987] states that during the first
                             few iterations, the projection index should decrease rapidly. After approxi-
                             mate normality is obtained, the index might oscillate with small changes.
                             Usually, the process takes between 5 to 15 complete iterations to remove the
                             structure.
                            © 2002 by Chapman & Hall/CRC
   183   184   185   186   187   188   189   190   191   192   193