Page 49 - Introduction to Statistical Pattern Recognition
P. 49

2 Random Vectors and their Properties                         31



                      Simultaneous Diagonalization

                          We can diagonalize two symmetric matrices El and C2 simultaneously by a
                      linear transformation. The process is as follows:

                           (I) First, we whiten C, by
                                               y = @-”2@Tx                       (2.93)
                      where 0 and @ are the eigenvalue and eigenvector matrices of E I  as

                                         El@=@@  and  @‘@=I.                     (2.94)
                      Then,  C, and C2 are transformed to

                                            @-1/*@TZ,  QQ-1’2   =                (2.95)

                                            @-l/2@TC2@Q-’/2   = K                (2.96)
                      In general, K is not a diagonal matrix.

                           (2)  Second, we  apply the orthonormal transformation to diagonalize K.
                      That is,

                                                 Z=VY,                           (2.97)
                      where ‘i’ and A are the eigenvector and eigenvalue matrices of K as

                                         KY=YA  and  YTY==I.                     (2.98)
                      As shown in (2.92), the first matrix I of (2.95) is invariant under this transforma-
                      tion. Thus,
                                              Y’IY  =YTY =I,                     (2.99)

                                                   =
                                             YJ~KY A.                           (2.100)
                           Thus, both matrices are diagonalized. Figure 2-3 shows a two-dimensional
                      example of this process.  The combination of steps (1)  and (2) gives the overall
                      transformation matrix w-”~Y’.

                           Alternative approach: The matrices O@-”2\y and A can be calculated
                      directly from X, and C2 without going through the two steps above. This is done
                      as follows:

                           Theorem  We can diagonalize two symmetric matrices as
   44   45   46   47   48   49   50   51   52   53   54