Page 252 - Elements of Distribution Theory
P. 252

P1: JZP
            052184472Xc08  CUNY148/Severini  May 24, 2005  17:54





                            238                       Normal Distribution Theory

                                                                        1
                            Corollary 3.2 that X has the same distribution as µ +   2 Z. Hence, part (vii) of the theorem
                            holds.


                            Example 8.1 (Bivariate normal distribution). Suppose that X = (X 1 , X 2 ) has a two-
                            dimensional multivariate normal distribution. Then  , the covariance matrix of the distri-
                            bution, is of the form
                                                                2
                                                               σ 1  σ 12
                                                           =        2
                                                               σ 21 σ
                                                                    2
                                   2
                            where σ denotes the variance of X j , j = 1, 2, and σ 12 = σ 21 denotes the covariance of
                                   j
                            X i , X j .
                              We may write σ 12 = ρσ 1 σ 2 so that ρ denotes the correlation of X 1 and X 2 . Then

                                                       σ 1  0   1 ρ    σ 1  0
                                                   =                          .
                                                       0 σ 2   ρ 1     0 σ 2
                            It follows that   is nonnegative-definite provided that

                                                              1 ρ
                                                              ρ 1
                            is nonnegative-definite. Since this matrix has eigenvalues 1 − ρ, 1 + ρ,   is nonnegative-
                            definite for any −1 ≤ ρ ≤ 1. If ρ =±1, then X 1 /σ 1 − ρX 2 /σ 2 has variance 0.

                            Example 8.2 (Exchangeable normal random variables). Consider a multivariate normal
                            random vector X = (X 1 , X 2 ,..., X n ) and suppose that X 1 , X 2 ,..., X n are exchangeable
                            random variables. Let µ and   denote the mean vector and covariance matrix, respectively.
                            Then, according to Theorem 2.8, each X j has the same marginal distribution; hence, µ
                            must be a constant vector and the diagonal elements of   must be equal. Also, each pair
                            (X i , X j ) must have the same distribution; it follows that the Cov(X i , X j )isa constant, not
                            depending on i, j. Hence,   must be of the form
                                                             1 ρρ ··· ρ
                                                                         
                                                            ρ 1 ρ ··· ρ 
                                                      = σ  2     .       
                                                                  .
                                                                 .       
                                                             ρρρ ··· 1
                            for some constants σ ≥ 0 and ρ.Of course,   is a valid covariance matrix only for certain
                            values of ρ; see Exercise 8.8.

                            Example 8.3 (Principal components). Let X denote a d-dimensional random vector with
                            a multivariate normal distribution with mean vector µ and covariance matrix  . Consider
                                                              T
                            the problem of finding the linear function a X with the maximum variance; of course, the
                                      T
                            variance of a X can be made large by choosing the elements of a to be large in magnitude.
                            Hence, we require a to be a unit vector. Since
                                                                   T
                                                            T
                                                        Var(a X) = a  a,
                                                                    T
                            we want to find the unit vector a that maximizes a  a.
   247   248   249   250   251   252   253   254   255   256   257