Page 171 - Statistics and Data Analysis in Geology
P. 171

Statistics and Data Analysis in  Geology - Chapter 6

             against
                                             H1 : P1 #Po
             The null hypothesis states that the mean vector of  the parent population  of  the
             first  sample is the same as the mean vector of  the parent population from which
             the second sample was drawn.
                 The test we must use is a multivariate equivalent of  Equation (2.48) on p. 73.
             In that two-sample t-test, we used a pooled estimate of  the population  variance
             based on both samples.  Accordingly, we must compute a pooled estimate, S,,  of
             the common variance-covariance matrix from our two multivariate samples. This
             is done by calculating a matrix of  sums of  squares and products for each sample.
             We  can use the terminology of  discriminant  functions and denote the matrix of
             sums of  squares and cross products of  sample A as SA; similarly, the matrix from
             sample B is SB. The pooled estimate of  the variance-covariance matrix is

                                     S,  = (nA + nB  - 2)-l (sA + sB)               (6.32)

             We  must next find the difference between the two mean vectors, D = EA - XB. Our
             T2 test has the form
                                          T2 =                                      (6.33)
                                                nAnB  D‘S, lD
                                               nA + nB
             The significance of the T2 test statistic can be determined by the F-transformation:
                                            nA + nB  - m - 1
                                        F=                  T2                      (6.34)
                                             (nA + nB  - 2)m

             which has m and (nA + nB  - m - 1) degrees of  freedom (Morrison, 1990).

              Eq ua I ity of varia nce-covaria nce matrices
             An underlying assumption in the two preceding tests is that the samples are drawn
             from populations having the same variance-covariance matrix.  This is the multi-
             variate equivalent of the assumption of equal populationvariances necessary to per-
             form t-tests of means. In practice, an assumption of  equality may be unwarranted,
             because samples which exhibit a high mean often will also have a large variance.
             You will recall from Chapter 4 that such behavior is characteristic of many geologic
             variables such as mine-assay values and trace-element concentrations. Equality of
             variance-covariance matrices may be checked by the following “test of generalized
             variances” which is a multivariate equivalent of  the F-test (Morrison, 1990).
                  Suppose we have k  samples of  observations, and have measured m variables
             on each observation.  For  each sample a variance-covariance matrix, Sk, may be
             computed. We wish to test the null hypothesis



             against the alternative
                                             H1   Xi #Ej
                 The null hypothesis states that all k population variance-covariance matrices
             are the same.  The alternative is that  at least two of  the matrices are different.
             Each variance-covariance matrix Si is an  estimate of  a population matrix Xi. If the
             parent populations  of  the k  samples are identical, the sample estimates may be

             484
   166   167   168   169   170   171   172   173   174   175   176