Page 68 - Statistics and Data Analysis in Geology
P. 68

Statistics and Data Analysis in  Geology - Chapter 3

                 What circumstances will lead to singularity? The condition indicates that two
             or more rows (or columns) of the matrix are linear combinations or linear transfor-
             mations of  other rows; that is, the values in some rows (or columns) are dependent
             on values in other rows. For example, the determinant

                                              1 2 3
                                              4  5  6  =O
                                              246


             is zero because the third row of  the matrix is simply twice the first row.  Similarly,
             the determinant


                                              1 2 3
                                              4  5  6  =O
                                              579

             is zero because the third row is the sum of  rows one and two.  Of  course, in real
             problems the source of  singularity usually is not so obvious.  Consider the data
             in file BAL,TIC.TXT, which gives the weight-percent of  sand in five successive size
             fractions, measured on bottom samples collected in an area of the Baltic Sea. We can
             calculate correlations between the five sand size categories and place the results in
             a square, symmetric correlation matrix:

                                  1     0.243   -0.301   0.096   -0.261
                                0.243     1     -0.969   -0.562  -0.422  I
                               -0.301  -0.969      1     0.340    0.253
                                0.096   -0.562   0.340     1      0.691
                               -0.261  -0.422    0.253   0.691      1


             It is not obvious that this matrix should be singular with a zero determinant, yet
             it is.  The linear dependence comes about because the weight-percentages in the
             five size categories sum to 100 for each observation, so there are induced negative
              correlations between the size categories.  (Actually, because of  rounding during
              computations, you may compute a correlation matrix that is not exactly singular.
             Depending upon the numerical precision of  the computer program, rather  than
              exactly 0, you may observe a very small determinant such as -0.0002.  A matrix
             with a determinant near zero is said to be ill-conditioned.)
                 Finally, there is another special case of  interest.  An identity matrix has a de-
              terminant equal to  1.0.  If  several variables are completely independent of  each
              other, their correlations will be near zero and they will form a correlation matrix
              that approximates an identity matrix.  The determinant of  such a matrix will be
              close to one, and its logarithm will be close to zero; this is the basis for one test of
              independence between variables.







              140
   63   64   65   66   67   68   69   70   71   72   73