Page 101 - Statistics and Data Analysis in Geology
P. 101

Analysis of Sequences of Data

                 The same reasoning can be applied to determine the probability of any lithology
             two steps hence, from any starting lithology. However, all of the various sequences
             do not have to be worked out individually, because the process of multiplying and
             summing is exactly that used for matrix multiplication. If the transition probability
             matrix is multiplied by itself (that is, the matrix is squared), the result is the second-
             order transition probability matrix describing the second-order Markov properties
             of  the succession:

                          0.78   0    0.22   0        0.64  0.02  0.31  0.02
                           0    0.71  0.29   0 1' = [ 0.05  0.52  0.39  0.03  1
                          0.18  0.07  0.64  0.11      0.26  0.09  0.54  0.11
                           0     0    0.60  0.40      0.11  0.04  0.62  0.23

             Note that the rows of the squared matrix also sum to 100%.
                 The existence of  a significant second-order property can be checked in exactly
             the same manner as we checked for independence between successive states, by
             using a x2 test. If you repeat the test performed earlier, but using the second-order
             transition probability matrix, you should find that the sequence has no significant
             second-order properties.
                 We can estimate the probable state to be encountered at any step in the future
             simply by powering the transition probability matrix the appropriate number of
             times. If the matrix is raised to a sufficiently high power, it reaches a stable state in
             which the rows all become equal to the fixed probability vector, or in other words,
             becomes an independent transition probability matrix and will not  change with
             additional powering.
                 You will note in the example that the highest transition probabilities are from
             one state to itself, particularly  from sandstone to sandstone, from limestone to
             limestone, and from shale to shale.  It is obvious that these transition probabili-
             ties are related to the thicknesses of  the stratigraphic units being sampled and the
             distance between the sample points. For example, the frequencies along the main
             diagonal of  the transition frequency matrix would be doubled while off-diagonal
             frequencies remained unchanged if  observations were made every half-foot.  This
             would greatly enhance the Markovian property, but in a specious manner.  Select-
             ing the appropriate distance between sampling points can be a vexing problem; if
             observations are too closely spaced, the transition matrix reflects mainly the thick-
             ness of  the more massive stratigraphic units. If the spacing is too great, thin units
             may be entirely missed.

             Embedded Markov chains

             The difficulty of  selecting an appropriate sampling interval can be avoided if ob-
             servations are taken only when there is a change in state. A stratigraphic section,
             for example, would be recorded as a succession of beds, each one of  a different
             lithology than the immediately preceding bed.  Table  4-4  contains the record  of
             successive rock types penetrated by a well drilled in the Midland Valley of Scotland
             (these data are contained in file MIDLAND.TXT). The well was drilled through 1600
             ft of  Coal Measures of  Carboniferous age, consisting of interbedded shales, silt-
             stones, sandstones, and coal beds or root zones. These sediments are interpreted
             as having been deposited in a delta plain environment subject to repeated flooding,
             so we would expect that certain lithologies would occur in preferred relations to

                                                                                      173
   96   97   98   99   100   101   102   103   104   105   106