Page 89 - Statistics and Data Analysis in Geology
P. 89

Analysis  of Sequences of Data


                  Table 4-1.  Techniques discussed in this chapter classified  by the nature of the
                  variable and its spacing along a line.  Locations are explicit  if X  is specified for
                    every  Y; locations are implicit if X is implied by the order of observations.
                                               Explicit Location   Implicit Location
                    Nature of  Variables       in Time or Space   in Time or Space
                    Interval or Ratio Data     Interpolation      Zonation
                                               Regression         Seriation
                                               Splines            Autocorrelation
                                                                  Cross-correlation
                                                                  Semivariograms
                                                                  Periodograms
                                                                  Spectral Density
                    Nominal or Ordinal Data    Series of  Events   Markov Chains
                                                                  Runs Tests



             this category does not consider the spacing at all, and only the sequence of  the
             observations is important.
                 The techniques also may be classified on the type of observations they require.
             Some necessitate interval or ratio Observations; the variate must be measured on
             a scale and expressed in real numbers. Other methods accept nominal or ordinal
             data, and observations need only to be categorized in some fashion. In the methods
             discussed in this chapter, the classes are not ranked; that is, state A is not “greater”
             or “larger”  in some sense than states B or C. Nominal data may be represented by
             integers, alphabetic characters, or symbols.
                 In the remainder of  this chapter, we  are going to examine the mathematical
             techniques required to analyze data in sequences.  The methods described here
             do not  exhaust the possibilities by  any means.  Rather, these are a collection of
             operations that have proved valuable in quantitative problem-solving  in the Earth
             sciences, or that seem especially promising. Other methods may be more appropri-
             ate or powerful in specific situations or for certain data sets. However, a familiarity
             with the techniques discussed here will provide an introduction to a diverse field of
             analytical tools. Unfortunately, many of these methods were developed in scientific
             specialties alien to most geologists, and the description of  an application in radar
             engineering, stock market analysis, speech therapy, or cell biology may be difficult
             to relate to a geologic problem. Some of  the methods involve nonparametric statis-
             tics, and these are not widely considered in introductory statistics courses. Because
             of the general unfamiliarity of  most Earth scientists with developments in the nu-
             merical analysis of  data sequences, we have thought it best to present a potpourri
             of  techniques and approaches. As you can see from Table 4.1, these cover a variety
             of  sequences of different types, and are designed to answer different kinds of ques-
             tions. None of  the techniques can be considered exhaustively in this short space,
             but from the examples and applications presented, one or another may suggest
             themselves to the geologist with a problem to solve. The list of  Selected Readings
             can then provide a discussion of  a specific subject in more detail.
                 These methods provide answers to the following broad categories of questions:
             Are the observations random, or do they contain evidence of a trend or pattern? If a
             trend exists, what is its form? Can cycles or repetitions be detected and measured?

                                                                                      161
   84   85   86   87   88   89   90   91   92   93   94