Page 45 - Statistics and Data Analysis in Geology
P. 45

Statistics and Data Analysis in  Geology - Chapter  2

                 The great advantage of plotting data in cumulative form, however, comes about
             because we can show the individual observations directly, and avoid the loss of res-
             olution that comes from grouping the Observations into categories for a histogram.
             To do this, we must first rank the observations from smallest to largest, divide each
             observation's rank by the number of observations to convert it into a fraction, then
             multiply by 100 to express it as a percentile. That is,
                                                        ( ran ","' )
                                   percentile of Xi = 100                          (2.11)
                                                               xi
             where n is the number of  observations.  By graphing the percentile of  each obser-
             vation versus its value, we form a cumulative plot (Fig. 2-14).  Note that both the
             cumulative histogram and the cumulative plot have a characteristic ogive form.




















                           s
                             2ok
                          U








                              0
                                0
                                              10     15     20     25     30
                                               Average porosity, %
              Figure 2-14.  Cumulative plot of individual porosity measurements used to construct Figures
                   2-12  and 2-13.

                  Successive divisions of a distribution are called quantizes. If we rank all obser-
             vations in a sample and then divide the ranks into 100 equal-sized categories, each
              category is a percentile. Suppose our sample contains 300 observations; the three
              smallest values constitute the first percentile. Each category is called a decile if the
              ranked sample is divided into ten equal categories, and a quartile if  it is divided
             into four equal categories.  Certain divisions of  a distribution such as the 5th and
              95th percentiles, the 25th and 75th percentiles (also called the 1st and 3rd quartiles),
              and the 50th percentile (also called the 5th decile, the 2nd quartile, or the median)
              are considered especially diagnostic and are indicated on the graphic plots we will
              consider next.

              32
   40   41   42   43   44   45   46   47   48   49   50