Page 135 - Statistics for Environmental Engineers
P. 135

L1592_Frame_C15  Page 132  Tuesday, December 18, 2001  1:50 PM











                       Graphical Methods
                       The median, trimmed mean, and Winsorized mean give unbiased estimates of the mean only when the
                       distribution is symmetrical. Many sets of environmental data are not symmetrical so other approaches
                       are needed. Even if the distribution is symmetric, the proportion of censored observations may be so
                       large that the trimmed or Winsorized estimates are not reliable. In these cases, we settle for doing
                       something helpful without misleading.
                        Graphical methods, when properly used, reveal all the important features of the data. Graphical methods
                       are especially important when one is unsure about a particular statistical method. It is better to display the
                       data rather than to compute a summarizing statistic that hides the data structure and therefore may be
                       misleading. Suitable plots very often enable a decision to be reached without further analysis.
                        Two useful graphs, the time series plot and the cumulative probability plot, are illustrated using a highly
                       censored sample. Forty-five grab samples were taken on consecutive days from a large river. Table 15.1
                       gives the measured mercury concentrations and the days on which they were observed. Only 14 samples
                       had a concentration above MDL of 0.2 µg/ L (today, the MDL for Hg is much lower).
                        With 31 out of 45 observations censored, it is impossible to compute any meaningful statistics for this
                       data set. The median is below the MDL and, because more than half the data are censored, neither the
                       trimmed or Winsorized mean can be computed. One approach is plot the data and see what can be learned.
                       Figure 15.1 shows the measurable concentrations, which occur as seemingly random spikes.
                        Table 15.2 gives the plotting positions for the cumulative probability plot (Figure 15.2), which uses
                       a logarithmic scale so the data will plot as a straight line. The plotting positions are p =  ( i 0.5) n .
                                                                                               –
                       (See Chapter 5 for more about probability plots.)

                         TABLE 15.1
                         Measurable Mercury Concentrations
                         Time, Days  2  4    10   12  13   16   17   22   26  27   35   38   43   44
                         Hg (µµ µµg// //L)  0.5  0.6  0.8  1.0  3.4  5.5  1.0  1.0  3.5  2.8  0.3  0.5  0.5  1.0


                                        TABLE 15.2
                                        Construction of the Cumulative Probability Distribution
                                        Concentration  No. of Obs.  Rank  Probability Value ≤≤ ≤≤y i
                                                                 i      p == == (i – 0.5)// //n
                                             y i        ≤≤ ≤≤y i
                                            <0.2       31       31         0.678
                                            0.3         1       32         0.700
                                            0.5         3       35         0.767
                                            0.6         1       36         0.789
                                            0.8         1       37         0.811
                                            1.0         4       41         0.900
                                            2.8         1       42         0.922
                                            3.4         1       43         0.944
                                            3.5         1       44         0.967
                                            5.5         1       45         0.989

                                              Hg (µg/L)  6 4 2


                                               0
                                                0     10    20    30    40     50
                                                              Day

                       FIGURE 15.1 Time series plot of the data.
                       © 2002 By CRC Press LLC
   130   131   132   133   134   135   136   137   138   139   140