Page 137 - Statistics for Environmental Engineers
P. 137

L1592_Frame_C15  Page 134  Tuesday, December 18, 2001  1:50 PM









                                       TABLE 15.3
                                       Censored Data Analysis Using Rankits (Normal Order Statistics)
                                       Conc.  Rankit   Conc.    Rankit   Conc.   Rankit
                                       23.8    −2.21    31.0     −0.40    36.4    0.46
                                       24.4    −1.81    31.6     −0.34    37.0    0.52
                                       25.0    −1.58    31.6     −0.28    37.0    0.59
                                       25.0    −1.41    31.6     −0.22    37.0    0.65
                                       25.6    −1.27    32.2     −0.17    37.0    0.72
                                       25.6    −1.16    32.8     −0.11    37.0    0.80
                                       28.0    −1.05    33.4     −0.06    37.6    0.88
                                       28.0    −0.96    34.1      0.0     37.6    0.96
                                       28.0    −0.88    34.6      0.06    38.2    1.05
                                       28.6    −0.80    35.2      0.11    38.2    1.16
                                       28.6    −0.72    35.2      0.17    39.4    1.27
                                       29.2    −0.65    35.2      0.22    39.4    1.41
                                       29.8    −0.59    35.8      0.28    40.6    1.58
                                       29.8    −0.52    35.8      0.34    43.6    1.81
                                       31.0   −0.46     35.8      0.40    47.8    2.21

                                      50
                                         all 45 observations  10 values censored  20 values censored
                                     Concentration  40
                                                                          y = 33.8 + 5.0x
                                         y = 33.6 + 3.6x
                                                          y = 33.3 + 5.4x
                                      30
                                      20
                                          -2  -1  0  1  2  -2  -1  0  1  2  -2  -1  0  1  2
                                              Rankit          Rankit           Rankit

                       FIGURE 15.3 The left panel shows the rankit plot for a sample of 45 normally distributed values. In the middle panel,
                       10 values have been censored; the right panel has 20 censored values. Regression with rankits as the independent variables
                       and concentration as the dependent variables gives the intercept (rankit = 0) as an estimated of the median = mean, and
                       the slope as an estimate of the standard deviation.

                        If the data have been transformed to obtain normality,  b 0  and  b 1  estimate the mean and standard
                       deviation of the transformed data, but not of the data on the original measurement scale. If normality
                       was achieved by a logarithmic transformation  z =[  ln y()] , the 50th percentile (median of z) estimates
                       the mean of z and is also the log of the geometric mean in the original metric. The slope will estimate
                       the standard deviation of  z. Exponentiation transforms the median on the log-scale transform to the
                       median in the original metric. The back transformations to estimate the mean and standard deviation in
                       the original metric are given in Chapter 7 and in Example 15.5.



                       Cohen’s Maximum Likelihood Estimator Method
                       There are several methods to estimate the mean of a sample of censored data. Comparative studies show
                       that none is always superior so we have chosen to present Cohen’s maximum likelihood method (Cohen,
                       1959, 1961; Gilliom and Helsel, 1986; Haas and Scheff, 1990). It is easy to compute for samples from
                       a normally distributed parent population or from a distribution that can be made normal by a log-arithmic
                       transformation.
                        A sample of n observations has measured values of the variable only at  y ≥  y c  , where y c  is a known
                       and fixed point of censoring. In our application, y c  is the MDL and it is assumed that the same MDL
                       applies to each observation. Of the n total observations in the sample, n c  observations have  y ≤  y c   and
                       are censored. The number of observations with  y >  y c  is  k =  n n c  . The fraction of censored data is
                                                                         –
                       © 2002 By CRC Press LLC
   132   133   134   135   136   137   138   139   140   141   142