Page 140 - Statistics for Environmental Engineers

P. 140

L1592_Frame_C15 Page 137 Tuesday, December 18, 2001 1:50 PM

The transformation equations to convert these into estimates of the mean and variance of the
untransformed y’s are:

η ˆ y = exp ( η ˆ x + 0.5σ ˆ x)
2
[
σ ˆ y = η ˆ y exp σ ˆ x) 1]
(
2
2
2
–
2
Substituting the parameter estimatesη ˆ x and σ ˆ x gives:
[
(
η ˆ y = exp 3.0187 + 0.5 0.1358)] = exp ( 3.0866) = 21.9 µg/L
σ ˆ y = ( 21.90) exp 0.1358) 1] = 69.76 µg/L) 2
(
(
[
2
2
–
σ ˆ y = 8.35 µg/L
The Delta-Lognormal Distribution
The delta-lognormal method estimates the mean of a sample of size n as a weighted average of n c replaced
censored values and n – n c uncensored lognormally distributed values. The Aitchison method (1955, 1969)
assumes that all censored values are replaced by zeros (D = 0) and the noncensored values have a lognormal
distribution. Another approach is to replace censored values by the detection limit (D = MDL) or by some
value between zero and the MDL (U.S. EPA, 1989; Owen and DeRouen, 1980).
The estimated mean is a weighted average of the mean of n c values that are assigned value D and the
mean of the n – n c fully measured values that are assumed to have a lognormal distribution with mean
2
η x and variance σ x .
---- exp η ˆ x –(
η ˆ y = D---- +  1 – n c 0.5σ ˆ x )
2
n c
n  n 
are the estimated mean and variance of the log-transformed noncensored values.
where η ˆ x andσ ˆ x
This method gives results that agree well with Cohen’s method, but it is not consistently better than
Cohen’s method. One reason is that the user is required to assume that all censored values are located
at a single value, which may be zero, the limit of detection, or something in between.

Comments
The problem of censored data starts when an analyst decides not to report a numerical value and instead
reports “not detected.” It would be better to have numbers, even if the measurement error is large relative
to the value itself, as long as a statement of the measurement’s precision is provided. Even if this were
practiced universally in the future, there remain many important data sets that have already been censored
and must be analyzed.
Simply replacing and deleting censored values gives biased estimates of both the mean and the variance.
The median, trimmed mean, and Winsorized mean provide unbiased estimates of the mean when the
distribution is symmetric. The trimmed mean is useful for up to 25% censoring, and the Winsorized
mean for up to 15% censoring. These methods fail when more than half the observations are censored.
In such cases, the best approach is to display the data graphically. Simple time series plots and probability
plots will reveal a great deal about the data and will never mislead, whereas presenting any single
numerical value may be misleading.
The time series plot gives a good impression about variability and randomness. The probability plot
shows how frequently any particular value has occurred. The probability plot can be used to estimate

135 136 137 138 139 140 141 142 143 144 145