Page 71 - Statistics for Environmental Engineers
P. 71

L1592_frame_C07.fm  Page 63  Tuesday, December 18, 2001  1:44 PM








                                 TABLE 7.1
                                 Transformations that are Useful to Obtain Uniform Variance
                                 Condition                                    Replace y by
                                 σ  uniform over range of y                no transformation needed
                                  σ ∝  y 2                                 x = 1/y
                                  σ ∝  y 3/2                               x =  1/ y
                                       2
                                      (
                                  σ ∝  y  σ >  y)      all y > 0           x = log(y)
                                                       some y = 0          x = log(y + c)
                                  σ ∝  y 1/2           all y > 0           x =  y
                                                       some y < 0          x =  y +  c
                                  σ >  y               p = ratio or percentage  x = arcsin  p
                                   2
                                 Source: Box, G. E. P., W. G. Hunter, and J. S. Hunter (1978). Statistics for Experimenters:
                                 An Introduction to Design, Data Analysis, and Model Building, New York, Wiley Interscience.
                        TABLE 7.2
                        Plankton Counts on 20 Replicate Water Samples from Five Stations in a Reservoir
                        Station 1  0  2  1  0   0  1   1   0  1  1   0  2   1   0   0  2  3  0  1   1
                        Station 2  3  1  1  1   4  0   1   4  3  3   5  3   2   2   1  1  2  2  2   0
                        Station 3  6  1  5  7   4  1   6   5  3  3   5  3   4   3   8  4  2  2  4   2
                        Station 4  7  2  6  9   5  2   7   6  4  3   5  3   6   4   8  5  2  3  4   1
                        Station 5  12  7  10  15  9  6  13  11  8  7  10  8  11  8  14  9  6  7  9  5
                        Source: Elliot, J. (1977).  Some Methods for the Statistical  Analysis of Samples of Benthic Invertebrates, 2nd ed.,
                        Ambleside, England, Freshwater Biological Association.

                                 TABLE 7.3
                                 Statistics Computed from the Data in Table 7.2
                                 Station                  1        2      4       5      6
                                 Untransformed data     y  = 0.85  2.05  3.90    4.60   9.25
                                                       s y 2  = 0.77  1.84  3.67  4.78  7.57
                                 Transformed  x =  y +  c  x  = 1.10  1.54  2.05  2.20  3.09
                                                       s x 2  = 0.14  0.20  0.22  0.22  0.19

                        The effect of square root and logarithmic transformations is to make the larger values less important
                       relative to the small ones. For example, the square root converts the values (0, 1, 4) to (0, 1, 2). The 4,
                       which tends to dominate on the original scale, is made relatively less important by the transformation.
                        The log transformation is a stronger transformation than the square root transformation. “Stronger” means
                       that the range of the transformed variables is relatively smaller for a log transformation that for the square
                       root. When the sample contains some zero values, the log transformation is x = log(y + c), where c is a
                       constant. Usually the value of c is arbitrarily chosen to be 1 or 0.5. The larger the value of c, the less severe
                       the transformation. Similarly, for square root transformations,  y +  c   is less severe than  y.
                        The arcsin transformation is used for decimal fractions and is most useful when the sample includes values
                       near 0.00 and 1.00. One application is in bioassys where the data are fractions of organisms showing an effect.

                       Example 7.1

                           Twenty replicate samples from five stations were counted for plankton, with the results given in
                           Table 7.2. The computed averages and variances are in Table 7.3. The computed means and variance
                           on the original data show that variance is not uniform; it is ten times larger at station 5 than at
                                                                            2
                           station 1. Also, the variance increases as the average increases and s y   seems to be proportional to
                            y.   This indicates that a square root transformation may be suitable. Because most of the counts
                       © 2002 By CRC Press LLC
   66   67   68   69   70   71   72   73   74   75   76