Page 274 - Statistics for Environmental Engineers
P. 274

L1592_Frame_C30  Page 277  Tuesday, December 18, 2001  2:49 PM









                       Coefficients b 0  and b 1  were expected to be significant. There is nothing additional to note about them.
                        The interactions are not significant:

                          b 12  = 0.0192 ± 0.0224  b 13   = −0.0109  ± 0.0224
                          b 23  = 0.0004 ± 0.0224  b 123  = −0.0115  ± 0.0224

                        Method A gives results that are from 0.025 to 0.060 mg/L lower than Method B (on the log-transformed
                       scale). This is indicated by the coefficient b 2  = 0.0478 ± 0.0224. The difference between A and B on
                                                                        1
                       the log scale is a percentage on the original measurement scale.  Method A gives results that are 2.5 to
                       6% lower than Method B.
                        If a 5% difference between methods is not important in the context of a particular investigation, and
                       if Method B offers substantial advantages in terms of cost, speed, convenience, simplicity, etc., one
                       might decide to adopt Method B although it is not truly equivalent to Method A. This highlights the
                       difference between “statistically significant” and “practically important.” The statistical problem was
                       important to learn whether A and B were different and, if so, by how much and in which direction. The
                       practical problem was to decide whether a real difference could be tolerated in the application at hand.
                        Using PMA as a preservative caused no measurable effect or interference. This is indicated by the
                       confidence interval [−0.004, 0.041] for b 3 , which includes zero. This does not mean that wastewater
                       specimens could be held without preservation. It was already known that preservation was needed, but
                       it was not known how PMA would affect Method B. This important result meant that the analyst could
                       do nitrate measurements twice a week instead of daily and holding wastewater over the weekend was
                       possible. This led to economies of scale in processing.
                        This chapter began by saying that Method A, the widely accepted method, was considered to give
                       accurate measurements. It is often assumed that widely used methods are accurate, but that is not
                       necessarily true. For many analyses, no method is known a priori to be correct. In this case, finding that
                       Methods A and B are equivalent would not prove that either or both give correct results. Likewise,
                       finding them different would not mean necessarily that one is correct. Both might be wrong.
                        At the time of this study, all nitrate measurement methods were considered tentative (i.e., not yet
                       proven accurate). Therefore, Method A actually was not known to be correct. A 5% difference between
                       Methods A and B was of no practical importance in the application of interest. Method B was adopted
                       because it was sufficiently accurate and it was simpler, faster, and cheaper.




                       Comments
                       The arithmetic of fitting a regression model to a factorial design and estimating effects in the standard
                       way is virtually identical. The main effect indicates the change in response that results from moving
                       from the low level to the high level (i.e., from −1 to +1). The coefficients in the regression model indicate
                       the change in response associated with moving only one unit (e.g., from 0 to +1). Therefore, the regression
                       coefficients are exactly half as large as the effects.
                        Obviously, the decision of analyzing the data by regression or by calculating effects is largely a matter
                       of convenience or personal preference. Calculating the effects is more intuitive and, for many persons,
                       easier, but it is not really different or better.
                        There are several common situations where linear regression must be used to analyze data from a
                       factorial experiment. The factorial design may not have been executed precisely as planned. Perhaps
                       one run has failed so there is a missing data point. Or perhaps not all runs were replicated, or the number
                       of replicates is different for different runs. This makes the experiment unbalanced, and matrix multipli-
                       cation and inversion cannot be done by inspection as in the case of a balanced two-level design.

                       1
                       Suppose that Method B measures 3.0 mg/L, which is 1.0986 on the log scale, and Method A measures 0.0477 less on the
                       log scale, so it would give 1.0986 − 0.0477 = 1.0509. Transform this to the original metric by taking the antilog; exp(1.0509) =
                       2.86 mg/L. The difference 3.0 − 2.86 = 0.14, expressed as a percentage is 100(0.14/3.0) = 4.77%. This is the same as the effect
                       of method (0.0477) on the log-scale that was computed in the analysis.
                       © 2002 By CRC Press LLC
   269   270   271   272   273   274   275   276   277   278   279