Page 320 - Statistics for Environmental Engineers
P. 320

L1592_frame_C37.fm  Page 329  Tuesday, December 18, 2001  3:20 PM









                                                   350  (a) Straight line
                                                 Peak Height (1000s)  150
                                                        fitted to all data
                                                   250



                                                    50

                                                   -50
                                                      0    10   20   30    40
                                                       Nitrate Concentration (mg/L)


                       FIGURE 37.2 Plot of the nitrate calibration data and a straight line fitted by ordinary (unweighted) least squares.

                                      100
                                          (a) Fitted straight line in  10  (b) Residuals of fitted
                                          the region of low
                                       80                          straight line
                                          concentrations
                                       60
                                       40                      Residuals (1000s)  0
                                       20
                                        0
                                                                -10
                                         0  2   4  6   8  10     -100  0  100  200  300  400
                                         Nitrate Concentration (mg/L)  Predicted Peak Height (1000s)

                       FIGURE 37.3 (a) Expanded view of the straight-line calibration shows lack of fit at low concentrations. (b) Residuals of
                       the straight-line model show lack of fit and suggest that a quadratic or cubic calibration model should be tried. The greater
                       spread of triplicates at higher values of peak height also suggests that weighting would be appropriate.

                                            Residuals from Average  of 3 Replicates  –2000  • • • •  • •  • • • •  • •  • • •  • •  • • •
                                                4000
                                                2000
                                                  0
                                                      •

                                                                     •
                                               –4000
                                                                          40
                                                   0
                                                              20
                                                         10
                                                                    30
                                                        Nitrate Concentration (mg/L)  50
                       FIGURE 37.4 Residuals from the average at each of the 13 concentration levels used to construct the calibration curve
                       show that the variance increases in proportion to nitrate concentrations.
                        Diagnosing the need for weighted least squares is easy in this case because there are triplicate mea-
                       surements. The variation within replicates is evident in the tabulated data, but it is hidden in Figure 37.2.
                       Figure 37.3 suggests the nonconstant variance, but Figure 37.4 shows it better by flattening the curve
                       to show the residuals with respect to the average at each of the 13 standard concentration levels. The
                       residuals are larger when the analyte concentration is large, which means that the variance is not constant
                       at all concentration levels.
                        There is a further problem with the straight-line analysis given above. A check on the confidence
                       interval of the intercept would support keeping the negative value. This confidence interval is wrong
                       because the residuals of the fitted model are not random and they do not have constant variance. The
                       violation of the constant variance condition of regression distorts all statements about confidence
                       © 2002 By CRC Press LLC
   315   316   317   318   319   320   321   322   323   324   325