Page 331 - Statistics for Environmental Engineers
P. 331

L1592_frame_C38  Page 340  Tuesday, December 18, 2001  3:21 PM









                                            TABLE 38.2
                                            Analysis of Variance for the Six-Parameter Settling
                                            Linear Model
                                            Due to          df    SS    MS = SS/df
                                            Regression (Reg SS)  5  20255.5  4051.1
                                            Residuals (RSS)  6    308.8     51.5
                                            Total (Total SS)  11  20564.2


                                              10  •    •  •               •  •
                                            Residuals  0  •  •  •  •  •

                                              -10
                                                40      80  •  120  •  160     200
                                                         Predicted SS Values

                                                                                               2
                                                                             y ˆ
                       FIGURE 38.3  Plot of residuals against the predicted values of the regression model   = 185.97 + 7.125t + 0.014t  − 3.057z.
                       the model might be improved. The pattern of the residuals plotted against time (Figure 38.2b) suggests
                                  2
                       that adding a t  term may be helpful. This was done to obtain:
                                                y ˆ =  186.0 +  7.12z 3.06t +  0.0143t  2
                                                               –
                                2
                       which has R  = 0.97 and residual mean square = 81.5. A diagnostic plot of the residuals (Figure 38.3)
                       reveals no inadequacies. Similar plots of residuals against the independent variables also support the
                       model. This model is adequate to describe the data.
                        The most complicated model, which has six parameters, is:

                                         y ˆ =  152 +  20.9z 2.74t –  1.13z –  0.0143t –  0.080zt
                                                                  2
                                                                           2
                                                       –
                       The model contains quadratic terms for time and depth and the interaction of depth and time (zt). The
                       analysis of variance for this model is given in Table 38.2. This information is produced by computer
                       programs that do linear regression. For now we do not need to know how to calculate this, but we should
                       understand how it is interpreted.
                        Across the top, SS is sum of squares and df = degrees of freedom associated with a sum of squares
                       quantity. MS is mean square, where MS = SS/df. The sum of squares due to regression is the regression
                       sum of squares (RegSS): RegSS = 20,255.5. The sum of squares due to residuals is the residual sum of
                       squares (RSS); RSS = 308.8. The total sum of squares, or Total SS, is:


                                                    Total SS = RegSS + RSS

                       Also:

                                                     Total SS =  ∑ ( y i –  y) 2

                       The residual sum of squares (RSS) is the minimum sum of squares that results from estimating the parameters
                       by least squares. It is the variation that is not explained by fitting the model. If the model is correct, the RSS
                       is the variation in the data due to random measurement error. For this model, RSS = 308.8. The residual
                       mean square is the RSS divided by the degrees of freedom of the residual sum of squares. For RSS, the
                       degrees of freedom is df = n − p, where n is the number of observations and p is the number of parameters
                       in the fitted model. Thus, RMS = RSS /(n − p). The residual sum of squares (RSS = 308.8) and the
                       © 2002 By CRC Press LLC
   326   327   328   329   330   331   332   333   334   335   336