Page 290 - Statistics for Environmental Engineers
P. 290

L1592_frame_C33  Page 296  Tuesday, December 18, 2001  2:51 PM









                       and independent variables are known. It is the parameters, the β’s and θ’s, that are unknown and must
                                                                                     2
                                               2
                       be computed. The model y = βx  is nonlinear in x; but once the known value of x  is provided, we have
                       an equation that is linear in the parameter β. This is a linear model and it can be fitted by linear regression.
                                            θ
                       In contrast, the model y = x  is nonlinear in θ, and θ must be estimated by nonlinear regression (or we
                       must transform the model to make it linear).
                        It is usually assumed that a well-conducted  experiment produces  values of  x i  that are essentially
                       without error, while the observations of y i  are affected by random error. Under this assumption, the y i
                       observed for the ith experimental run is the sum of the true underlying value of the response (η i ) and a
                       residual error (e i ):
                                                      y i =  η i +  e i i =  1, 2,…,n


                       Suppose that we know, or tentatively propose, the linear model η = β 0  + β 1 x. The observed responses
                       to which the model will be fitted are:

                                                             +
                                                       y i =  β 0 β 1 x i +  e i
                       which has residuals:

                                                       e i =  y i β 0 β 1 x i
                                                             –
                                                                 –
                       Similarly, if one proposed the nonlinear model η = θ 1 exp(−θ 2 x), the observed response is:


                                                      y i  = θ 1  exp(−θ 2 x i ) + e i
                       with residuals:

                                                      e i  = y i  − θ 1  exp(−θ 2 x i )

                       The relation of the residuals to the data and the fitted model is shown in Figure 33.1. The lines represent
                       the model functions evaluated at particular numerical values of the parameters. The residual  e i =(  y i η i )
                                                                                                  –
                       is the vertical distance from the observation to the value on the line that is calculated from the model.
                       The residuals can be positive or negative.
                        The position of the line obviously will depend upon the particular values that are used for β 0  and β 1
                       in the linear model and for θ 1  and θ 2  in the nonlinear model. The regression problem is to select the
                       values for these parameters that best fit the available observations. “Best” is measured in terms of making
                       the residuals small according to a least squares criterion that will be explained in a moment.
                        If the model is correct, the residual e i  = y i  − η i  will be nothing more than random measurement error. If
                       the model is incorrect, e i  will reflect lack-of-fit due to all terms that are needed but missing from the model
                       specification. This means that, after we have fitted a model, the residuals contain diagnostic information.


                                              Linear model
                                              y = β  + β x
                                                    1
                                                 0
                                                         e
                                           y i            i         e i
                                                               y i
                                                                     Nonlinear model
                                                                     y = θ  [ 1-exp(θ x)]
                                                                              1
                                                                        0
                                                        x i         x i
                       FIGURE 33.1 Definition of residual error for a linear model and a nonlinear model.
                       © 2002 By CRC Press LLC
   285   286   287   288   289   290   291   292   293   294   295