Page 348 - Statistics for Environmental Engineers
P. 348

L1592_frame_C40   Page 358  Tuesday, December 18, 2001  3:24 PM









                       estimate β 0  = 0, γ 0  = 0, β 1  = 0, and γ 1  = 0 and the same slope (α 1 ) and intercept (α 0 ) would apply to


                       all three categories. The fitted simplified model is y i =  α 0 +  α 1 x i +  e i  .
                        If the intercepts are different for the three categories but the slopes are the same, the regression would
                       estimate β 1  = 0 and γ 1  = 0 and the model becomes:
                                                 y i =  ( α 0 +  β 0 Z 1 +  γ 0 Z 2 ) +  α 1 x i +  e i


                          For category 1:   y i =  ( α 0 + β 0 Z 1 ) +  α 1 x i +  e i
                          For category 2:   y i =  ( α 0 +  γ 0 Z 2 ) +  α 1 x i +  e i
                          For category 3:   y i =  α 0 + α 1 x i +  e i



                       Case Study: Solution
                       The model under consideration allows a different slope and intercept for each storm. Two dummy variables
                       are needed:

                          Z 1  = 1 for storm 1 and zero otherwise
                          Z 2  = 1 for storm 2 and zero otherwise

                        The model is:

                                           pH = α 0  + α 1 WA + Z 1 (β 0  + β 1 WA ) + Z 2 (γ 0  + γ 1 WA )

                       where the α’s, β’s, and γ ’s are estimated by regression. The model can be rewritten as:

                                          pH = α 0  + β 0 Z 1  + γ 0 Z 2  + α 1 WA + β 1 Z 1 WA + γ 1 Z 2 WA

                       The dummy variables are incorporated into the model by creating the new variables Z 1 WA and Z 2 WA.
                       Table 40.1 shows how this is done.
                        Fitting the full six-parameter model gives:

                              Model A: pH = 5.77 − 0.00008WA + 0.998Z 1  + 1.65Z 2  − 0.005Z 1 WA − 0.008Z 2 WA
                                  (t-ratios)       (0.11)   (2.14)  (3.51)  (3.63)    (4.90)

                       which is also shown as Model A in Table 40.2 (top row). The numerical coefficients are the least squares
                       estimates of the parameters. The small numbers in parentheses beneath the coefficients are the t-ratios
                       for the parameter values. Terms with t < 2 are candidates for elimination from the model because they
                       are almost certainly not significant.
                        The term WA appears insignificant. Dropping this term and refitting the simplified model gives Model
                       B, in which all coefficients are significant:

                              Model B:              pH = 5.82 + 0.95Z 1  + 1.60Z 2  − 0.005Z 1 WA − 0.008Z 2 WA
                              (t-ratios)                 (6.01)    (9.47)    (4.35)    (5.54)
                              [95% conf. interval]   [0.63 to 1.27] [1.26 to 1.94] [−0.007 to −0.002] [−0.01 to −0.005]
                       The regression sum of squares, listed in Table 40.2, is the same for Model A and for Model B (Reg SS =
                       4.278). Dropping the WA term caused no decrease in the regression sum of squares. Model B is equivalent
                       to Model A.
                        Is any further simplification possible? Notice that the 95% confidence intervals overlap for the terms
                       −0.005 Z 1 WA and –0.008 Z 2 WA. Therefore, the coefficients of these two terms might be the same. To
                       check this, fit Model C, which has the same slope but different intercepts for storms 1 and 2. This is
                       © 2002 By CRC Press LLC
   343   344   345   346   347   348   349   350   351   352   353