Page 28 - Statistics for Environmental Engineers
P. 28

L1592_Frame_C02  Page 19  Tuesday, December 18, 2001  1:40 PM










                                            14     40 random samples of n = 4
                                                                              Parent
                                           y 10                               distribution

                                                                              N(10,1)
                                             6
                                            12                                  Sampling
                                        y=  Σy  10                              distribution
                                            n                                   of the mean
                                             8                                  is normal

                                             6
                                             4
                                    2 Σ( y −y )
                                    s =     2  2                              Sampling
                                        n−1                                   distribution of
                                             0                                the variance
                                             2
                                       t =  y −η  0                           Sampling
                                                                              distribution
                                          sn
                                                                              of t
                                            -2
                       FIGURE 2.11  Forty random samples of n = 4 from a N(10,1) normal distribution to produce the sampling distributions
                            2
                       of , s , and t. (Adapted from Box, G. E. P., W. G. Hunter, and J. S. Hunter (1978). Statistics for Experimenters: An
                         y
                       Introduction to Design, Data Analysis, and Model Building, New York, Wiley Interscience.)




                                    area = 0.04
                                                               area = 0.04

                                    7.0   7.5   8.0   8.5   9.0  -4  -3  -2  -1  0  1  2  3  4
                                     (a)  Reference distribution of y   ¯  (b)  Reference distribution of t
                                          P(y  ≤ 7.51) = 0.04         P(t ≤ 1.853) = 0.04
                                            ¯
                                    y
                       FIGURE 2.12  The   and t reference distributions for the sample average of the nitrate data of Example 2.1.
                           serves for negative as well as positive values. For ν = 26, the tail areas are 0.05 for t = –1.706,
                           0.025 for t = –2.056, and 0.01 for t = –2.479. Plotting these and drawing a smooth curve as
                           an aid to interpolation gives Prob(t < –1.853)   0.04, or only about 4%. This low probability
                           suggests that there may be a problem with the measurement method in this laboratory.

                       The assessment given in Example 2.9 can also be made by examining the reference distributions of  .
                                                                                                    y
                       The distribution of   is centered about η = 8.0 mg/L with standard deviation s = 0.266 mg/L. The valuey
                                                                                             y
                       of   observed for this particular experiment is 7.51 mg/L. The shaded area to the left of   = 7.51 in
                         y
                       Figure 2.12(a) is the same as the area to the left of t = –1.853 in Figure 2.12(b). Thus, P(t ≤ –1.853) =
                       P(  ≤ 7.51)   0.04.
                         y
                        In the context of Example 2.9, the investigator is considering the particular result that   = 7.51 mg/L
                                                                                           y
                       in a laboratory assessment based on 27 blind measurements on specimens known to have concentration
                       η = 8.00 mg/L. A relevant reference distribution is needed in order to decide whether the result is easily
                       explained by mere chance variation or whether it is exceptional. This reference distribution represents
                       the set of outcomes that could occur by chance. The t distribution is a relevant reference distribution
                       under certain conditions which have already been identified. An outcome that falls on the tail of the
                       distribution can be considered exceptional. If it is found to be exceptional, it is declared statistically
                       significant. Significant in this context does not refer to scientific importance, but only to its statistical
                       plausibility in light of the data.
                       © 2002 By CRC Press LLC
   23   24   25   26   27   28   29   30   31   32   33