Page 415 - Fair, Geyer, and Okun's Water and wastewater engineering : water supply and wastewater removal
P. 415

JWCL344_ch11_357-397.qxd  8/2/10  9:01 PM  Page 375







                                                                                           11.10 Probability Paper  375
                                         magnitude of an array generally offers the most helpful concept of central tendency.
                                         Unlike the median magnitude of an array, which is positionally central irrespective of
                                         the magnitudes of the individual observations composing the array, and unlike the mode
                                         of the array, which is the magnitude having the highest frequency of observations com-
                                         posing the array, the arithmetic mean is a collective function of the magnitudes of all
                                         component observations.

                    11.9.2 Variability and Skewness
                                         The tendency of a bell-shaped curve to deviate by larger and larger amounts from the cen-
                                         tral magnitude of an array with less and less frequency is a measure of its variability, vari-
                                         ation, deviation, dispersion, or scatter. The tendency of a bell-shaped curve to be asymmet-
                                         rical is a measure of its skewness.
                                             Mathematically, the symmetrical bell-shaped normal frequency curve of Appendix 5
                                         is described fully by two parameters: (a) the arithmetic mean, m, as a measure of central
                                          tendency, and (b) the standard deviation, s, as a measure of dispersion, deviation, or vari-
                                          ation. The equation of the curve, which is referred to as the Gaussian or normal probabil-
                                          ity curve, is

                                                                              1             x - s  2
                                                                                         1
                                                       y>n = F(x)>n = f(x) = c    dexpe - > c    d f        (11.15)
                                                                                          2
                                                                            s12p              s
                                          where y is the number or frequency of observations of magnitude x deviating from the
                                         mean magnitude by x   m, n is the total number of observations, s is the standard
                                         deviation from the mean, F(x) symbolizes a function of x, and m is the mathematical
                                         constant 3.1416.
                                             As shown in Appendix 5 and calculated from Eq. 11.15, the origin of the coordinate
                                         system of the Gaussian curve lies at x   m   0 where the frequency is y = n>(s12p),
                                         and the distance from the origin to the points of inflection of the curve is the standard
                                                               2
                                         deviation s = 2©(x - m) >n.  This presupposes that deviations are referenced to the true
                                         mean m, whereas calculations must be based on the observed mean x =©x>n,  which is
                                         expected to deviate from the true mean by a measurable although small amount.
                                         However, it an be shown that when the observed standard deviation is calculated as
                                                        2
                                          s = 2©(x - m) >(n - 1),  it closely approximates s. As n becomes large, there is in-
                                          deed little difference between s and s as well as between  and m, and the Greek letters
                                                                                         x
                                          are often used even when, strictly speaking, the Latin ones should be. The ratios y>n and
                                          (x   m)/s generalize the equation of the normal frequency function.
                                             Engineers are usually interested in the expected frequencies of observations below or
                                          above a given value of x or falling between two given values of x. The magnitudes of areas
                                          under the normal curve provide this useful information better than do point frequencies.
                                          Appendix 5 is a generalized table of these areas. It is referred to as a probability integral
                                          table because a given area is a measure of the expected fraction of the total area (as unity)
                                          corresponding to different departures from the mean (x   m) in terms of the standard devi-
                                          ation s, that is, (x   m)>s   t.

                    11.10  PROBABILITY PAPER

                                         As suggested by Allen Hazen, the probability integral (Appendix 5) can be used to de-
                                         velop a system of coordinates on which normal frequency distributions plot as straight
                                         lines. The companion to the probability scale can be (a) arithmetic for true Gaussian
                                         normality, (b) logarithmic for geometric normality, or (c) some other function of the
   410   411   412   413   414   415   416   417   418   419   420