Page 265 - Fundamentals of Probability and Statistics for Engineers
P. 265

248                    Fundamentals of Probability and Statistics for Engineers

           8.1  HISTOGRAM AND FREQUENCY DIAGRAMS

           Given a set of independent observations x 1 , x 2 , .. ., and  x n  of a random variable
           X, a useful first step is to organize and present them properly so that they can
           be easily interpreted and evaluated. When there are a large number of observed
           data, a histogram is an excellent graphical representation of the data, facilitating
           (a)an evaluation ofadequacyofthe assumed model, (b)estimation ofpercentiles
           of the distribution, and (c) estimation of the distribution parameters.
             Let us consider, for example, a chemical process that is producing batches of
           a desired material; 200 observed values of the percentage yield, X, representing
           a relatively large sample size, are given in Table 8.1 (Hill, 1975). The sample
           values vary from 64 to 76. Dividing this range into 12 equal intervals and
           plotting the total number of observed yields in each interval as the height of
           a rectangle over the interval results in the histogram as shown in Figure 8.1.

           A frequency  diagram is obtained if the ordinate of the histogram is divided by
           the total number of observations, 200 in this case, and by the interval width D
           (which happens to be one in this example). We see that the histogram or
           the frequency diagram gives an immediate impression of the range, relative
           frequency, and scatter associated with the observed data.
             In the case of a discrete random variable, the histogram and frequency diagram as
           obtained from observed data take the shape of a bar chart as opposed to connected
           rectangles in the continuous case. Consider, for example, the distribution of the
           number of accidents per driver during a six-year time span in California. The data



                  50                                               0.25


                  40                         N(70,4)               0.20
                Histogram  30                                      0.15  Frequency diagram




                  20
                                                                   0.10

                  10                                               0.05



                       64  65 66 67 68 69 70 71 72 73 74 75 76
                                       Percentage yield

                  Figure 8.1 Histogram and frequency diagram for percentage yield
                                   (data source: Hill, 1975)







                                                                            TLFeBOOK
   260   261   262   263   264   265   266   267   268   269   270