Page 236 - Applied statistics and probability for engineers
P. 236

214     Chapter 6/Descriptive Statistics


                                     0.3125    25
                                     0.2500    20
                                    Relative frequency  0.1895  Frequency  15


               FIGURE 6-7            0.1250    10
               Histogram of
               compressive strength   0.0625    5
               for 80 aluminum-
                                         0      0
               lithium alloy                       70  90 110 130 150 170 190 210 230 250
               specimens.                                  Compressive strength (psi)

                                   Notice the symmetric, bell-shaped distribution of the strength measurements in Fig. 6-7.
                                   This display often gives insight about possible choices of probability distributions to use
                                   as a model for the population. For example, here we would likely conclude that the normal
                                   distribution is a reasonable model for the population of compression strength measurements.
                                     Sometimes a histogram with unequal bin widths will be employed. For example, if the
                                   data have several extreme observations or outliers, using a few equal-width bins will result
                                   in nearly all observations falling in just a few of the bins. Using many equal-width bins will
                                   result in many bins with zero frequency. A better choice is to use shorter intervals in the region
                                   where most of the data fall and a few wide intervals near the extreme observations. When the
                                   bins are of unequal width, the rectangle’s area (not its height) should be proportional to the
                                   bin frequency. This implies that the rectangle height should be
                                                   Rectangular height =  Bin frequancy
                                                                      Bin width
                                     In passing from either the original data or stem-and-leaf diagram to a frequency distribu-
                                   tion or histogram, we have lost some information because we no longer have the individual
                                   observations. However, this information loss is often small compared with the conciseness
                                   and ease of interpretation gained in using the frequency distribution and histogram.
                 Histograms are Best   Figure 6-8 is a histogram of the compressive strength data with 17 bins. We have noted
                 for Relatively Large   that histograms may be relatively sensitive to the number of bins and their width. For small
                          Samples  data sets, histograms may change dramatically in appearance if the number and/or width of
                                   the bins changes. Histograms are more stable and thus reliable for larger data sets, preferably
                                   of size 75 to 100 or more. Figure 6-9 is a histogram for the compressive strength data with




                                                                    20

                 10


                Frequency  5                                      Frequency  10




                  0                                                 0
                         100      150      200      250                  80  100 120 140 160 180 200 220 240
                                  Strength                                            Strength
               FIGURE 6-8  A histogram of the compressive        FIGURE 6-9  A histogram of the compressive strength
               strength data with 17 bins.                       data with nine bins.
   231   232   233   234   235   236   237   238   239   240   241