Page 244 - Applied Statistics And Probability For Engineers
P. 244

c06.qxd  5/14/02  9:55  M  Page 205 RK UL 6 RK UL 6:Desktop Folder:TEMP WORK:MONTGOMERY:REVISES UPLO D CH114 FIN L:Quark Files:






                                                             6-4 FREQUENCY DISTRIBUTIOINS AND HISTOGRAMS  205




                                                                       20

                            10

                           Frequency  5                               Frequency  10





                             0
                                                                        0
                                    100      150      200      250          80  100 120 140 160 180 200 220 240
                                              Strength                                   Strength
                          Figure 6-8 A histogram of the compressive strength  Figure 6-9 A histogram of the compressive strength
                          data from Minitab with 17 bins.           data from Minitab with nine bins.

                                       Figure 6-8 shows a histogram of the compressive strength data from Minitab. The “de-
                                   fault” settings were used in this histogram, leading to 17 bins. We have noted that histograms
                                   may be relatively sensitive to the number of bins and their width. For small data sets, his-
                                   tograms may change dramatically in appearance if the number and/or width of the bins
                                   changes. Histograms are more stable for larger data sets, preferably of size 75 to 100 or more.
                                   Figure 6-9 shows the Minitab histogram for the compressive strength data with nine bins. This
                                   is similar to the original histogram shown in Fig. 6-7. Since the number of observations is
                                   moderately large (n   80), the choice of the number of bins is not especially important, and
                                   both Figs. 6-8 and 6-9 convey similar information.
                                       Figure 6-10 shows a variation of the histogram available in Minitab, the cumulative fre-
                                   quency plot. In this plot, the height of each bar is the total number of observations that are less
                                   than or equal to the upper limit of the bin. Cumulative distributions are also useful in data in-
                                   terpretation; for example, we can read directly from Fig. 6-10 that there are approximately 70
                                   observations less than or equal to 200 psi.
                                       When the sample size is large, the histogram can provide a reasonably reliable indicator of
                                   the general shape of the distribution or population of measurements from which the sample
                                                                                           ~
                                                                                           x
                                   was drawn. Figure 6-11 presents three cases. The median is denoted as  . Generally, if the data
                                   are symmetric, as in Fig. 6-11(b), the mean and median coincide. If, in addition, the data have
                                   only one mode (we say the data are unimodal), the mean, median, and mode all coincide. If the
                                   data are skewed (asymmetric, with a long tail to one side), as in Fig. 6-11(a) and (c), the mean,
                                   median, and mode do not coincide. Usually, we  find that  mode   median   mean  if the




                                      80
                                      70
                                     Cumulative frequency  50
                                      60

                                      40
                                      30
                 Figure 6-10  A       20
                 cumulative distribution  10
                 plot of the compressive  0
                 strength data from          100       150       200       250
                 Minitab.                               Strength
   239   240   241   242   243   244   245   246   247   248   249