Page 127 - Statistics for Dummies
P. 127

Chapter 7: Going by the Numbers: Graphing Numerical Data
                                           Figure 7-3:
                                                     Descriptive Statistics: Age
                                          Descriptive
                                            statistics
                                             for Best
                                                             Total
                                                                                        Q1
                                                                                                         Maximum IQR
                                             Actress
                                                                   Mean
                                                                                             Median
                                                                          StDev Minimum
                                                                                                     Q3
                                                     Variable Count
                                                                                       28.00
                                                              83
                                                     Age
                                                                                                                 11.00
                                                                                                    39.00
                                                                    35.69
                                                                                              33.00
                                                                                 21.00
                                                                          11.35
                                                                                                           81.00
                                               ages
                                           (1928–2009).


                                                    Here are some tips for connecting the shape of the histogram (discussed in
                                                    the previous section) with the mean and median:
                                                     ✓ If the histogram is skewed right, the mean is greater than the median.

                                                        This is the case because skewed-right data have a few large values that
                                                        drive the mean upward but do not affect where the exact middle of the
                                                        data is (that is, the median). Looking at the histogram of ages of the Best   111
                                                        Actress Award winners in Figure 7-1, you see they’re skewed right.
                                                     ✓ If the histogram is close to symmetric, then the mean and median are
                                                        close to each other.
                                                        Close to symmetric means it’s almost the same on either side; it doesn’t
                                                        need to be exact. Close is defined in the context of the data; for example,
                                                        the numbers 50 and 55 are said to be close if all the values lie between 0
                                                        and 1,000, but they are considered to be farther apart if all the values lie
                                                        between 49 and 56.
                                                        The histogram shown in Figure 7-2a is close to symmetric. Its mean and
                                                        median are both equal to 3.5.
                                                     ✓ If the histogram is skewed left, the mean is less than the median.
                                                        This is the case because skewed-left data have a few small values that
                                                        drive the mean downward but do not affect where the exact middle of
                                                        the data is (that is, the median).
                                                        Figure 7-2b represents the exam scores of 17 students, and the data are
                                                        skewed left. I calculated the mean and median of the original data set to
                                                        be 70.41 and 74.00, respectively. The mean is lower than the median due
                                                        to a few students who scored quite a bit lower than the others. These
                                                        findings match the general shape of the histogram shown in Figure 7-2b.
                                                    The tips for interpreting histograms found in the previous section can also be
                                                    used the other way around. If for some reason you don’t have a histogram of
                                                    the data, and you only have the mean and median to go by, you compare them
                                                    to each other to get a rough idea as to the shape of the data set.
                                                                                                                           3/25/11   8:16 PM
                             12_9780470911082-ch07.indd   111                                                              3/25/11   8:16 PM
                             12_9780470911082-ch07.indd   111
   122   123   124   125   126   127   128   129   130   131   132