Page 93 - Statistics for Dummies
P. 93

Chapter 5: Means, Medians, and More
                                                    By far the most common measure of variation for numerical data is the stan-
                                                    dard deviation. The standard deviation measures how concentrated the data
                                                    are around the mean; the more concentrated, the smaller the standard devia-
                                                    tion. It’s not reported nearly as often as it should be, but when it is, you often
                                                    see it in parentheses: (s = 2.68).
                                                    Calculating standard deviation
                                                    The formula for the sample standard deviation of a data set (s) is
                                                    To calculate s, do the following steps:
                                                    Reporting the standard deviation                                       77
                                                      1. Find the average of the data set,  .
                                                     2. Take each number in the data set (x) and subtract the mean from it to
                                                        get      .
                                                      3. Square each of the differences,   .
                                                     4. Add up all of the results from Step 3 to get the sum of squares:   .
                                                      5. Divide the sum of squares (found in Step 4) by the number of numbers
                                                        in the data set minus one; that is, (n – 1). Now you have:
                                                      6. Take the square root to get



                                                        which is the sample standard deviation, s. Whew!

                                                    At the end of Step 5 you have found a statistic called the sample variance,
                                                              2
                                                    denoted by s . The variance is another way to measure variation in a data set; its
                                                    downside is that it’s in square units. If your data are in dollars, for example, the
                                                    variance would be in square dollars — which makes no sense. That’s why we
                                                    proceed to Step 6. Standard deviation has the same units as the original data.

                                                    Look at the following small example: Suppose you have four quiz scores:
                                                    1, 3, 5, and 7. The mean is 16 ÷ 4 = 4 points. Subtracting the mean from
                                                    each number, you get (1 – 4) = –3, (3 – 4) = –1, (5 – 4) = +1, and (7 – 4) = +3.
                                                    Squaring each of these results, you get 9, 1, 1, and 9. Adding these up, the
                                                    sum is 20. In this example, n = 4, and therefore n – 1 = 3, so you divide 20 by
                                                    3 to get 6.67. The units here are “points squared,” which obviously makes








                                                                                                                           3/25/11   8:17 PM
                             10_9780470911082-ch05.indd   77                                                               3/25/11   8:17 PM
                             10_9780470911082-ch05.indd   77
   88   89   90   91   92   93   94   95   96   97   98