Page 342 - Statistics for Dummies
P. 342

326
                                         Part VI: The Part of Tens
                                                    Small sample sizes make results less accurate (unless your population was
                                                    small to begin with). Many headlines aren’t exactly what they appear to be
                                                    when the details reveal a study that was based on a small sample. Perhaps
                                                    even worse, many studies don’t even report the sample size at all, which
                                                    should lead you to be skeptical of the results. (For example, an old chewing
                                                    gum ad said, “Four out of five dentists surveyed recommend [this gum] for
                                                    their patients who chew gum.” What if they really did ask only five dentists?)
                                                    Don’t think about this too much, but according to statisticians (who are picky
                                                    about precision), 4 out of 5 is much different than 4,000 out of 5,000, even
                                                    though both fractions equal 80 percent. The latter represents a much more
                                                    precise (repeatable) result because it’s based on a much higher sample size.
                                                    (Assuming it’s good data, of course.) If you ever wondered how math and sta-
                                                    tistics are different, here’s your answer! (Chapter 12 has more on precision.)
                                                    However, more data isn’t always better data — it depends on how well the
                                                    data were collected (see Chapter 16). Suppose you want to gather the opin-
                                                    ions of city residents on a city council proposal. A small random sample with
                                                    well-collected data (such as a mail survey of a small number of homes chosen
                                                    at random from a city map) is much better than a large non-random sample
                                                    with poorly collected data (for example, posting a Web survey on the city
                                                    manager’s Web site and asking for people to respond).
                                                   Always look for the sample size before making decisions about statistical
                                                    information. The smaller the sample size, the less precise the information. If
                                                    the sample size is missing from the article, get a copy of the full report of the
                                                    study, contact the researcher, or contact the journalist who wrote the article.
                                         Detect Misinterpreted Correlations
                                                    Everyone wants to look for connections between variables; for example, what
                                                    age group is more likely to vote Democrat? If I take even more vitamin C,
                                                    am I even less likely to get a cold? How does staring at the computer all day
                                                    affect my eyesight? When you think of connections or associations between
                                                    variables, you probably think of correlation. Yes, correlation is one of the
                                                    most commonly used statistics — but it’s also one of the most misunder-
                                                    stood and misused, especially throughout the media.
                                                    Some important points about correlation are as follows (see Chapter 18 for
                                                    all the additional information):
                                                     ✓ The statistical definition of correlation (denoted by r) is the mea-
                                                        sure of strength and direction of the linear relationship between
                                                        two numerical variables. A correlation tells you whether the variables
                                                        increase together or go in opposite directions and the extent to which
                                                        the pattern is consistent across the data set.







                                                                                                                           3/25/11   8:12 PM
                             29_9780470911082-ch20.indd   326                                                              3/25/11   8:12 PM
                             29_9780470911082-ch20.indd   326
   337   338   339   340   341   342   343   344   345   346   347