Page 343 - Statistics for Dummies
P. 343

Chapter 20: Ten Tips for the Statistically Savvy Sleuth
                                                     ✓ The statistical term correlation is only used in the context of two
                                                        numerical variables (such as height and weight). It does not apply to
                                                        two categorical variables (such as political party and gender).
                                                        For example, voting pattern and gender may be related, but using the

                                                        word correlated to describe their relationship isn’t “sc” (statistically cor-
                                                        rect, get it?). You can say two categorical variables are associated.
                                                     ✓ If a strong correlation and scatterplot exist between two numerical
                                                        variables, you should be able to draw a straight line through the points,
                                                        and the points should lie close to the line. If a line doesn’t fit the data well,
                                                        the variables likely won’t have a strong correlation (r), and vice versa. (See
                                                        Chapter 18 for information on line-fitting, also known as linear regression.)
                                                        A weak correlation implies that a linear relationship doesn’t exist between

                                                        the two variables, but this doesn’t necessarily mean the variables aren’t
                                                        related at all. They may have some other type of relationship besides a
                                                        linear relationship. For example, bacteria multiply at an exponential rate
                                                        over time (their numbers explode, doubling faster and faster).
                                                     ✓ Correlation doesn’t automatically mean cause and effect. For example,   327
                                                        suppose Susan reports based on her observations that people who
                                                        drink diet soda have more acne than people who don’t. If you’re a diet
                                                        soda drinker, don’t break out just yet! This correlation may be a freak
                                                        coincidence that only happened to the people she observed. At most, it
                                                        means more research needs to be done (beyond observation) in order
                                                        to draw any connections between diet soda and acne. (Susan can read
                                                        Chapter 17 to find out how to design a good experiment.)
                                         Reveal Confounding Variables
                                                    A confounding variable is a variable that isn’t included in a study but whose
                                                    influence can affect the results and create confusing (confounding) conclu-
                                                    sions. For example, suppose a researcher reports that eating seaweed helps
                                                    you live longer, but when you examine the study, you find out that it was
                                                    based on a sample of people who regularly eat seaweed in their diets and are
                                                    over the age of 100. When you read the interviews of these people, you dis-
                                                    cover some of their other secrets to long life (besides eating seaweed): They
                                                    slept an average of 8 hours a day, drank a lot of water, and exercised every
                                                    day. So did the seaweed cause them to live longer? You can’t tell, because
                                                    several confounding variables (exercise, water consumption, and sleeping
                                                    patterns) may also have contributed.
                                                   The best way to control for confounding variables is to conduct a well-
                                                    designed experiment (see Chapter 17), which involves setting up two groups
                                                    that are alike in as many ways as possible, except that one group receives a
                                                    specified treatment and the other group receives a control (a fake treatment,
                                                    no treatment, or a standard, non-experimental treatment).You then compare





                                                                                                                           3/25/11   8:12 PM
                             29_9780470911082-ch20.indd   327                                                              3/25/11   8:12 PM
                             29_9780470911082-ch20.indd   327
   338   339   340   341   342   343   344   345   346   347   348