Page 308 - Statistics for Dummies
P. 308

292
                                         Part V: Statistical Studies and the Hunt for a Meaningful Relationship
                                                    The slope, m, for the best-fitting line for the subset of cricket chirps versus
                                                                                           . So as the number of chirps
                                                    temperature data is
                                                    increases by 1 chirp per 15 seconds, the temperature is expected to increase
                                                    by 0.90 degrees Fahrenheit on average. To get a more meaningful interpre-
                                                    tation, you can multiply the top and bottom of the slope by 10 and say as
                                                    chirps increase by 10 (per 15 seconds) temperature increases 9 degrees
                                                    Fahrenheit.
                                                    Now, to find the y-intercept, b, you take
                                                    the best-fitting line for predicting temperature from cricket chirps based on
                                                    the data is y = 0.90x + 43.15, or temperature (in degrees Fahrenheit) = 0.90 ∗
                                                    (number of chirps in 15 seconds) + 43.2. Now can you use the y-intercept to pre-
                                                    dict temperature when no chirping is going on at all? Because no data was col-
                                                    lected at or near this point, you cannot make predictions for temperature in this
                                                    area. You can’t predict temperature using crickets if the crickets are silent.
                                         Making Proper Predictions                        , or 67 – (0.90)(26.5) = 43.15. So
                                                    After you have determined a strong linear relationship and you find the equa-
                                                    tion of the best fitting line using y = mx + b, you use that line to predict (the
                                                    average) y for a given x-value. To make predictions, you plug the x-value into
                                                    the equation and solve for y. For example, if your equation is y = 2x + 1 and
                                                    you want to predict y for x = 1, then plug 1 into the equation for x to get
                                                    y = 2(1) + 1 = 3.
                                                    Keep in mind that you choose the values of X (the explanatory variable)
                                                    that you plug in; what you predict is Y, the response variable, which totally
                                                    depends on X. By doing this, you are using one variable that you can easily
                                                    collect data on to predict a Y variable that is difficult or not possible to mea-
                                                    sure. This process works well as long as X and Y are correlated. This concept
                                                    is the big idea of regression.
                                                    Using the examples from the previous section, the best-fitting line for the
                                                    crickets is y = 0.90x + 43.2. Say you’re camping outside, listening to the crick-
                                                    ets, and remember you can predict temperature by counting cricket chirps.
                                                    You count 35 chirps in 15 seconds, put in 35 for x, and find that y = 0.9(35) +
                                                    43.2 = 74.7. (Yeah, you memorized the formula before you went camping just
                                                    in case you needed it.) So because the crickets chirped 35 times in 15 sec-
                                                    onds, you figure the temperature is probably about 75 degrees Fahrenheit.
                                                    Just because you have a regression line doesn’t mean you can plug in any value
                                                    for X and do a good job of predicting Y. Making predictions using x-values that
                                                    fall outside the range of your data is a no-no. Statisticians call this extrapolation;
                                                    watch for researchers who try to make claims beyond the range of their data.








                                                                                                                           3/25/11   8:13 PM
                             26_9780470911082-ch18.indd   292                                                              3/25/11   8:13 PM
                             26_9780470911082-ch18.indd   292
   303   304   305   306   307   308   309   310   311   312   313