Page 120 - Statistics II for Dummies
P. 120

104
                       Part II: Using Different Types of Regression to Make Predictions
                       Getting a Kick out of Estimating

                       Punt Distance


                                  Before you jump into a model selection procedure to predict y by using a set
                                  of x variables, you have to do some legwork. The variable of interest is y, and
                                  that’s a given. But where do the x variables come from? How do you choose
                                  which ones to investigate as being possible candidates for predicting y? And
                                  how do those possible x variables interact with each other toward making
                                  that prediction?

                                  You must answer all these questions before using any model selection pro-
                                  cedure. However, this part is the most challenging and the most fun; a com-
                                  puter can’t think up x variables for you!

                                  Suppose you’re at a football game and the opposing team has to punt the
                                  ball. You see the punter line up and get ready to kick the ball, and some
                                  questions come to you: “Gee, I wonder how far this punt will go? I wonder
                                  what factors influence the distance of a punt? Can I use those factors in a
                                  multiple regression model to try to estimate punt distance? Hmm, I think
                                  I’ll consult my Statistics II For Dummies book on this and analyze some data
                                  during halftime. . . .”

                                  Well, maybe that’s pushing it, but it’s still an interesting line of question-
                                  ing for football players, golfers, soccer players, and even baseball players.
                                  Everyone’s looking for more distance and a way to get it.

                                  In the following sections, you can see how to identify and assess different x
                                  variables in terms of their potential contribution to predicting y.


                                  Brainstorming variables
                                  and collecting data


                                  Starting with a blank slate and trying to think of a set of x variables that may
                                  be related to y may sound like a daunting task, but in reality, it’s probably
                                  not as bad as you think. Most researchers who are interested in predicting
                                  some variable y in the first place have some ideas about which variables may
                                  be related to it. After you come up with a set of logical possibilities for x, you
                                  collect data on those variables, as well as on y, to see what their actual rela-
                                  tionship with y may be.














                                                                                                       7/23/09   9:27:03 PM
           11_466469-ch06.indd   104                                                                   7/23/09   9:27:03 PM
           11_466469-ch06.indd   104
   115   116   117   118   119   120   121   122   123   124   125