Page 101 - Statistics II for Dummies
P. 101

Chapter 5: Multiple Regression with Two X Variables   85


                                Stepping through the analysis


                                Your job in conducting a multiple regression analysis is to do the following
                                (the computer can help you do steps three through six):

                                  1. Come up with a list of possible x variables that may be helpful in
                                    estimating y.
                                  2. Collect data on the y variable and your x variables from step one.
                                  3. Check the relationships between each x variable and y (using
                                    scatterplots and correlations), and use the results to eliminate those x
                                    variables that aren’t strongly related to y.
                                  4. Look at possible relationships between the x variables to make sure
                                    you aren’t being redundant (in statistical terms, you’re trying to avoid
                                    the problem of multicolinearity).
                                      If two x variables relate to y the same way, you don’t need both in the
                                    model.

                                  5. Use those x variables (from step four) in a multiple regression analysis
                                    to find the best-fitting model for your data.
                                  6. Use the best-fitting model (from step five) to predict y for given
                                    x-values by plugging those x-values into the model.

                                I outline each of these steps in the sections to follow.



                      Looking at x’s and y’s


                                The first step of a multiple regression analysis comes way before the number
                                crunching on the computer; it occurs even before the data is collected. Step
                                one is where you sit down and think about what variables may be useful in
                                predicting your response variable y. This step will likely take more time than
                                any other step, except maybe the data-collection process. Deciding which x
                                variables may be candidates for consideration in your model is a deal-breaking
                                step, because you can’t go back and collect more data after the analysis is
                                over.
                                Always check to be sure that your response variable, y, and at least one of the
                                x variables are quantitative. For example, if y isn’t quantitative but at least one
                                x is, a logistic regression model may be in order (see Chapter 8).















          10_466469-ch05.indd   85                                                                    7/24/09   9:32:32 AM
   96   97   98   99   100   101   102   103   104   105   106