Page 109 - Statistics II for Dummies
P. 109

Chapter 5: Multiple Regression with Two X Variables   93


                                so you can use the resulting equation to estimate y. This specific model is
                                the best-fitting multiple linear regression model. This section tells you how to
                                get, interpret, and test those coefficients in order to complete step five in the
                                multiple regression analysis.

                                Finding the best-fitting linear equation is like finding the best-fitting line in
                                simple linear regression, except that you’re not finding a line. When you have
                                two x variables in multiple regression, for example, you’re estimating a best-
                                fitting plane for the data.


                                Getting the multiple regression

                                coefficients

                                In the simple linear regression model, you have the straight line y = b  + b x;
                                                                                            0  1
                                the coefficient of x is the slope, and it represents the change in y per unit
                                change in x. In a multiple linear regression model, the coefficients b , b ,
                                                                                          1  2
                                and so on quantify in a similar matter the sole contribution that each
                                corresponding x variable (x , x ) makes in predicting y. The coefficient b
                                                        1  2                                  0
                                indicates the amount by which to adjust all these values in order to provide
                                a final fit to the data (like the y-intercept does in simple linear regression).

                                Computer software does all the nitty-gritty work for you to find the proper
                                coefficients (b , b , and so on) that fit the data best. The coefficients that
                                            0  1
                                Minitab settles on to create the best-fitting model are the ones that, as a
                                group, minimize the sum of the squared residuals (sort of like the variance
                                in the data around the selected model). The equations for finding these
                                coefficients by hand are too unwieldy to include in this book; a computer
                                can do all the work for you. The results appear in the regression output in
                                Minitab. You can find the multiple regression coefficients (b , b , b , . . . , b )
                                                                                    0  1  2     k
                                on the computer output under the column labeled COEF.
                                To run a multiple regression analysis in Minitab, click on Stat>Regression>
                                Regression. Then choose the response variable (y) and click on Select. Then
                                choose your predictor variables (x variables), and click Select. Click on OK,
                                and the computer will carry out the analysis.

                                For the plasma TV sales example from the previous sections, Figure 5-3
                                shows the multiple regression coefficients in the COEF column for the
                                multiple regression model. The first coefficient (5.257) is just the constant
                                term (or b  term) in the model and isn’t affiliated with any x variable. This
                                         0
                                constant just sort of goes along for the ride in the analysis; it’s the number
                                that you tack on the end to make the numbers work out right. The second
                                coefficient in the COEF column is 0.162; this value is the coefficient of the
                                x  (TV ad amount) term, also known as b . The third coefficient in the COEF
                                 1                                 1
                                column is 0.249, which is the value for b  in the multiple regression model
                                                                   2
                                and is the coefficient that goes with x  (newspaper ad amount).
                                                                2





          10_466469-ch05.indd   93                                                                    7/24/09   9:32:34 AM
   104   105   106   107   108   109   110   111   112   113   114