Page 77 - Statistics II for Dummies
P. 77

Chapter 4: Getting in Line with Simple Linear Regression  61


                                that does this for you (computers use it in their calculations). Formulas also
                                exist for finding the slope and y-intercept of the best-fitting line by hand. The
                                best-fitting line based on your data is y = a + bx, where a estimates α and b
                                estimates β from the true model. (You can find those formulas in your Stats I
                                text or in Statistics For Dummies.)
                                To run a linear regression analysis in Minitab, go to Stat>Regression>
                                Regression. Highlight the response (y) variable in the left-hand box, and click
                                Select. The variable shows up in the Response Variable box. Then highlight
                                your explanatory (x) variable, and click Select. This variable shows up in the
                                Predictor Variable box. Click OK.

                                The equation of the line that best describes the relationship between average
                                textbook weight and average student weight is y = 3.69 + 0.113x, where x
                                is the average student weight for that grade, and y is the average textbook
                                weight. Figure 4-2 shows the Minitab output of this analysis.


                                   The regression equation is
                        Figure 4-2:   textbook wt = 3.69 + 0.113 student wt
                      Simple linear
                       regression
                         analysis   Predictor     Coef  SE Coef    T      P
                          for the   Constant     3.694   1.395  2.65  0.024
                        textbook-  student wt  0.11337  0.01456  7.78  0.000
                          weight
                         example.  S = 1.51341     R-Sq = 85.8%     R-Sq(adj) = 84.4%



                                By writing y = 3.69 + 0.113x, you mean that this equation represents your estimated
                                value of y, given the value of x that you observe with your data. Statisticians tech-
                                                                                              y
                                nically write this equation by using a caret (or hat as statisticians call it), like ˆ, so
                                everyone can know it’s an estimate, not the actual value of y. This y-hat is your esti-
                                mate of the average value of y over the long term, based on the observed values
                                of x. However, in many Stats I texts, the hat is left off because statisticians have
                                an unwritten understanding as to what y represents. This issue comes up again
                                in Chapters 5 through 8. (By the way, if you think y-hat is a funny term here, it’s
                                even funnier in Mexico, where statisticians call it y-sombrero — no kidding!)



                                The y-intercept of the regression line

                                Selected parts of that Minitab output shown in Figure 4-2 are of importance
                                to you at this point. First, you can see that under the Coef column you have
                                the numerical values on the right side of the equation of the line — in other
                                words, the slope and y-intercept. The number 3.69 represents the coefficient
                                of “Constant,” which is a fancy way of saying that’s the y-intercept (because







          09_466469-ch04.indd   61                                                                   7/24/09   10:20:36 AM
   72   73   74   75   76   77   78   79   80   81   82