Page 158 - Intermediate Statistics for Dummies
P. 158

12_045206 ch07.qxd  2/1/07  9:54 AM  Page 137
                                                    After you know that a quadratic polynomial seems to be a good fit for the
                                                    data, the next challenge is finding the equation for that particular parabola
                                                    that fits the data, among all the possible parabolas out there. Remember from
                                                                                                    2
                                                    algebra that the general equation of a parabola is y = ax + bx + c. Now you
                                                    have to find the values of a, b, and c that create the best-fitting parabola to
                                                    the data (just like you find the a and the b that create the best-fitting line to
                                                    data in a linear regression model). That is the object of the regression model.
                                                    Say that you fit a quadratic regression model to the quiz-score data by using
                                                    Minitab (see the Minitab output in Figure 7-5 and the instructions for using
                                                    Minitab to fit this model in the previous section). On the top line of the
                                                    output, you can see that the equation of the best-fitting parabola is quiz
                                                    score = 9.82 – 6.15  study time + 1.00  study time squared. (Note that y is
                                                                    *
                                                                                    *
                                                    quiz score and x is study time in this example because you’re using study
                                                    time to predict quiz score.)
                                           Figure 7-5: Chapter 7: When Data Throws You a Curve: Using Nonlinear Regression  137
                                                     Polynomial Regression Analysis: Quiz Score versus Study Time
                                             Minitab
                                           output for
                                                     The regression equation is
                                             fitting a  Quiz score = 9.823 − 6.149 study time + 1.003 study time**2
                                          parabola to
                                            the quiz-  S = 1.04825   R−Sq = 91.7%   R−Sq(adj) = 90.7%
                                          score data.
                                                    The scatterplot of the quiz-score data and the parabola that was fit to the
                                                    data via the regression model is shown in Figure 7-6. From algebra, you may
                                                    remember that a positive coefficient on the quadratic term (here a = 1.00)
                                                    means the bowl is right-side-up, which you can see is the case here.
                                                    Looking at Figure 7-6, it appears that the quadratic model fits this data pretty
                                                    well, because the data fall closely to the curve that Minitab found. However,
                                                    data analysts can’t live by scatterplots alone. In the next section, you figure
                                                    out how to assess the fit of a polynomial model in more detail.
                                                    Assessing the fit of a polynomial model
                                                    You have made a scatterplot of your data, and you saw a curved pattern. You
                                                    used polynomial regression to fit a model to the data; the model appears to
                                                    fit well because the points follow closely to the curve Minitab found. But
                                                    don’t stop there. To make sure your results can be generalized to the popula-
                                                    tion from which your data was taken, you need to do a little more checking
                                                    beyond just the graph to make sure your model fits well.
   153   154   155   156   157   158   159   160   161   162   163