Page 138 - Statistics II for Dummies
P. 138

122        Part II: Using Different Types of Regression to Make Predictions



                                Searching for the best polynomial model


                                When fitting a polynomial regression model to your data, you always start
                                with a scatterplot so you can look for patterns; the scatterplot will give you
                                some idea of the type of model that may work. Always start with the simplest
                                model possible and work your way up as needed. Don’t plunge in with a high-
                                order polynomial regression model right off the bat. Here are a couple of rea-
                                sons why:
                                  ✓ High-order polynomials are hard to interpret, and their models are
                                    complex. For example, with a straight line, you can interpret the values
                                    of the y-intercept and slope easily, but interpreting a tenth-degree poly-
                                    nomial is difficult (and that’s putting it mildly).

                                  ✓ High-order polynomials tend to cause overfitting. If you’re fitting the
                                    model as close as you can to every single point in a data set, your model
                                    may not hold for a new data set, meaning that your estimates for y could
                                    be way off.

                                To fit a polynomial to a data set in Minitab, go to Stat>Regression>Fitted Line
                                Plot> and click on the type of regression model you want: linear, quadratic, or
                                cubic. (It doesn’t go beyond a third-degree polynomial, but these options
                                should cover 90 percent of the cases where a polynomial is appropriate.) Click
                                on the y variable from the left-hand box, and click Select; this variable will
                                appear in the Response (y) box. Click on the x variable from the left-hand box,
                                and click Select; it will appear in the Predictor (x) box. Click OK.
                                Following are steps that you can use to see if a polynomial fits your data.
                                (Statistical software can jump in and fit the models for you after you tell it
                                which ones to fit.)

                                  1. Make a scatterplot of your data, and look for any patterns, such as a
                                    straight line or a curve.
                                  2. If the data resemble a straight line, try to fit a first-degree polynomial
                                    (straight line) to the data first: y = b  + b x.
                                                                    0   1
                                      If the scatterplot doesn’t show a linear pattern, or if the correlation isn’t
                                    close to +1 or –1, move to step three.
                                  3. If the data resemble the shape of a parabola, try to fit a second-degree
                                                               2
                                    polynomial: y = b  + b x + b x .
                                                    0   1    2
                                      If the data fit the model well, stop here and refer to the later section
                                    “Assessing the fit of a polynomial model.” If the model still doesn’t fit
                                    well, move to step four.













          12_466469-ch07.indd   122                                                                   7/24/09   9:39:08 AM
   133   134   135   136   137   138   139   140   141   142   143