Page 149 - Statistics II for Dummies
P. 149

Chapter 7: Getting Ahead of the Learning Curve with Nonlinear Regression  133


                                  5. Assess the fit of the model by looking at the scatterplot of the log(y)
                                                                  2
                                    data, checking out the value of R  (adjusted) for the straight-line
                                    model for log(y), and checking the residual plots for the log(y) data.
                                      The techniques and criteria you use to do this are the same as those I dis-
                                    cuss in the previous section “Assessing the fit of a polynomial model.”

                                If these steps seem dubious to you, stick with me. The example in the next
                                section lets you see each step firsthand, which helps a great deal. In the end,
                                actually finding predictions by using an exponential model is a lot easier to
                                do than it is to explain.


                                Spreading secrets at an exponential rate


                                Often, the best way to figure something out is to see it in action. Using the
                                secret-spreading example from Figure 7-2, you can work through the series
                                of steps from the preceding section to find the best-fitting exponential model
                                and use it to make predictions.

                                Step one: Check the scatterplot
                                Your goal in step one is to make a scatterplot of the secret-spreading data
                                and determine whether the data resemble the curved function of an expo-
                                nential model. Figure 7-2 shows the data for the spread of a number of people
                                knowing the secret, as a function of the number of days. You can see that the
                                number of people who know the secret starts out small, but then as more
                                and more people tell more and more people, the number grows quickly until
                                the secret isn’t a secret anymore. This is a good situation for an exponential
                                model, due to the amount of upward curvature in this graph.

                                Step two: Let Minitab do its thing to log(y)
                                In step two, you let Minitab find the best-fitting line to the log(y) data (see
                                the section “Searching for the best exponential model” to find out how to do
                                this in Minitab). The output for the analysis of the secret-spreading data is in
                                Figure 7-12; you can see that the best-fitting line is log(y) = –0.19 + 0.28 * x,
                                where y is the number of people knowing the secret and x is the number
                                of days.



                       Figure 7-12:   Regression Analysis: Day versus Number
                       Minitab fits
                       a line to the   The regression equation is
                         log(y) for   logten (number) = − 0.1883 + 0.2805 day
                       the secret-
                        spreading   S = 0.157335   R-Sq = 93.3%   R-Sq(adj) = 91.6%
                           data.







          12_466469-ch07.indd   133                                                                   7/24/09   9:39:11 AM
   144   145   146   147   148   149   150   151   152   153   154