Page 225 - Statistics II for Dummies
P. 225

Chapter 12: Regression and ANOVA: Surprise Relatives!  209


                                Suppose you have a brainstorm that number of years of education could pos-
                                sibly be related to Internet use. In this case, the explanatory variable (input
                                variable, x) is years of education, and you want to use it to try to estimate
                                y, the number of hours spent on the Internet in a month. You ask a larger
                                random sample of 250 Internet users how many years of education they have
                                (so n = 250). You can check out the first ten observations from your data set
                                containing the (x, y) pairs in Table 12-1. If a significant connection of some
                                sort exists between the x-values and the y-values, then you can say that x is
                                helping to explain some of the variability in the y’s. If it explains enough vari-
                                ability, you can place x into a simple regression model and use it to estimate y.


                                   Table 12-1             First Ten Observations from the
                                                        Education and Internet Use Example
                                  Years of Education                Hours Spent on Internet (In One Month)
                                  15                                41
                                  15                                32
                                  11                                33
                                  10                                42
                                  10                                28
                                  10                                21
                                  10                                17
                                  10                                14
                                  9                                 18
                                  9                                 14


                                Getting results with regression


                                After you have a possible x variable picked, you collect pairs of data (x, y) on
                                a random sample of individuals from the population, and you look for a pos-
                                sible linear relationship between them. Looking at the small snippet of 10 out
                                of the 250-person data set in Table 12-1, you can begin to see that you may
                                have a pattern between education and Internet use. It looks like as education
                                increases so does Internet use.

                                To delve deeper, you make a scatterplot of the data and calculate the
                                correlation (r). If the data appear to follow a straight line (as shown on
                                the scatterplot), go ahead and perform a simple linear regression of the
                                response variable y based on the x variable. The p-value of the x variable











          18_466469-ch12.indd   209                                                                   7/24/09   9:45:28 AM
   220   221   222   223   224   225   226   227   228   229   230