Page 96 - Intermediate Statistics for Dummies
P. 96

09_045206 ch04.qxd  2/1/07  9:49 AM  Page 75
                                                                Chapter 4: Getting in Line with Simple Linear Regression
                                                    By writing y = 3.69 + 0.113x, you mean that this equation represents your
                                                    estimated value of y, given the value of x that you observe with your data.
                                                    Statisticians write this equation by using a carrot (or hat as statisticians call
                                                           /
                                                    it), like  y, so everyone can know it’s an estimate, not the actual value of y.
                                                    This y-hat is your estimate of the average value of y over the long term, based
                                                    on the observed values of x. However, in many intro stats texts, the hat is left
                                                    off because statisticians have an unwritten understanding as to what y repre-
                                                    sents. This issue comes up again in Chapters 5 through 8. (By the way, if you
                                                    think y-hat is a funny term here, it’s even funnier in Mexico, where statisti-
                                                    cians call it y-sombrero — no kidding!)
                                                    The y-intercept of the regression line
                                                    Selected parts of that Minitab output shown in Figure 4-2 are of importance to
                                                    you at this point. First, you can see that under the column “Coef” you have
                                                    the numerical values on the right side of the equation of the line — in other
                                                    words, the slope and y-intercept. The number 3.69 represents the coefficient  75
                                                    of “Constant,” which is a fancy way of saying that’s the y-intercept (because
                                                    the y-intercept is just a constant, it never changes). The y-intercept is the
                                                    point where the line crosses the y-axis, in other words, the value of y when
                                                    x equals 0.
                                                    The y-intercept of a regression line may or may not have a practical meaning
                                                    depending on the situation. To determine whether the y-intercept of a regres-
                                                    sion line has practical meaning, look at the following:
                                                       Does the y-intercept fall within the actual values in the data set? If yes,
                                                        then it has practical meaning.
                                                       Does the y-intercept fall into negative territory where negative y-values
                                                        aren’t possible? For example if the y-values, are weights, they can’t be
                                                        negative. Then the y-intercept has no practical meaning. It is still correct
                                                        though, because it just happens to be the place where the line, if
                                                        extended to the y-axis, crosses the y-axis.
                                                       Does the value x = 0 have practical meaning? For example, if x is temper-
                                                        ature at a football game in Green Bay, then x = 0 is a value that’s relevant
                                                        to examine. If x = 0 has practical meaning, then the y-intercept would
                                                        also because it represents the value of y when x = 0. If not, for example,
                                                        when x represents height of a toddler, then the y-intercept has no practi-
                                                        cal meaning.
                                                    In the textbook example, the y-intercept doesn’t really have a practical mean-
                                                    ing because students don’t weigh zero pounds, so you don’t really care what
                                                    the estimated textbook weight is for that situation. But you do need to find a
                                                    line that fits the data you do have (where average student weights go from
   91   92   93   94   95   96   97   98   99   100   101