Page 92 - Intermediate Statistics for Dummies
P. 92

09_045206 ch04.qxd  2/1/07  9:49 AM  Page 71
                                                                Chapter 4: Getting in Line with Simple Linear Regression
                                                                                         Average Textbook Wt. (lbs.)
                                                                Average Student Wt. (lbs.)
                                                      Grade
                                                      6
                                                                                         13.61
                                                                85.00
                                                                                         15.13
                                                                89.00
                                                      7
                                                                99.00
                                                                                         15.47
                                                      8
                                                      9
                                                                                         17.36
                                                                112.00
                                                      10
                                                                                         18.07
                                                                123.00
                                                                                         20.79
                                                      11
                                                                134.00
                                                                142.00
                                                      12
                                                                                         16.06
                                                    In this section, you begin exploring whether or not a relationship exists
                                                    between these two quantitative variables. You start by displaying the pairs of
                                                    data using a two-dimensional scatterplot to look for a possible pattern, and  71
                                                    you quantify the strength and direction of that pattern using the correlation
                                                    coefficient.
                                                    Data analysts should never make any conclusions about a relationship
                                                    between x and y based solely on either the correlation or the scatterplot
                                                    alone; the two elements need to be examined together. It is possible (but of
                                                    course not a good idea) to manipulate graphs to look better or worse than
                                                    they really are just by changing the scales on the axes. Because of this, statis-
                                                    ticians never go with the scatterplot alone to determine whether or not a
                                                    linear relationship exists between x and y. A correlation without a scatterplot
                                                    is dangerous too, because the relationship between x and y may be very
                                                    strong, but just not linear.
                                                    Using scatterplots to explore relationships
                                                    In order to explore a possible relationship between two variables, such as
                                                    textbook weight and student weight, you first plot the data in a special graph
                                                    called a scatterplot. A scatterplot is a two-dimensional graph that displays
                                                    pairs of data, one pair per observation in the (x, y) format. Figure 4-1 shows
                                                    a scatterplot of the textbook weight data from Table 4-1.
                                                    You can see that the relationship appears to follow the straight line that’s
                                                    included on the graph, except possibly for the last point, where textbook
                                                    weight is 16.06 pounds and student weight is 142 pounds (for grade 12). This
                                                    point appears to be an outlier — it’s the only point that doesn’t fall into the
   87   88   89   90   91   92   93   94   95   96   97