Page 181 - Statistics for Dummies
P. 181

165
                                         Chapter 11: Sampling Distributions and the Central Limit Theorem



 to denote actual outcomes of random variables. A distribution is a listing,
 graph, or function of all possible outcomes of a random variable (such as X)
 and how often each actual outcome (x), or set of outcomes, occurs. (See
 Chapter 8 for more details on random variables and distributions.)  30

 For example, suppose a million of your closest friends each rolls a single die
 and records each actual outcome (x). A table or graph of all these possible   25
 outcomes (one through six) and how often they occurred represents the
 distribution of the random variable X. A graph of the distribution of X in this   20
 case is shown in Figure 11-1a. It shows the numbers 1–6 appearing with equal
 frequency (each one occurring  ⁄6 of the time), which is what you expect over
 1
 many rolls if the die is fair.         Percent  15
 Now suppose each of your friends rolls this single die 50 times (n = 50) and
 records the average,  . The graph of all their averages of all their samples   10
 represents the distribution of the random variable  . Because this distribu-
 tion is based on sample averages rather than individual outcomes, this distri-
 bution has a special name. It’s called the sampling distribution of the sample   5
 mean,  . Figure 11-1b shows the sampling distribution of  , the average of
 50 rolls of a die.                       0
                                                  1.0  1.5  2.0  2.5  3.0  3.5  4.0  4.5  5.0  5.5  6.0
                                                              Individual rolls of one die
 Figure 11-1b (average of 50 rolls) shows the same range (1 through 6) of out-
 comes as Figure 11-1a (individual rolls), but Figure 11-1b has more possible   a
 outcomes. You could get an average of 3.3 or 2.8 or 3.9 for 50 rolls, for exam-
 ple, whereas someone rolling a single die can only get whole numbers from   1.8
 1 to 6. Also, the shape of the graphs are different; Figure 11-1a shows a flat
 shape, where each outcome is equally likely, and Figure 11-1b has a mound   1.6
 shape; that is, outcomes near the center (3.5) occur with high frequency and
 outcomes near the edges (1 and 6) occur with extremely low frequency. A   1.4
 detailed look at the differences and similarities in shape, center, and spread
 for individuals versus averages, and the reasons behind them, is the topic of   1.2
 the following sections. (See Chapter 8 if you need background info on shape,
                                        Density
 center, and spread of random variables before diving in.)  1.0
                           Figure 11-1:   0.8
 The Mean of a Sampling Distribution  Distributions   0.6
                          of a) individ-
                            ual rolls of   0.4
 Using the die-rolling example from the preceding section, X is a random vari-  one die; and
 able denoting the outcome you can get from a single die (assuming the die   b) average   0.2
 is fair). The mean of X (over all possible outcomes) is denoted by    (pro-  of 50 rolls of
 nounced mu sub-x); in this case its value is 3.5 (as shown in Figure 11-1a). If   one die.  0.0  1.0  1.5  2.0  2.5  3.0  3.5  4.0  4.5  5.0  5.5  6.0
 you roll a die 50 times and take the average, the random variable   repre-        Average of 50 die rolls
 sents any outcome you could get. The mean of  , denoted    (pronounced   b
 mu sub-x-bar) equals 3.5 as well. (You can see this result in Figure 11-1b.)











              17_9780470911082-ch11.indd   165                                                             3/25/11   10:01 PM
   176   177   178   179   180   181   182   183   184   185   186