Page 180 - Statistics for Dummies
P. 180

164        Part III: Distributions and the Central Limit Theorem




                                    to denote actual outcomes of random variables. A distribution is a listing,
                                    graph, or function of all possible outcomes of a random variable (such as X)
                                    and how often each actual outcome (x), or set of outcomes, occurs. (See
                                    Chapter 8 for more details on random variables and distributions.)

                                    For example, suppose a million of your closest friends each rolls a single die
                                    and records each actual outcome (x). A table or graph of all these possible
                                    outcomes (one through six) and how often they occurred represents the
                                    distribution of the random variable X. A graph of the distribution of X in this
                                    case is shown in Figure 11-1a. It shows the numbers 1–6 appearing with equal
                                    frequency (each one occurring  ⁄6 of the time), which is what you expect over
                                                                1
                                    many rolls if the die is fair.
                                    Now suppose each of your friends rolls this single die 50 times (n = 50) and
                                    records the average,  . The graph of all their averages of all their samples
                                    represents the distribution of the random variable  . Because this distribu-
                                    tion is based on sample averages rather than individual outcomes, this distri-
                                    bution has a special name. It’s called the sampling distribution of the sample
                                    mean,  . Figure 11-1b shows the sampling distribution of  , the average of
                                    50 rolls of a die.

                                    Figure 11-1b (average of 50 rolls) shows the same range (1 through 6) of out-
                                    comes as Figure 11-1a (individual rolls), but Figure 11-1b has more possible
                                    outcomes. You could get an average of 3.3 or 2.8 or 3.9 for 50 rolls, for exam-
                                    ple, whereas someone rolling a single die can only get whole numbers from
                                    1 to 6. Also, the shape of the graphs are different; Figure 11-1a shows a flat
                                    shape, where each outcome is equally likely, and Figure 11-1b has a mound
                                    shape; that is, outcomes near the center (3.5) occur with high frequency and
                                    outcomes near the edges (1 and 6) occur with extremely low frequency. A
                                    detailed look at the differences and similarities in shape, center, and spread
                                    for individuals versus averages, and the reasons behind them, is the topic of
                                    the following sections. (See Chapter 8 if you need background info on shape,
                                    center, and spread of random variables before diving in.)


                          The Mean of a Sampling Distribution



                                    Using the die-rolling example from the preceding section, X is a random vari-
                                    able denoting the outcome you can get from a single die (assuming the die
                                    is fair). The mean of X (over all possible outcomes) is denoted by    (pro-
                                    nounced mu sub-x); in this case its value is 3.5 (as shown in Figure 11-1a). If
                                    you roll a die 50 times and take the average, the random variable   repre-
                                    sents any outcome you could get. The mean of  , denoted    (pronounced
                                    mu sub-x-bar) equals 3.5 as well. (You can see this result in Figure 11-1b.)











              17_9780470911082-ch11.indd   164                                                             3/25/11   10:01 PM
   175   176   177   178   179   180   181   182   183   184   185