Page 186 - Statistics for Dummies
P. 186

170        Part III: Distributions and the Central Limit Theorem




                                    Case 1: The distribution of X is normal


                                    If X has a normal distribution, then   does too, no matter what the sample
                                    size n is. In the example regarding the amount of time (X) for a clerical
                                    worker to complete a task (refer to the section “Sample size and standard
                                    error”), you knew X had a normal distribution (refer to the lowest curve in
                                    Figure 11-2). If you refer to the other curves in Figure 11-2, you see the aver-
                                    age times for samples of n = 10 and n = 50 clerical workers, respectively, also
                                    have normal distributions.
                                    When X has a normal distribution, the sample means also always have a
                                    normal distribution, no matter what size samples you take, even if you take
                                    samples of only 2 clerical workers at a time.

                                    The difference between the curves in Figure 11-2 is not their means or their
                                    shapes, but rather their amount of variability (how close the values in the
                                    distribution are to the mean). Results based on large samples vary less and
                                    will be more concentrated around the mean than results from small samples
                                    or results from the individuals in the population.


                                    Case 2: The distribution of X is not normal —
                                    enter the Central Limit Theorem


                                    If X has any distribution that is not normal, or if its distribution is unknown,
                                    you can’t automatically say the sample mean ( ) has a normal distribution.
                                    But incredibly, you can use a normal distribution to approximate the distribu-
                                    tion of   — if the sample size is large enough. This momentous result is due
                                    to what statisticians know and love as the Central Limit Theorem.
                                    The Central Limit Theorem (abbreviated CLT) says that if X does not have a
                                    normal distribution (or its distribution is unknown and hence can’t be deemed
                                    to be normal), the shape of the sampling distribution of   is approximately
                                    normal, as long as the sample size, n, is large enough. That is, you get an
                                    approximate normal distribution for the means of large samples, even if the
                                    distribution of the original values (X) is not normal.
                                  	 Most statisticians agree that if n is at least 30, this approximation will be rea-
                                    sonably close in most cases, although different distribution shapes for X have
                                    different values of n that are needed. The larger the sample size (n), the closer
                                    the distribution of the sample means will be to a normal distribution.















              17_9780470911082-ch11.indd   170                                                             3/25/11   10:01 PM
   181   182   183   184   185   186   187   188   189   190   191