Page 67 - Computational Statistics Handbook with MATLAB
P. 67

Chapter 3: Sampling Concepts                                     53


                             which is simply the product of the individual densities (or mass functions)
                             evaluated at each sample point.
                              There are two types of simple random sampling: sampling with replace-
                             ment and sampling without replacement. When we sample with replace-
                             ment, we select an object, observe the characteristic we are interested in, and
                             return the object to the population. In this case, an object can be selected for
                             the sample more than once. When the sampling is done without replacement,
                             objects can be selected at most one time. These concepts will be used in Chap-
                             ters 6 and 7 where the bootstrap and other resampling methods are dis-
                             cussed.
                              Alternative sampling methods exist. In some situations, these methods are
                             more practical and offer better random samples than simple random sam-
                             pling. One such method, called stratified random sampling, divides the pop-
                             ulation into levels, and then a simple random sample is taken from each level.
                             Usually, the sampling is done in such a way that the number sampled from
                             each level is proportional to the number of objects of that level that are in the
                             population. Other sampling methods include cluster sampling and system-
                             atic random sampling. For more information on these and others, see the
                             book by Levy and Lemeshow [1999].
                              Sometimes the goal of inferential statistics is to use the sample to estimate
                             or make some statements about a population parameter. Recall from Chapter
                             2 that a parameter is a descriptive measure for a population or a distribution
                             of random variables. For example, population parameters that might be of
                             interest include the mean (µ), the standard deviation (σ), quantiles, propor-
                             tions, correlation coefficients, etc.
                              A statistic is a function of the observed random variables obtained in a
                             random sample and does not contain any unknown population parameters.
                             Often the statistic is used for the following purposes:

                                • as a point estimate for a population parameter,
                                • to obtain a confidence interval estimate for a parameter, or
                                • as a test statistic in hypothesis testing.

                             Before we discuss some of the common methods for deriving statistics, we
                             present some of the statistics that will be encountered in the remainder of the
                                                                                      ,  ,   , of
                             text. In most cases, we assume that we have a random sample, X 1 … X n
                             independent, identically (iid) distributed random variables.



                                               aamm pleple
                                       an
                                        and
                                                         aancence
                                                     VVaarr ii
                                                ple
                             SSampleample  Me  eanandSandSa  amm pleV  Va  ar  ri  iaancence
                             SampleM
                             Sample
                                       an
                                    MM eeanandSS
                             A familiar statistic is the sample mean given by
                            © 2002 by Chapman & Hall/CRC
   62   63   64   65   66   67   68   69   70   71   72