Page 275 - Applied statistics and probability for engineers
P. 275

Section 7-3/General Concepts of Point Estimation     253


                                         some of the standard probability distributions, such as the exponential and Weibull dis-
                                         tributions. A relatively new computer-intensive technique, the bootstrap, can be used to
                                         solve this problem.
                                            To explain how the bootstrap works, suppose that we have a random variable X with
                                         a known probability density function characterized by a parameter θ, say f x( ; )θ . Also
                                         assume that we have a random sample of data from this distribution, x x 2 , ... ,  and that
                                                                                                     1 ,
                                                                                                           x n
                                                                                ˆ
                                         the estimate of θ based on this sample data is θ = 4 . . The bootstrap procedure would use
                                                                                    5
                                         the computer to generate bootstrap samples randomly from the probability distribution
                                                                                   ˆ B
                                                  5
                                          (
                                         f x; θ = 4 . )  and calculate a bootstrap  estimate θ . This process is repeated n  times,
                                                                                                             B
                                         resulting in:
                                                                             1
                                                                          1
                                                                                                    1 θ
                                                                          1 ,
                                                         Bootstrap sample 1: x x 2 ...,  x n 1 , Bootstrap estimate ^ B
                                                                             2
                                                                          2
                                                                          1 ,
                                                         Boootstrap sample 2: x x 2 ...,  x n 2 , Bootstrap estimate ^ B 2 θ
                                                                               ⋮
                                                                       nB  nB  nB  nB
                                                       B Bootstrap sample n B  : x 1 , x 2 ...,  x n  ,  Bootstrap estimmate ^ B
                                                                                                     θn
                                                                                                       B
                                         Typically, the number of bootstrap samples is n  = 100 or 200. The sample mean of the boot-
                                                                               B
                                         strap estimates is
                                                                                 n B
                                                                          θ =  1  ∑  θ i B ˆ
                                                                           B
                                                                               n B i=1
                                                                      ˆ
                                         The bootstrap standard error of θ is just the sample standard deviation of the bootstrap
                                                 ˆ B
                                         estimates θ i  or
                                                                                 ∑
                                                                                     B ˆ
                                                                   SE B (θ) =  1  n B  (θ i − θ )               (7-7)
                                                                       ˆ
                                                                                         B 2
                                                                            n B −1  i=1
                                         Some authors use n  in the denominator of Equation 7-7.
                                                         B
                     Example 7-6     Bootstrap Standard Error  A team of analytics specialists has been investigating the cycle time
                                     to process loan applications. The specialists' experience with the process informs them that cycle
                     time is normally distributed with a mean of about 25 hours. A recent random sample of 10 applications gives the
                     following (in hours):
                        24.1514, 27.4145, 20.4000, 22.5151, 28.5152, 28.5611, 21.2489, 20.9983, 24.9840, 22.6245
                        The sample standard deviation of these observations is s = 3.11407. We want to i nd a bootstrap standard error for
                     the sample standard deviation. We use a computer program to generate n  = 200 bootstrap samples from a normal dis-
                                                                               B

                     tribution with a mean of 25 and a standard deviation of 3.11417. The irst of these samples is:
                        25.4274, 24.2272, 24.8565, 24.3458, 18.4343, 23.3179, 23.0699, 25.2876, 27.1541, 27.2932
                     from which we calculate s = 2.50635. After all 200 bootstrap samples were generated, the average of the bootstrap esti-
                     mates of the standard deviation was 3.03972, and the bootstrap estimate of the standard error was 0.5464. The standard
                     error is fairly large because the sample size here (n = 10) is fairly small.

                                         In some problem situations, the distribution of the random variable is not known. The bootstrap
                                         can still be used in these situations. The procedure is to treat the data sample as a population
                                         and draw bootstrap samples from it. So, for example, if we had a sample of 25 observations,
                                         we would draw n  bootstrap samples by sampling with replacement from the original sample.
                                                       B
                                         Then we would proceed as in the preceding example to calculate the bootstrap estimate of the
                                         standard error for the statistic of interest.
   270   271   272   273   274   275   276   277   278   279   280