Page 235 - Applied Statistics And Probability For Engineers

P. 235

c06.qxd 5/14/02 9:54 M Page 196 RK UL 6 RK UL 6:Desktop Folder:TEMP WORK:MONTGOMERY:REVISES UPLO D CH114 FIN L:Quark Files:

196 CHAPTER 6 RANDOM SAMPLING AND DATA DESCRIPTION

Population

µ
σ
, x , x ,…, x )
Sample (x 1 2 3 n
x, sample average
s, sample standard
deviation
Histogram

Figure 6-3 Relation-
ship between a popula-
x x
tion and a sample. s

chassis structural element to be normally distributed with mean and variance 2 . We could
refer to this as a normal population or a normally distributed population.
In most situations, it is impossible or impractical to observe the entire population. For ex-
ample, we could not test the tensile strength of all the chassis structural elements because it
would be too time consuming and expensive. Furthermore, some (perhaps many) of these
structural elements do not yet exist at the time a decision is to be made, so to a large extent,
we must view the population as conceptual. Therefore, we depend on a subset of observations
from the population to help make decisions about the population.

Deﬁnition
A sample is a subset of observations selected from a population.

For statistical methods to be valid, the sample must be representative of the population. It
is often tempting to select the observations that are most convenient as the sample or to exer-
cise judgment in sample selection. These procedures can frequently introduce bias into the
sample, and as a result the parameter of interest will be consistently underestimated (or over-
estimated) by such a sample. Furthermore, the behavior of a judgment sample cannot be statis-
tically described. To avoid these difﬁculties, it is desirable to select a random sample as the
result of some chance mechanism. Consequently, the selection of a sample is a random exper-
iment and each observation in the sample is the observed value of a random variable. The
observations in the population determine the probability distribution of the random variable.
To deﬁne a random sample, let X be a random variable that represents the result of one se-
lection of an observation from the population. Let f(x) denote the probability density function
of X. Suppose that each observation in the sample is obtained independently, under unchanging
conditions. That is, the observations for the sample are obtained by observing X independently
under unchanging conditions, say, n times. Let X i denote the random variable that represents
the ith replicate. Then, X , X , p , X n is a random sample and the numerical values obtained
2
1
are denoted as x , x , p , x . The random variables in a random sample are independent with
2
n
1
the same probability distribution f(x) because of the identical conditions under which each
observation is obtained. That is, the marginal probability density function of X , X , p , X n is
2
1

230 231 232 233 234 235 236 237 238 239 240