Page 321 - Statistics for Environmental Engineers

P. 321

L1592_frame_C37.fm Page 330 Tuesday, December 18, 2001 3:20 PM

intervals, prediction intervals, and probabilities associated with these quantities. Thus, weighting is
important even if it does not make a notable difference in the position of the line.

Theory: Weighted Least Squares

The following is a general statement of the least squares criterion. It is used for all models, linear and
otherwise, and for constant or nonconstant variance. If the values of the response are y 1 , y 2 ,…y n and if
2 2 2
the variances of these observations are σ 1 ,σ 2 ,…,σ n , then the parameter estimates that individually and
uniquely have the smallest variance will be obtained by minimizing the weighted sum of squares:
minimize S = ∑ w i y i η) 2
(
–
where η is the response calculated from the proposed model, y i is the observation at a speciﬁed value
2
of x i , and w i is the weight assigned to observation y i . The w i will be proportional to 1/σ i . If the variance is
constant σ 1 =( 2 σ 2 = … = σ n ) , all w i = 1, and each observation has an equal opportunity to determine
2
2
the calibration curve. If the variance is not constant, the least accurate measurements are assigned a
small weight and the most accurate measurements are assigned large weights. This prevents the least
accurate measurements from dominating the outcome of the regression.
The least squares parameter estimates for a general linear model η = β 0 + β 1 x + β 2 x + … + β n x n
2
are obtained from:
[
minimize S = ∑ w i y i η–( ) = ∑ w i y i – ( β 0 + β 1 x i + … + β n x i )] 2
n
2
The analytical solution for a straight-line model applied to calibration is given in Gibbons (1994), Otto
(1999), and Zorn et al. (1997, 1999).

Determining the Appropriate Weights
If the variance is not constant, the magnitude of the weights will depend somehow on the magnitude of
the variance. We present two ways in which the weights might be assigned.

Method 1
2 2
s i
The weights are inversely proportional to the variance of each observation (w i = 1/σ i ) where is used
2
as an estimate of σ i . Obviously this method can only be used when there are replicate measurements
2
to calculate the . The weights may increase smoothly from low to high levels of the analyte, as shown
s i
in the left-hand panel of Figure 37.5, or they might be as shown in the other two panels. Using weights
that are inversely proportional to the variance will deal with any of these cases.
Variance of y 10000 • • • • • • • • • • • • • • • • • • • • • • • •
1000
100
10
1
0.1 1 10 100 0.1 1 • 10 100 0.1 1 10 100
y y y

FIGURE 37.5 Possible patterns of variance and concentration. Using weights that are inversely proportional to the variance
2
(w i = 1/s i ) will deal with any of these cases.
© 2002 By CRC Press LLC

316 317 318 319 320 321 322 323 324 325 326