Page 318 - Statistics for Environmental Engineers

P. 318

L1592_frame_C37.fm Page 327 Tuesday, December 18, 2001 3:20 PM

Weighted Least Squares

KEY WORDS calibration, conﬁdence interval, constant variance, ion chromatography, least squares,
nitrate, regression, simulation, variance, weighted least squares, weights, weighted regression.

Weighted least squares (weighted regression) is used to ﬁt a model y = f(β, x) when the variance of the
response y is not constant. This method is often needed to ﬁt calibration curves where x and y cover
wide ranges of concentration and response signal. As the range of values gets larger, it becomes more
likely that there will be some proportionality between the level of the response and the variance of the
response.
Figure 37.1 shows four possible situations: (a) a straight line with random measurement errors of equal
variance at all concentrations, (b) a straight line with random measurement errors that are proportional
to concentration, (c) a curved function with random measurement errors of constant variance, and (d) a
curved function with random measurement errors that are proportional to concentration.
For curves a and c the magnitude of the error in y is the same over the full range of x. We can think
of these curves as having random measurement error in the range of ± ∆ mg/L added or subtracted from
the true response, where ∆ does not change as the concentration changes. In contrast, for curves b and
d the magnitude of the error increases as x and y increase. The error for these curves tends to be a
percentage of the response. We can think of these curves as having random measurement error in the
range of ± ∆% of the true response level added or subtracted from the true response. Then the measure-
ment error in units of milliliters per liter (mg/L) does change over the range of concentration values.
Calibration curve a is the most familiar (see Chapter 36) but there is no theoretical reason for assuming
that a straight line will be the correct calibration model. There are factors inherent in some measurement
processes that make curve b much more likely than curve a (e.g., ICP instruments). Also, there are some
instruments for which a straight-line calibration is not expected (e.g., HPLC instruments). Therefore,
the analyst must know how to recognize and how to treat calibration data for these four patterns.
The analyst must decide whether the data have constant variance and if not, what weights should be
assigned to each y value. In addition, the analyst is also trying to decide the form of the ﬁtted function
(straight line or some curved function). These decisions will have a profound effect on conﬁdence
intervals and prediction intervals. The conﬁdence intervals of parameter estimates will be wrong if
weighted regression has not been used when there is nonconstant variance. Also, prediction intervals
for the model line and specimen concentrations will be wrong unless proper weighting and model form
have been used.

Case Study: Ion Chromatograph Calibration
The nitrate calibration data in Table 37.1 were made on a Dionex-120 ion chromatograph. There are
triplicate measurements at 13 concentration levels covering a range from 0 to 40 mg/L nitrate.
Suppose we assume that the calibration model is a straight line and ﬁt it using ordinary least squares
to obtain:
y ˆ = −3246 + 8795x

313 314 315 316 317 318 319 320 321 322 323