Page 297 - Statistics for Environmental Engineers

P. 297

L1592_frame_C34 Page 303 Tuesday, December 18, 2001 2:52 PM
34

Precision of Parameter Estimates in Linear Models

KEY WORDS conﬁdence interval, critical sum of squares, joint conﬁdence region, least squares, linear
regression, mean residual sum of squares, nonlinear regression, parameter correlation, parameter estima-
tion, precision, prediction interval, residual sum of squares, straight line.

Calculating the best values of the parameters is only half the job of ﬁtting and evaluating a model. The
precision of these estimates must be known and understood. The precision of estimated parameters in
a linear or nonlinear model is indicated by the size of their joint conﬁdence region. Joint indicates that
all the parameters in the model are considered simultaneously.

The Concept of a Joint Conﬁdence Region
When we ﬁt a model, such as η = β 0 + β 1 x or η = θ 1 [1 − exp(−θ 2 x)], the regression procedure delivers
a set of parameter values. If a different sample of data were collected using the same settings of x,
different y values would result and different parameter values would be estimated. If this were repeated
with many data sets, many pairs of parameter estimates would be produced. If these pairs of parameter
estimates were plotted as x and y on Cartesian coordinates, they would cluster about some central point
that would be very near the true parameter values. Most of the pairs would be near this central value,
but some could fall a considerable distance away. This happens because of random variation in the y
measurements.
The data (if they are useful for model building) will restrict the plausible parameter values to lie within
a certain region. The intercept and slope of a straight line, for example, must be within certain limits or
the line will not pass through the data, let alone ﬁt it reasonably well. Furthermore, if the slope is
decreased somewhat in an effort to better ﬁt the data, inevitably the intercept will increase slightly to
preserve a good ﬁt of the line. Thus, low values of slope paired with high values of intercept are plausible,
but high slopes paired with high intercepts are not. This relationship between the parameter values is
called parameter correlation. It may be strong or weak, depending primarily on the settings of the x
variables at which experimental trials are run.
Figure 34.1 shows some joint conﬁdence regions that might be observed for a two-parameter model.
Panels (a) and (b) show typical elliptical conﬁdence regions of linear models; (c) and (d) are for nonlinear
models that may have conﬁdence regions of irregular shape. A small joint conﬁdence region indicates
precise parameter estimates. The orientation and shape of the conﬁdence region are also important. It
may show that one parameter is estimated precisely while another is only known roughly, as in (b) where
β 2 is estimated more precisely than β 1 . In general, the size of the conﬁdence region decreases as the
number of observations increases, but it also depends on the actual choice of levels at which measure-
ments are made. This is especially important for nonlinear models. The elongated region in (d) could
result from placing the experimental runs in locations that are not informative.
The critical sum of squares value that bounds the (1 − α)100% joint conﬁdence region is:




p
p
S c = S R + S R ------------ F p,n− p,α = S R 1 + ------------ F p,n− p,α  



p
n –
p
n –
© 2002 By CRC Press LLC

292 293 294 295 296 297 298 299 300 301 302