Page 356 - Statistics for Environmental Engineers


[Figure: plot of y against x (x from 0 to 12), with fitted line y = 21.04 + 0.12x]

FIGURE 41.1 The original data from a suspicious laboratory experiment.

[Figure: plot of y against x (x from 0 to 12), with fitted line y = 20.06 + 0.43x]

FIGURE 41.2 Data obtained from a repeated experiment with randomization to eliminate autocorrelation.

One might be tempted to blame the peculiar result entirely on the low value measured at x = 6, but
the experimenters did not leap to conclusions. Discussion of the experimental procedure revealed that
the tests were done starting with x = 0 first, then with x = 1, etc., up through x = 10. The measurements
of y were also done in order of increasing concentration. It was also discovered that the injection port
of the instrument used to measure y might not have been thoroughly cleaned between runs. The
students knew about randomization, but time was short and they could complete the experiment faster
by not randomizing. The penalty was autocorrelation and a wasted experiment.
They were asked to repeat the experiment, this time randomizing the order of the runs and the order of
analyzing the specimens, and taking more care to clean the injection port. The resulting data are shown
in Figure 41.2. The regression equation is ŷ = 20.06 + 0.43x, with R² = 0.68. The confidence interval of
the slope is 0.21 to 0.65. This interval includes the expected slope of 0.5 and excludes zero, which shows
that x and y are related.
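
The reported numbers can be checked against one another with a few lines of arithmetic. The sketch below uses only the slope, interval, and R² quoted above. It assumes, which the text does not state, that there was one run at each of x = 0, 1, ..., 10 (n = 11) and that the interval is a 95% interval, so the t value for 9 degrees of freedom is 2.262.

```python
import math

# Values reported in the text for the repeated (randomized) experiment.
slope = 0.43
ci_low, ci_high = 0.21, 0.65
r_squared = 0.68

# Assumptions (not stated in the text): one run at each x = 0, 1, ..., 10,
# so n = 11, and a 95% interval, so t(0.025, 9 df) = 2.262.
n = 11
t_crit = 2.262

# Half-width of the interval = t_crit * (standard error of the slope).
half_width = (ci_high - ci_low) / 2
std_err = half_width / t_crit

# t statistic for the slope, computed two independent ways.
t_from_ci = slope / std_err
t_from_r2 = math.sqrt(r_squared * (n - 2) / (1 - r_squared))

print(f"implied standard error of slope: {std_err:.4f}")
print(f"t statistic from the interval:   {t_from_ci:.2f}")
print(f"t statistic from R-squared:      {t_from_r2:.2f}")
```

Under these assumptions the two t statistics agree to within the rounding of the reported values, so the slope, its interval, and R² are mutually consistent.
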
Can the dramatic difference in the outcome of the first and second experiments possibly be due to the
presence of autocorrelation in the experimental data? It is both possible and likely, in view of the lack
of randomization in the order of running the tests.
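
Randomizing the order of the runs and the order of analysis takes only a moment. The following is a minimal sketch, assuming eleven levels x = 0, 1, ..., 10 as in the experiment described above. The function name and the seed are hypothetical, chosen only for illustration.

```python
import numpy as np

def randomized_orders(levels, seed=None):
    """Return independent random orders for running and for analyzing.

    `levels` holds the x settings; the two orders are shuffled separately
    so that the analysis sequence is not tied to the run sequence.
    """
    rng = np.random.default_rng(seed)
    run_order = rng.permutation(levels)
    analysis_order = rng.permutation(levels)
    return run_order, analysis_order

x_levels = np.arange(11)  # x = 0, 1, ..., 10
run_order, analysis_order = randomized_orders(x_levels, seed=41)  # arbitrary seed

print("run order:     ", run_order)
print("analysis order:", analysis_order)
```

Recording the seed lets the schedule be reproduced later when the data are reviewed.
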

The Consequences of Autocorrelation on Regression

An important part of doing regression is obtaining a valid statement about the precision of the estimates.
Unfortunately, autocorrelation acts to destroy our ability to make such statements. If the error terms are
positively autocorrelated, the usual confidence intervals and tests using t and F distributions are no longer
strictly applicable because the variance estimates are distorted (Neter et al., 1983).
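
A small simulation can illustrate the distortion. The sketch below is not the authors' analysis. It assumes first-order autoregressive errors, e[t] = rho * e[t-1] + a[t], with rho = 0.7, a true line y = 20 + 0.5x, eleven runs at x = 0, 1, ..., 10, and 2000 replicates, all chosen only for illustration. It counts how often the usual 95% interval for the slope contains the true slope.

```python
import numpy as np

T_CRIT = 2.262  # t(0.025) with n - 2 = 9 degrees of freedom

def slope_ci_covers(x, y, true_slope):
    """Fit y = b0 + b1*x by least squares; report whether the usual
    95% interval for the slope contains true_slope."""
    n = len(x)
    sxx = np.sum((x - x.mean()) ** 2)
    b1 = np.sum((x - x.mean()) * (y - y.mean())) / sxx
    b0 = y.mean() - b1 * x.mean()
    resid = y - (b0 + b1 * x)
    s2 = np.sum(resid ** 2) / (n - 2)   # residual variance estimate
    se_b1 = np.sqrt(s2 / sxx)           # usual standard error of the slope
    return abs(b1 - true_slope) <= T_CRIT * se_b1

def coverage(rho, n_sim=2000, seed=0):
    """Fraction of simulated experiments whose interval covers the true slope
    when the errors follow an assumed AR(1) model with coefficient rho."""
    rng = np.random.default_rng(seed)
    x = np.arange(11.0)                 # runs made in the order x = 0..10
    hits = 0
    for _ in range(n_sim):
        shocks = rng.normal(size=x.size)
        e = np.empty_like(shocks)
        e[0] = shocks[0] / np.sqrt(1.0 - rho ** 2)  # stationary starting value
        for t in range(1, x.size):
            e[t] = rho * e[t - 1] + shocks[t]
        y = 20.0 + 0.5 * x + e
        hits += int(slope_ci_covers(x, y, true_slope=0.5))
    return hits / n_sim

cov_independent = coverage(rho=0.0)
cov_autocorrelated = coverage(rho=0.7)
print(f"coverage with independent errors:    {cov_independent:.3f}")
print(f"coverage with autocorrelated errors: {cov_autocorrelated:.3f}")
```

With independent errors the coverage should be close to the nominal 0.95. With positively autocorrelated errors it falls noticeably below that, because the usual standard error understates the true variability of the slope.
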
