FIGURE 10.5: Line fitting with a squared error is extremely sensitive to outliers, both in
x and y coordinates. We show an example using least squares. At the top left, a good
least-squares fit of a line to a set of points. Top right shows the same set of points,
but with the x coordinate of one point corrupted; this means that the point has been
translated horizontally from where it should be. As a result, it contributes an enormous
error term to the true line, and a better least-squares fit is obtained by making a significant
change in the line’s orientation. Although this makes the errors at most points larger, it
reduces the very large error at the outlier. Bottom left shows the same set of points, but
with the y coordinate of one point corrupted. In this particular case, the x intercept has
changed. These three figures are on the same set of axes for comparison, but this choice
of axes does not clearly show how bad the fit is for the third case. Bottom right shows
a detail of this case, in which the line is clearly a bad fit.
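The sensitivity described in this caption is easy to reproduce numerically. The sketch below is a minimal illustration, not code from the book: the line parameters, noise level, and size of the corruption are invented for the example. It fits a least-squares line to a clean point set, then refits after corrupting a single y coordinate; the recovered parameters change substantially.

```python
import numpy as np

rng = np.random.default_rng(0)

# Points near the line y = 0.5 * x - 2, with mild Gaussian noise.
x = np.linspace(-14, 6, 30)
y = 0.5 * x - 2 + rng.normal(scale=0.3, size=x.size)

def fit_line(x, y):
    """Least-squares fit of y = a*x + b; returns (a, b)."""
    A = np.column_stack([x, np.ones_like(x)])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

print("clean fit:   a=%.3f  b=%.3f" % tuple(fit_line(x, y)))

# Corrupt the y coordinate of a single point, as in the bottom-left panel.
y_bad = y.copy()
y_bad[0] += 40.0
print("one outlier: a=%.3f  b=%.3f" % tuple(fit_line(x, y_bad)))
```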
10.4.1 M-Estimators
An M-estimator estimates parameters by replacing the squared error term with a
term that is better behaved. This means we minimize an expression of the form
$$\sum_i \rho(r_i(x_i, \theta); \sigma),$$

where $\theta$ are the parameters of the model being fitted (for example, in the case of the line, we might have the orientation and the $y$ intercept), and $r_i(x_i, \theta)$ is the residual error of the model on the $i$th data point. Using this notation, our least
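As a concrete illustration of minimizing this sum, the following sketch fits a line by minimizing $\sum_i \rho(r_i(x_i, \theta); \sigma)$ with vertical residuals $r_i = y_i - (a x_i + b)$. The choice of the Huber function for $\rho$, the helper names, and the use of SciPy's general-purpose minimizer are all assumptions made for the example, not the book's prescription.

```python
import numpy as np
from scipy.optimize import minimize

def rho_huber(r, sigma):
    """Huber rho: quadratic for small residuals, linear for large ones.
    One common choice of a 'better behaved' error term (an assumption here)."""
    a = np.abs(r)
    return np.where(a <= sigma, 0.5 * r**2, sigma * (a - 0.5 * sigma))

def m_estimate_line(x, y, sigma=1.0):
    """Minimize sum_i rho(r_i(x_i, theta); sigma) for the line y = a*x + b,
    with residuals r_i = y_i - (a*x_i + b)."""
    def objective(theta):
        a, b = theta
        r = y - (a * x + b)
        return rho_huber(r, sigma).sum()
    # Initialize from the ordinary least-squares fit.
    A = np.column_stack([x, np.ones_like(x)])
    theta0, *_ = np.linalg.lstsq(A, y, rcond=None)
    return minimize(objective, theta0).x
```

With $\sigma$ set near the inlier noise scale, a single corrupted point like those in Figure 10.5 should have far less influence on the recovered line than it does under least squares.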