Page 133 - Matrix Analysis & Applied Linear Algebra

P. 133

3.8 Inverses of Sums and Sensitivity127

essentially the same conclusion from (3.8.4) when only a single entry is perturbed
and from Exercise 3.8.2 when a single column is perturbed.
This discussion resolves, at least in part, an issue raised in §1.6—namely,
“What mechanism determines the extent to which a nonsingular system Ax = b
is ill-conditioned?” To see how, an aggregate measure of the magnitude of the
entries in A is needed, and one common measure is

A = max |a ij | = the maximum absolute row sum. (3.8.7)
i
j
This is one example of a matrix norm, a detailed discussion of which is given in
§5.1. Theoretical properties speciﬁc to (3.8.7) are developed on pp. 280 and 283,
and one property established there is the fact that XY ≤ X Y for all
conformable matrices X and Y. But let’s keep things on an intuitive level for
the time being and defer the details. Using the norm (3.8.7), the approximation
(3.8.6) insures that if B is suﬃciently small, then

$ −1 $ $ −1 $ $ $ $ $
$
$
$ A − (A + B) −1 $ ≈ A BA −1 $ ≤ A −1 $ B A −1 $ ,
$
so, if we interpret x < y to mean that x is bounded above by something not
∼
far from y, we can write
$ −1 $ % &
$ A − (A + B) −1$ $ $ $ $ B
< $ A −1 $ B = A −1 $ A .
$
A −1 ∼ A
The term on the left is the relative change in the inverse, and B / A is the
$ $
relative change in A. The number κ = A −1$ A is therefore the “magniﬁ-
$
cation factor” that dictates how much the relative change in A is magniﬁed.
This magniﬁcation factor κ is called a condition number for A. In other
words, if κ is small relative to 1 (i.e., if A is well conditioned), then a small
relative change (or error) in A cannot produce a large relative change (or error)
in the inverse, but if κ is large (i.e., if A is ill conditioned), then a small rela-
tive change (or error) in A can possibly (but not necessarily) result in a large
relative change (or error) in the inverse.
The situation for linear systems is similar. If the coeﬃcients in a nonsingular
system Ax = b are slightly perturbed to produce the system (A + B)˜ x = b,
then x = A −1 b and ˜ x =(A + B) −1 b so that (3.8.6) implies

x − ˜ x = A −1 b − (A + B) −1 b ≈ A −1 b − A −1 − A −1 BA −1 b = A −1 Bx.
For column vectors, (3.8.7) reduces to x = max i |x i |, and we have

$ $
x − ˜ x < $ A −1 $ B x ,
∼

128 129 130 131 132 133 134 135 136 137 138