Page 133 - Matrix Analysis & Applied Linear Algebra
P. 133

3.8 Inverses of Sums and Sensitivity127

                                    essentially the same conclusion from (3.8.4) when only a single entry is perturbed
                                    and from Exercise 3.8.2 when a single column is perturbed.
                                        This discussion resolves, at least in part, an issue raised in §1.6—namely,
                                    “What mechanism determines the extent to which a nonsingular system Ax = b
                                    is ill-conditioned?” To see how, an aggregate measure of the magnitude of the
                                    entries in A is needed, and one common measure is


                                            A  = max     |a ij | = the maximum absolute row sum.   (3.8.7)
                                                   i
                                                       j
                                    This is one example of a matrix norm, a detailed discussion of which is given in
                                    §5.1. Theoretical properties specific to (3.8.7) are developed on pp. 280 and 283,
                                    and one property established there is the fact that  XY ≤ X  Y  for all
                                    conformable matrices X and Y. But let’s keep things on an intuitive level for
                                    the time being and defer the details. Using the norm (3.8.7), the approximation
                                    (3.8.6) insures that if  B  is sufficiently small, then

                                            $  −1            $   $  −1     $   $    $     $    $
                                                                                          $
                                                                 $
                                            $ A   − (A + B)  −1 $  ≈ A  BA  −1 $  ≤ A  −1 $   B  A  −1 $  ,
                                                                               $
                                    so, if we interpret x  <  y to mean that x is bounded above by something not
                                                      ∼
                                    far from y, we can write
                                            $  −1            $                           %    &
                                            $ A   − (A + B) −1$  $    $       $    $       B
                                                               < $ A  −1 $   B  = A  −1 $  A    .
                                                                              $
                                                   A −1        ∼                           A
                                    The term on the left is the relative change in the inverse, and  B  /  A  is the
                                                                        $    $
                                    relative change in A. The number κ = A −1$   A  is therefore the “magnifi-
                                                                        $
                                    cation factor” that dictates how much the relative change in A is magnified.
                                    This magnification factor κ is called a condition number for A. In other
                                    words, if κ is small relative to 1 (i.e., if A is well conditioned), then a small
                                    relative change (or error) in A cannot produce a large relative change (or error)
                                    in the inverse, but if κ is large (i.e., if A is ill conditioned), then a small rela-
                                    tive change (or error) in A can possibly (but not necessarily) result in a large
                                    relative change (or error) in the inverse.
                                        The situation for linear systems is similar. If the coefficients in a nonsingular
                                    system Ax = b are slightly perturbed to produce the system (A + B)˜ x = b,
                                    then x = A −1 b and ˜ x =(A + B) −1 b so that (3.8.6) implies


                                      x − ˜ x = A −1 b − (A + B) −1 b ≈ A −1 b − A −1  − A −1 BA −1  	  b = A −1 Bx.
                                    For column vectors, (3.8.7) reduces to  x  = max i |x i |, and we have

                                                                    $    $
                                                           x − ˜ x   < $ A  −1 $   B  x  ,
                                                                  ∼
   128   129   130   131   132   133   134   135   136   137   138