Page 422 - Matrix Analysis & Applied Linear Algebra

P. 422

418 Chapter 5 Norms, Inner Products, and Orthogonality

Example 5.12.3
Filtering Noisy Data. The SVD can be a useful tool in applications involving
the need to sort through noisy data and lift out relevant information. Suppose
that A m×n is a matrix containing data that are contaminated with a certain
level of noise—e.g., the entries A might be digital samples of a noisy video or
audio signal such as that in Example 5.8.3 (p. 359). The SVD resolves the data
in A into r mutually orthogonal components by writing

r r
0
D r×r T T
A = U V = σ i u i v = σ i Z i , (5.12.11)
i
0 0
i=1 i=1
where Z i = u i v T and σ 1 ≥ σ 2 ≥· · · ≥ σ r > 0. The matrices {Z 1 , Z 2 ,..., Z r }
i
constitute an orthonormal set because

T 0if i = j,
Z i Z j = trace Z Z j =
i
1if i = j.
In other words, the SVD (5.12.11) can be regarded as a Fourier expansion as
described on p. 299 and, consequently, σ i = Z i A can be interpreted as the
proportion of A lying in the “direction” of Z i . In many applications the noise
contamination in A is random (or nondirectional) in the sense that the noise
is distributed more or less uniformly across the Z i ’s. That is, there is about as
much noise in the “direction” of one Z i as there is in the “direction” of any
other. Consequently, we expect each term σ i Z i to contain approximately the
same level of noise. This means that if SNR(σ i Z i ) denotes the signal-to-noise
ratio in σ i Z i , then

SNR(σ 1 Z 1 ) ≥ SNR(σ 2 Z 2 ) ≥ ··· ≥ SNR(σ r Z r ),

more or less. If some of the singular values, say, σ k+1 ,...,σ r , are small relative to
(total noise)/r, then the terms σ k+1 Z k+1 ,...,σ r Z r have small signal-to-noise
ratios. Therefore, if we delete these terms from (5.12.11), then we lose a small part
of the total signal, but we remove a disproportionately large component of the
k
total noise in A. This explains why a truncated SVD A k = σ i Z i can, in
i=1
many instances, ﬁlter out some of the noise without losing signiﬁcant information
about the signal in A. Determining the best value of k often requires empirical
techniques that vary from application to application, but looking for obvious
gaps between large and small singular values is usually a good place to start.
The next example presents an interesting application of this idea to building an
Internet search engine.

417 418 419 420 421 422 423 424 425 426 427