Page 145 -

P. 145

124 3 Image processing

where s(x, y) is the (unknown) image we are trying to recover, n(x, y) is the additive noise
signal, and o(x, y) is the observed noisy image. Because of the linearity of the Fourier trans-
form, we can write
O(ω x ,ω y )= S(ω x ,ω y )+ N(ω x ,ω y ), (3.67)

where each quantity in the above equation is the Fourier transform of the corresponding
image.
At each frequency (ω x ,ω y ), we know from our image spectrum that the unknown trans-
form component S(ω x ,ω y ) has a prior distribution which is a zero-mean Gaussian with vari-
ance P s (ω x ,ω y ). We also have noisy measurement O(ω x ,ω y ) whose variance is P n (ω x ,ω y ),
i.e., the power spectrum of the noise, which is usually assumed to be constant (white),
2
P n (ω x ,ω y )= σ .
n
According to Bayes’ Rule (Appendix B.4), the posterior estimate of S can be written as
p(O|S)p(S)
p(S|O)= , (3.68)
p(O)

where p(O)= p(O|S)p(S) is a normalizing constant used to make the p(S|O) distribution
S
proper (integrate to 1). The prior distribution p(S) is given by
(S−μ) 2
p(S)= e − 2P s , (3.69)
where μ is the expected mean at that frequency (0 everywhere except at the origin) and the
measurement distribution P(O|S) is given by

(S−O) 2
p(S)= e − 2P n . (3.70)
Taking the negative logarithm of both sides of (3.68) and setting μ =0 for simplicity, we
get

− log p(S|O)= − log p(O|S) − log p(S)+ C (3.71)
2
2
= 1 / 2P n −1 (S − O) + / 2P s −1 S + C, (3.72)
1
which is the negative posterior log likelihood. The minimum of this quantity is easy to
compute,
P n −1 P s 1
S opt = O = O = O. (3.73)
−1 −1
P n + P s P s + P n 1+ P n /P s
The quantity
1
W(ω x ,ω y )= (3.74)
2
1+ σ /P s (ω x ,ω y )
n
is the Fourier transform of the optimum Wiener ﬁlter needed to remove the noise from an
image whose power spectrum is P s (ω x ,ω y ).
Notice that this ﬁlter has the right qualitative properties, i.e., for low frequencies where
2
P s σ , it has unit gain, whereas for high frequencies, it attenuates the noise by a factor
n
2
P s /σ . Figure 3.25 shows the one-dimensional transform W(f) and the corresponding ﬁlter
n
kernel w(x) for the commonly assumed case of P(f)= f −2 (Field 1987). Exercise 3.16 has
you compare the Wiener ﬁlter as a denoising algorithm to hand-tuned Gaussian smoothing.

140 141 142 143 144 145 146 147 148 149 150