Page 127 - Video Coding for Mobile Communications Efficiency, Complexity, and Resilience
P. 127

104                             Chapter 4.  Basic  Motion  Estimation  Techniques


            noise.  In  particular,  the  use  of  the  2-D  DFT  instead  of  the  2-D  FT  results  in
            the following  e ects  [10]:

               •	The  boundary  e ect:  In  order  to  obtain  a  perfect  impulse,  the  transla-
                 tional displacement must be cyclic. In other words, objects disappearing
                 at one end of the moving area must reappear at the other end. In practice
                 this  does  not  happen,  which  leads  to  the  degeneration  of  the  impulse
                 into peaks. Furthermore, the DFT assumes periodicity in both directions.
                 In  practice,  however,  discontinuities  occur  from  left  to  right  and  from
                 top to bottom, introducing  spurious  peaks.
               •	Spectral leakage: In order to obtain a perfect impulse, the translational
                 displacement must correspond to an integer multiple of the fundamental
                 frequency.  In  practice,  noninteger  motion  vectors  may  not  satisfy  this
                 condition, leading to the well-known spectral leakage phenomenon [89],
                 which degenerates  the impulse  into peaks.
               •	Displacement  wrapping:  The  2-D  DFT  is  periodic  with  the  area  size
                 (N x ;N y  ). Negative estimates will be wrapped and will appear as positive
                 displacements.  To  accommodate  negative  displacements,  the  estimated
                 displacement  needs  to be unwrapped  as  follows [10]:
                              ˆ         ˆ
                           
                            d i      if  |d i  |≤   N i   and  N i  is even
                                              2
                           
                        ˆ
                                            ˆ
                       d i  =         or  if  |d i |≤   N i  −1   and  N i  is odd;   (4.27)
                                                2
                           
                           
                              ˆ
                             d i  − N i ;  otherwise:
                 This means that the range of estimates is limited to [  −N i   +1;  ] for N i
                                                                     N i
                                                               2
                                                                     2
                 even.
               The phase correlation motion estimation method was  rst reported by Kuglin
            and  Hines  in  1975  [90].  It  was  later  extensively  studied  by  Thomas  [91].  In
            his  study,  Thomas  analyzed  the  properties  of  the  phase  correlation  function.
            He suggested using a weighting function to smooth the correlation surface and
            suppress  spurious  peaks.  He  also  proposed  a  second  stage  to  the  method,  in
            which smaller moving areas are used and more than one dominant peak from
            the   rst  stage  are  considered  and  compared.  Girod  [92]  augmented  this  by  a
            third  stage,  in  which  the  estimated  integer-pel  motion  displacement  is  re ned
            to subpel  accuracy.
               The phase correlation method has a number of desirable properties. It has a
            small computational complexity, especially with the use of fast Fourier trans-
            forms  (FFTs).  In  addition,  it  is  relatively  insensitive  to  illumination  changes
            because shifts in the mean value or multiplication by a constant do not a ect
            the Fourier phase. Furthermore, the method can detect multiple moving objects,
   122   123   124   125   126   127   128   129   130   131   132