Page 159 - Video Coding for Mobile Communications Efficiency, Complexity, and Resilience
P. 159

136                      Chapter 5.  Warping-Based  Motion  Estimation  Techniques

                At  the  start  of  the  node-tracking  algorithm,  the  BMA  described  earlier
                was  used  to  provide  initial  estimates  of  the  inner  nodal  motion  vectors.
                Those initial estimates were then re(ned using the iterative procedure of
                Sullivan  and  Baker  [108].  In  each  iteration  of  this  procedure,  the  nodes
                are processed sequentially, where the motion vector of a node is re(ned
                using  a  local  search  around  the  motion  vector  from  the  previous  itera-
                tion  while  holding  constant  the  motion  vectors  of  its  surrounding  eight
                nodes. During this local search, the quality of a candidate motion vector is
                measured  by  calculating  the  distortion  measure  between  all  four  patches
                connected  to  the  node  and  their  warped  predictions  from  the  reference
                frame. The local search is applied to a node only if its motion vector, or
                the motion vector of at least one of its surrounding nodes, was changed
                in  the  previous  iteration.  The  local  search  used  here  examines  the  eight
                nearest  candidate  displacements  centered  around  the  displacement  from
                the  previous  iteration.  For  each  frame,  10iterations  were  used  to  re(ne
                the nodal motion vectors.

                During motion estimation and compensation, the bilinear spatial transfor-
                mation  is  employed.  This  is  implemented  in  the  CGI  [108]  form  (de-
                scribed  in  Section  5.2.7),  where  the  motion  vector  used  to  compensate
                a  pel  within  a  patch  is  bilinearly  interpolated,  Equation  (5.8),  from  the
                four nodal motion vectors at the vertices of the patch.
               In BMA-HO and WBA algorithms, bilinear interpolation was used to obtain
            intensity values at subpel locations of the reference frame. In each algorithm,
            motion was estimated and compensated using original reference frames. Motion
            vectors were coded using the median predictor and the VLC table of the H.263
            standard. The DFD signal was also transform encoded according to the H.263
            standard and a quantization parameter of QP = 10. All quoted results refer to
            the luma components of sequences.
               Table 5.1 compares the objective prediction quality of the preceding three
            algorithms  when  applied  to  the  three  test  sequences  with  a  frame  skip  of  3.
            The WBA outperforms the basic BMA by about 0.16 –1:57 dB, depending on
            the sequence. However, the WBA fails to outperform the advanced BMA-HO


               Table 5.1: Comparison between BMA and WBA in terms of objective prediction quality
                                      Average PSNR (dB) with a frame skip of 3
                              AKIYO            FOREMAN           TABLE  TENNIS
            BMA                39.88             27.81               29.06
            WBA                41.45             29.09               29.22
            BMA-HO             41.77             29.51               29.87
   154   155   156   157   158   159   160   161   162   163   164