Page 87 - Video Coding for Mobile Communications Efficiency, Complexity, and Resilience
P. 87

64                                      Chapter 3.  Video Coding:  Standards


            frame. In general, it is advisable to use the Improved PB-frames mode instead
            of  the original  PB-frames mode.

            3.4.6.8  Reference Picture Selection Mode (Annex N)

            In  normal  operation,  a  picture  is  temporally  predicted  from  the  most  recently
            decoded  picture.  The  reference  picture  section  (RPS)  mode,  however,  allows
            temporal  prediction  from  pictures  other  than  the  most  recently  decoded  one.
            Thus,  in  this  mode,  both  the  encoder  and  the  decoder  use  more  than  one
            picture  memory.  As  discussed  in  Chapter  6,  this  method  belongs  to  a  class
            of  motion  estimation  and  compensation  techniques  called  multiple-reference
            motion-compensated  prediction.  The  information  to  signal  which  picture  is
            selected  for  prediction  is  included  by  the  encoder  in  the  encoded  bitstream.
            However, the strategy used by the encoder to select this picture is not subject
            for standardization.
               This mode can be used to improve the performance of video communication
            over error-prone channels. In normal operation, if part of the reference picture
            is lost due, for example, to a transmission error, then this error will propagate
            to and severely degrade the quality of future pictures. In this mode, however,
            the  encoder  may  switch  to  another  reference  picture  to  suppress  the  temporal
            error propagation  due to interframe  coding.
               In  order  to  utilize  this  mode,  the  encoder  needs  to  have  some  knowledge
            about the conditions of the channel and the outcome of the decoding process
            (e.g.,  which  parts  of  the  reference  picture  have  been  decoded  in  error).  One
            way  to  achieve  this  is  to  utilize  a  backward  (feedback)  channel.  This  mode
            has two back-channel mode switches that de/ne whether a backward channel
            is used and what kind of messages are returned on that backward channel from
            the decoder. Together, the two switches de/ne four basic methods of operation:
            NEITHER  (no  backward  messages),  ACK  (acknowledgment  messages  only),
            NACK  (negative  acknowledgment  messages  only),  and  ACK+NACK  (both
            acknowledgment and negative acknowledgment messages). There are also two
            methods of operation in terms of the channel for backward channel messages.
            The  /rst  method  is  the  Separate  Logical  Channel  mode,  where  back-channel
            data  is  delivered  through  a  separate  logical  channel  in  the  multiplex  layer  of
            the  system,  whereas  the  second  method  is  the  VideoMux  mode,  where  back-
            channel data for received video is delivered within the forward video data of
            a video stream of  encoded data.

            3.4.6.9  Temporal, SNR, and Spatial Scalability Mode (Annex O)
            Scalability  implies  that  a  bitstream  is  composed  of  a  base  layer  and  one
            or  more  associated  enhancement  layers.  The  base  layer  is  separately  decod-
            able.  The  enhancement  layers  can  be  decoded  in  conjunction  with  the  base
   82   83   84   85   86   87   88   89   90   91   92