next up previous contents
Next: H.263 Up: The ITU H.26x Family Previous: The ITU H.26x Family

H.261

For H.261 only two input video resolutions are supported: the Common Intermediate Format (CIF)  at $352 \times 288$ pels for the luminance, and the Quarter CIF (QCIF) at $176 \times 144$ pels. The chrominance values are sampled according to the 4:2:0 sampling, see Figure 2.1. The maximum frame rate is 29.97 fps, however in most applications frames are dropped before encoding and transmission to fit the available bandwidth.

The images are segmented in macroblocks, which comprise six $8 \times 8$ pel blocks, four for Y values and one for each of their respective Cr, Cb values. There are two types of frames in H.261. The first frame and specific frames in the sequence inserted for stability and resynchronisation, are encoded independent of their adjacent frames, i.e. intra-coded. All the other frames are inter-coded, that is, they are predicted using motion compensation from the previous frame.

Intra coding is DCT transformation of the $8 \times 8$ blocks of luminance and chrominance values, quantisation, and run-length encoding of the coefficients. An inverse quantisation and decoding procedure is also performed, in order to provide a local copy of the non-identical reconstructed image. The latter is used as a reference for prediction, rather than the original input. Therefore, for error-free transmission, the same prediction will be formed to the decoder and the encoder.

Inter coding applies block matching to the $16 \times 16$ luminance values of each macroblock, and determines the respective motion vectors, which are differentially encoded and transmitted. The motion compensation procedure is performed locally in order to calculate the prediction error, which, if above a threshold, is DCT encoded and transmitted. If the prediction error is very high, the motion vectors are ignored, and the corresponding macroblock is intra-coded. As in intra-frames, the new reference for prediction is the locally reconstructed motion compensated frame, which is stored in a buffer for this task. Inter coding achieves much higher compression efficiency than intra coding, in other words, the quality of inter-coded pictures is much better than that of intra-coded, for the same number of bits.

The resulting bitstream has a hierarchical structure. Blocks of variable length coded coefficients are collected together to form macroblocks, which are collected to form Group of Blocks (GOB), and a complete frame is made up of several GOB, see also Figure 2.5.


  
Figure 2.5: The hierarchy of structures for a H.263 CIF frame.
\begin{figure}
\centering\epsfig{file=h263bit.eps,width=5in}\end{figure}

It is worth mentioning that the output bitrate of H.261 is variable, depending highly on the amount of motion in the encoded scene. As H.261 is using ISDN connections, which are constant bit rate, there is a rate control mechanism used, which changes the quantisation parameters when buffer overflows due to increased scene activity are imminent [RR96].


next up previous contents
Next: H.263 Up: The ITU H.26x Family Previous: The ITU H.26x Family
Isaac Kokkinidis
1998-08-27
Hosted by www.Geocities.ws

1