PAPER Special Section on Image Media Quality
Projection Based Adaptive Window Size Selection for E fficient Motion Estimation in H.264 /AVC
Anand PAUL †a) , Jhing-Fa WANG †† , Nonmembers, Jia-Ching WANG † , Member, An-Chao TSAI † , and Jang-Ting CHEN † , Nonmembers
SUMMARY This paper introduces a block based motion estimation al- gorithm based on projection with adaptive window size selection. The blocks cannot match well if their corresponding 1D projection does not match well, with this as foundation 2D block matching problem is trans- lated to a simpler 1D matching, which eliminates majority of potential pixel participation. This projection method is combined with adaptive window size selection in which, appropriate search window for each block is deter- mined on the basis of motion vectors and prediction errors obtained for the previous block, which makes this novel method several times faster than ex- haustive search with negligible performance degradation. Encoding QCIF size video by the proposed method results in reduction of computational complexity of motion estimation by roughly 45% and over all encoding by 23%, while maintaining image /video quality.
key words: block based motion estimation, 1D projection, adaptive win- dow size selection
1. Introduction
Recently established H.264 /AVC is the newest video coding standard. The main goals of the H.264/AVC standardization e ffort have been to enhance compression performance and provide a “network-friendly” video representation. In H.264 encoder, motion estimation is the major burden of complex- ity [1]. It is well known that motion search range is im- portant parameter to determine the coding e fficiency and the encoding computational cost. Most video coding standards, including MPEG-l/2/4, H.261, H.263, and H.264/AVC use block motion estimation and compensation for removing temporal redundancy. This is one of the most important part of a video encoder, and newer standards achieve better video quality at constant bitrate by allowing Subdivision of 16×16 pixel macroblocks (MBs) into smaller blocks. The encoder may then select whether to use large blocks and only a few motion vectors (MVs), or more accurate motion estimation (ME) with smaller blocks but with more motion vectors to transmit. For example, the newest and most e fficient cod- ing standard H.264/AVC allows subdividing the MBs into 16 × 8, 8 × 16, or 8 × 8 pixel blocks, and when the small- est size is chosen, the block may be further subdivided in a treelike fashion into 4 × 8, 8 × 4, or 4 × 4 pixel blocks
Manuscript received December 7, 2005.
Manuscript revised April 28, 2006.
Final manuscript received July 31, 2006.
†
The authors are with Multimedia and Communication IC Lab, EE Department, NCKU, Tainan-701, Taiwan, Republic of China.
††