C OMPARISON WITH R ELATED W ORKS

CHAPTER 5 EXPERIMENT RESULT

5.2 C OMPARISON WITH R ELATED W ORKS

Table 5.2 lists the comparison with related works about motion compensation. We only focus on memory bandwidth reduction and interpolator design comparison. This is because memory bandwidth always is bottleneck of motion compensation and interpolator is key module in motion compensation. For another reason, each related works support different specification. We can see our memory bandwidth optimization is better than previous works although our storage is not least. However, our storage size is after trade-off and can get better performance. In terms of interpolator, [10] and [11] use hardware sharing to operate twice to achieve area efficiency. Even though these hardware sharing is suitable for Baseline Profile, but the poor throughput is not meet real-time decode in Main/High Profile. Moreover, our interpolator gate count is very close to these previous work [10] [11] and provide enough throughput performance in Main/High Profile.

Table 5.2 H.264decoder comparison with related work

ISCAS

Interpolator 20,686 15,000 13,027 21,506 11,823 13,201

total 43k 61k 32k 47k N/A 68k

Chapter 6 Conclusion and Future Work

6.1 Conclusion

Motion compensation engine consists of three parts: motion vector generator, interpolator, and weighted predictor. Firstly, motion vector generator needs to support many tools in Main/High Profile. The challenge of motion vector generator is high complexity. We use hardware sharing to deal with double motion vectors, use coordinate mapping method to process direct modes, and merge MBAFF mode LUT and non-MBAFF mode LUT effectively to reduce the complexity. The design of interpolator, 4-parallel separate 1-D architecture gives the most space on high throughput compared with other proposed architectures. Hence, our interpolator is suitable for B slice and our restructured design can significantly reduce area cost. Lastly, weighted predictor located on last stage of motion compensation engine, we use LUT to deal with complicated implicit mode and collocate with interpolator in order to execute operation only occupies one cycle.

The design target of memory bandwidth reduction is to reduce external memory access and improve throughput of motion compensation engine. The proposed reduction strategies of memory bandwidth for motion compensation need 319 pixel storages is after trade-off and own better performance than other works. After applying these strategies, the memory bandwidth requirement can save the required bandwidth about 71~80 %. Moreover, achieve efficient memory access scheduling.

6.2 Future Work

The proposed motion compensator for H.264/AVC standard only supports up to Main/High Profile. If we want to support H.264/SVC/MVC, there are many issues should be taken into account. For example, hierarchical B pictures [18] [19]. In addition, a successor to H.264/AVC, High Efficiency Video Coding (HEVC) [20], is a proposed video compression standard, currently under development. If we want to support HEVC, the subjects such as extended macroblock size (EMS), decoder-side motion vector derivation (DMVD), 2-D non-separable adaptive interpolation filter (AIF), separable AIF, Direction AIF, Competition-based scheme for motion vector selection and coding, and so on tools should be taken into account for a next generation motion compensator.

In terms of memory bandwidth, our proposed mechanism can effectively reduce bandwidth requirement. However, there only focus on one single module in system view.

Hence, there are still many important issues should be considered in order to provide bandwidth reduction in the viewpoint of overall system. For example, when embedded compressor/decompressor is disabled, a smarter SDRAM controller should be designed include scheduled memory accesses.

Bibliography

[1] Joint Draft ITU-T Rec. H.264 | ISO/IEC 14496-10 / Amd.3 Scalable video coding

[2] T. Wiegand, G. J. Sullivan, G. Bjntegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Trans. Circuits Syst. Video Technol., Vol. 13, no 7, pp.

560- 576, July 2003.

[3] T. C. Chen, Y. W. Huang, and L. G. Chen, “Fully utilized and reusable architecture for fractional motion estimation of H.264/AVC,” IEEE International Conference of Acoustics, Speech, and Signal Processing 2004, vol. 5, pp.V-9-12, May 2004

[4] C. D. Chien, H. C. Chen, L. C. Huang, and J. I. Guo, “A Low-power motion compensation IP core design for MPEG-1/2/4 video decoding,” IEEE International Symposium of Circuits and Systems, 2005, Vol. 5, pp.4542-4545, May 2005

[5] Digital Video Broadcasting - Wikipedia, the free encyclopedia Available from:

<http://en.wikipedia.org/wiki/Digital_Video_Broadcasting>

[6] Iain E. G. Richardson, “H.264 and MPEG-4 VIDEO COMPRESSION”, WILEY, 2003.

[7] Joint Video Team (JVT) reference software JM 8.2

[8] S. Z. Wang, “A Flexible Motion Compensation Memory Organization for Dual-standard Video Decoder”, National Chiao-Tung University Taiwan, Master Thesis, June 2004.

[9] S. Wuytack, J. P. Diguet, and F. V. M. Catthoor, “Formalized methodology for data reuse exploration for low-power hierarchical memory mappings,” IEEE Trans. VLSI Syst., Vol. 6, no. 4, pp. 529-537, Dec. 1998.

[10] Y. Li, Y. Qu, and Y. He, “Memory Cache Based Motion Compensation Architecture for HDTV H.264/AVC Decoder ”, IEEE International Symposium on Circuits and Systems, pp. 2906-2909, May 2007

[11] D. Y. Shen, T. H. Tsai,“A 4X4-block level pipeline and bandwidth optimized motion compensation hardware design for H.264/AVC decoder”, IEEE International Conference on Multimedia and Expo 2009, pp.1106-1109, Jul. 2009.

[12] A. Azevedo, B. Zatt, L. Agostini, S. Bampi, “Motion Compensation Decoder Architecture for H.264/AVC Main Profile Targeting HDTV”, International Conference on Very Large Scale Integration 2006, pp.52-57, Oct. 2006.

[13] B. Zatt, A. Susin, S. Bampi, L. Agostini, “HP422-MoCHA: A H.264/AVC High Profile Motion Compensation Architecture for HDTV”, IEEE International Symposium on Circuits and Systems 2008, pp.25-28, May 2008.

[14] C. Y. Tsai, T. C. Chen; T. W. Chen; L. G. Chen, “Bandwidth optimized motion compensation hardware design for H.264/AVC HDTV decoder”, Circuits and Systems, 2005. 48th Midwest Symposium, Vol. 2, pp.1199-1202, Aug. 2005.

[15] S. Z. Wang, T. A. Lin, T. M. Liu, and C. Y. Lee, “A new motion compensation design for H.264/AVC decoder,” IEEE International Symposium on Circuits and Systems 2005, Vol. 5, pp.4558–4561, May 2005

[16] J. W. Chen, C. C. Lin, J. I. Guo, J. S. Wang, “Low Complexity Architecture Design of H.264 Predictive Pixel Compensator for HDTV Application”, IEEE International Conference Acoustics, Speech and Signal Processing 2006. (ICASSP), Vol. 3, pp. III - III, May 2006.

[17] Joint Video Team (JVT) reference software JM 17.0

[18] H. Schwarz, D. Marpe, and T. Wiegand, “ANALYSIS OF HIERARCHICAL B PICTURES AND MCTF”, IEEE International Conference on Multimedia and Expo 2006, pp.1929-1932, Jul. 2006

[19] M. Winken, H. Schwarz, D. Marpe, and T. Wiegand, “JOINT OPTIMIZATION OF TRANSFORM COEFFICIENTS FOR HIERARCHICAL B PICTURE CODING IN H.264/AVC”, IEEE International Conference on Image Processing 2007 (ICIP), Vol. 4, pp.IV-89-IV-92, Sep. 2007

[20] .High Efficiency Video Coding - Wikipedia, the free encyclopedia Available from:

<http://en.wikipedia.org/wiki/HEVC>

[21] L. Yu, J. Li, Y. Zhang, “Fast Picture and Macroblock Level Adaptive Frame/Field Coding for H.264”, IEEE International Conference of Acoustics, Speech, and Signal Processing 2006, pp. 768-771, Dec. 2006

Vita

姓名：陳浩民

出生地：台灣省彰化市出生日期：1977.11.18

學歷：彰化縣立南興國民小學彰化縣立彰安國民中學

國立虎尾科技大學電機工程科

明新科技大學電機工程系

國立交通大學電機學院 (電子與光電學程) 碩士班

工作經歷：太和科技服份有限公司研發一處工程師

研發處專案一部高級工程師

研發處硬碟陣列部科長

研發處硬碟陣列部經理

在文檔中適用於H.264/AVC之降低記憶體頻寬的動作補償 (頁 72-0)