• 沒有找到結果。

M EASUREMENT R ESULTS AND C OMPARISON

CHAPTER 5 CHIP IMPLEMENTATION FOR DIGITAL TV APPLICATIONS

5.4 M EASUREMENT R ESULTS AND C OMPARISON

Table 5.4 shows the power report of our work. The power consumption of decoding CIF, NTSC, and 720pHD MPEG-2 video sequences are 3.12mW, 11.15mW, and 30.15mW;

the power consumption of decoding CIF, NTSC, and 720pHD H.264 video sequences are 4.51mW, 16.39mW, and 44.35mW, respectively.

Table 5.4 Power report

Items (Core Power) MPEG-2’s power analysis H.264’s power analysis

720pHD (1280x720) 30.15mW@35.7MHz 44.35mW@56MHz

NTSC (648x486) 11.15mW@13.2MHz 16.39mW@20.7MHz

CIF (352x288) 3.12mW@3.7MHz 4.51mW@5.7MHz

Table 5.5 shows the comparisons to the State-Of-the-Art. It’s hard to find a pure ASIC decoder but RISC included or ARM-based works. Thus it’s hard to have a fair comparisons.

However, we can still see that our work is a good solution to dual mode H.264/MPEG-2 decoder.

Table 5.5 Comparisons

Proposed [1]-[4]

ISCAS’05 VLSI-TSA’05

C&S [11]

ISCAS’04

Conexant [18]

ISCE’04

NTU [19]

ISCAS’05

Specification 1280x720@30fps 1920x1088@30fps 2048x1024@30fps 2048x1024@30fps Operating

Frequency

56MHz 130MHz (local

bus:170MHz)

200MHz 120MHz

Technology 180nm (1.8V) 130nm (1.2V) 130nm (1.2V) 180nm (1.8V)

Profile H.264 baseline

MPEG-2 SP@ML

H.264 baseline MPEG-4 SP H.261,H.263,JPEG

H.264 main H.264 baseline

Implementation ASIC ARM-based ARM-based ASIC+RISC

Gate Count 491K 910K 300K 217K

Internal Memory

24K bytes N/A 74K bytes 10K bytes

Power 44.35mW 554mW 160mW N/A

Normalized power*

100.92mW 2422.88mW 691.89mW N/A

*Normalized to 180nm(1.8v), 2048x1024@30fps

Chapter 6

Conclusion and Future Work

6.1 Conclusion

In this work, we implemented a dual mode H.264/MPEG-2 video decoder. We adopt many design techniques both on system-point-of-view and architectures.

From the system point of view, first we proposed the hybrid 4x4-sub-block pipelining scheme, by which we can save 93.75% intermediate buffers compared with macroblock-level pipelining scheme at the penalty of slightly throughput degradation. The instantaneous switching scheme reduces the latency to minimum during pipeline stages.

Second, we proposed the efficient 1x4-column-by-column decoding ordering, by which the 28% memory access times and 28% processing cycles in motion compensation process can be saved. 17% memory access times can be saved as well in intra predictor. Third, we proposed a variable length FIFO architecture for the synchronization problems in adding pixels from residual/prediction paths. Forth, the exploration on coded-block-pattern technique saves power in inverse quantizer and IDCT modules from 30% to 86% under qP ranging from 20 to 48.

In architecture design, first we proposed a hierarchical syntax parser. The hierarchical syntax parser is easy to design and is very suitable for the bit-stream in hierarchical structure. With the hierarchical enable signals in these parsers, the power savings by clock-gating technique can be up to 86% in these parsers. Second, the register sharing technique is applied on syntax parsers for both systems. This technique reduces the amount of registers required for both system and 26% registers can be saved. Third, we implement

the Exp-Golomb decoder for parsing the H.264 bit-stream. The dedicated interface of this decoder enables this decoder to be shared for all parsers. Forth, 3 types of reusable buffers in intra predictor are proposed. By the aids of these reusable buffers (upper, left, and corner), implementation of the directional modes becomes very easy and the memory access times can be reduced also.

In our final chip implementation, the total gate-count is about 491K, maximum working frequency is 83.3MHz, supports real time decoding 720pHD H.264 sequence@56MHz and 720pHD MPEG-2 sequence @35.7MHz in 30fps. The power consumption for these 2 systems is 44.35mW (720p H.264 sequence) and 30.15mW (720p MPEG-2 sequence), respectively.

6.2 Future Work

In our future work, first, we will try to integrate and combine more functional blocks for both systems like IDCT, and inverse quantizer. For IDCT, we will take efforts on splitting the 8x8 IDCT formula into 2-stage 4x4 and 2x2 IDCT so that the 4x4 IDCT module for H.264 can be shared for the 8x8 IDCT operation of MPEG-2. Then, we will try to add the CABAC with other functional blocks to our current work to support H.264 main profile. We will also try to find the critical path in our work such that we can speed up our decoder to work under more than 120MHz to support real time decoding 1080i H.264 video sequence in 30fps.

Bibliography

[1] Ting-An Lin, Sheng-Zen Wang, Tsu-Ming Liu and Chen-Yi Lee, "An H.264/AVC Decoder with 4x4-block level pipeline", ISCAS 2005

[2] Ting-An Lin, Tsu-Ming Liu and Chen-Yi Lee, "A Low-Power H.264/AVC Decoder", VLSI-TSA 2005

[3] Sheng-Zen Wang, Ting-An Lin, Tsu-Ming Liu and Chen-Yi Lee, " A New Motion Compensation Design For H.264/AVC Decoder”, ISCAS 2005

[4] Tsu-Ming Liu, Wen-Ping Lee, Ting-An Lin and Chen-Yi Lee, "A Memory-Efficient Deblocking Filter For H.264/AVC Video Coding", ISCAS 2005

[5] Shih-Hao Wang, Wen-Hsiao Peng et al., “A Platform-Based MPEG-4 Advanced Video Coding (AVC) Decoder with Block Level Pipelining”, Information, Communications and Signal Processing, ICICS-PCM December 2003

[6] Tung-Chien Chen, Yu-Wen Huang, and Liang-Gee Chen, “Analysis and design of macroblock pipelining for H.264/AVC VLSI architecture”, ISCAS 2004

[7] Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG, “Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification”

ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC, May 2003

[8] Iain E. G. Richardson, “H.264 and MPEG-4 video compression”, John Willey & Sons, autumn 2003, ISBN 0-470-84837-5

[9] Ville Lappalainen, Antti Hallapuro, and Timo D. Hamalainen, “Complexity of Optimized H.26L Video Decoder Implementation”, Circuits and Systems for Video Technonlogy, IEEE Transactions, July 2003

[10] Yu-Wen Huang, Bing-Yu Hsieh, Tung-Chien Chen, and Liang-Gee Chen, “Hardware architecture design for H.264/AVC intra frame coder”, ISCAS 2004

[11] Hae-Yong Kang, Kyung-Ah Jeong, Jung-Yang Bae, Young-Su Lee, Seung-Ho Lee,

“MPEG4 AVC/H.264 decoder with scalable bus architecture and dual memory controller”, ISCAS 2004

[12] K. Suhring, Ed., JM 8.2 reference software (online), 2004. Available at ftp://ftp.imtc.org/jvt-experts/

[13] Yu-Wen Huang, To-Wei Chen, Bing-Yu Hsieh , Tu-Chih Wang, Te-Hao Chang and Liang-Gee Chen, “Architecture Design for Deblocking Filter in H.264/JVT/AVC”

International Conference on Multimedia and Expo(ICME’03), Vol. 1, pp. I-693-6, July 2003.

[14] Miao Sima, Yuanhua Zhou and Wei Zhang, “an Efficient Architecture for Adaptive Deblocking Filter of H.264/AVC Video Coding” IEEE Transactions on Consumer Electronics, Vol. 50, Issue 1, pp. 292-296, Feb. 2004.

[15] He-Wei Feng, Zhi-Gang Mao, Jin-Xiang Wang, Dao-Fu Wang, “Design and implementation of motion compensation for MPEG-4 AS profile streaming video decoding,”. 5th International Conference on ASIC, Oct. 2003. Proceeding.

[16] Tung-Chien Chen, Yu-Wen Huang, and Liang-Gee Chen, “Fully utilized and reusable architecture for fractional motion estimation of H.264/AVC,” IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2004.

[17] Shu-Tzu Lin, Chen-Yi Lee, “Analysis and design of a high-throughput two dimension inverse scan discrete cosine transform processor”, Master Thesis, Department of Electronics Engineering, National Chiao Tung University, Taiwan, June 2000

[18] Y. Hu, A. Simpson, K. McAdoo, and J. Cush, “A high definition H.264/AVC hardware video decoder core for multimedia SoC’s,” Proc. ISCE 2004.

[19] T. W. Chen, Y. W. Huang, T. C. Chen, Y. H. Chen, C. Y. Tsai, and L. G. Chen,

“Architecture Design of H.264/AVC Decoder with Hybrid Task Pipelining for High Definition Videos,” Proc. ISCAS 2005.

作 者 簡 歷

姓名 :林亭安

出生地 :台灣省台北市 出生日期:1980. 11. 24

學歷: 1987. 9 ~ 1993. 6 台北市立大安國民小學 1993. 9 ~ 1996. 6 台北市立和平國民中學 1996. 9 ~ 1999. 6 台北市立松山高級中學

1999. 9 ~ 2003. 6 國立交通大學 電子工程系 學士

2003. 9 ~ 2005. 6 國立交通大學 電子研究所 系統組 碩士

得 獎 事 績

2000/06 書卷獎 2001/01 書卷獎

2003/05 2003 全國 IC 設計競賽設計完整獎

2003/06 電子實驗專題競賽榮獲殷之同獎學金

2004/05 2004 全國 IC 設計競賽設計完整獎 2004/06 書卷獎

2004/10 2004 全國系統晶片設計比賽-光電通訊類 SIP 組特優 2005/05 2005 全國 IC 設計競賽優等獎

發 表 論 文

Ting-An Lin, Sheng-Zen Wang, Tsu-Ming Liu and Chen-Yi Lee, "An H.264/AVC Decoder with 4x4-block level pipeline", ISCAS 2005

Ting-An Lin, Tsu-Ming Liu and Chen-Yi Lee, "A Low-Power H.264/AVC Decoder", VLSI-TSA 2005

Sheng-Zen Wang, Ting-An Lin, Tsu-Ming Liu and Chen-Yi Lee, "A New Motion Compensation Design for H.264/AVC Decoder”, ISCAS 2005

Tsu-Ming Liu, Wen-Ping Lee, Ting-An Lin and Chen-Yi Lee, "A Memory-Efficient Deblocking Filter for H.264/AVC Video Coding", ISCAS 2005

Ting-An Lin, and Chen-Yi Lee, “Predictive Equalizer Design for DVB-T system”, ISCAS 2005

相關文件