• 沒有找到結果。

5.1 Conclusion

Viterbi decoders are widely used in communication systems as decoding convolutional codes which provide a superior error correction capacity while maintaining a reasonable coding complexity and computing resource. However, power dissipation in modern VLSI designs has become a more critical issue in communication systems and System-on-Chip (SoC) era which have emphasized low power features due to the limited battery life. For this reason, the goal of this work is to implement a low power survivor memory unit (SMU) hard macro for Viterbi decoders.

In this thesis, a fully custom SMU hard macro, which is composed of a lot of registers, with low power feature for Viterbi decoders is presented. We choose the register-exchange based SMU as our topology. After introducing relative low power registers, we proposed our low power low swing static edge-triggered latch (ETL) which is very suitable for the register-exchange SMU design due to its less clock loading and fewer transistors number at power and area domain. We not only simulate proposed low swing static ETL in 0.13 um CMOS process, but also in 90 nm CMOS process especially interesting in leakage power dissipation. The detailed simulation results and discussions are addressed in Chapter 3.

Then we based on our proposed low swing static ETL to construct the low power SMU hard macro by means of the fully custom design flow, and the

5.2 Future Works

Although we have completed a low power SMU hard macro design for Viterbi decoders, to integrate this work with other synthesized blocks of Viterbi decoders by means of EDA tools is another challenge. Besides, the data-gating and clock-gating techniques which are very powerful in low power applications could be utilized in implementation of SMU for saving input and clock buffers’ power dissipation.

In modern VLSI designs the demand of design for testability (DFT) is more and more important to the past because the larger chip size and complexity are disadvantageous to the yield of manufacture. Adding scan chain is an effective and widely used way in cell-based design flow for DFT. However, in our fully custom SMU hard macro we can not add scan mechanism easily. The DFT issue in this work needs to be solved in the future.

Besides, the complex interconnect routing between each stages in proposed SMU design occupied more area than stages itself. Besides, the fully custom routing takes a lot of time and is not an efficient solution while the system specifications of Viterbi decoders change. If we build the proposed low power low swing static ETL into the standard cell library, not only the complex interconnect routing could be done efficiently but also the DFT problem could be solved easily by means of convenient EDA tools in the cell-based design flow.

Bibliography

[1] M. Powell, S.-H. Yang, B. Falsafi, K. Roy, and T. N. Vijaykumar, "Gated-Vdd:

A Circuit Tecnhique to Reduce Leakage in Deep-Submicron Cache Memories,"

in International Symposium on Low Power Electronics and Design, 2000.

[2] M. Nakai, S. Akui, K. Seno, T. Meguro, T. Seki, T. Kondo, A. Hashiguchi, H.

Kawahara, K. Kumano, and M. Shimura, "Dynamic Voltage and Frequency Management for a Low-Power Embedded Microprocessor," IEEE Journal of

Solid-State Circuits, vol. 40, pp. 28-35, January 2005.

[3] J. T. Kao, M. Miyazaki, and A. P. Chandrakasan, "A 175-mV Multiply-Accumulate Unit Using an Adaptive Supply Voltage and Body Bias Architecture," IEEE Journal of Solid-State Circuits, vol. 37, pp. 1545-1554, November 2002.

[4] C.C. Lin, Y.H. Shih, H.C. Chang, and C.Y. Lee, “Design of a Power-Reduction Viterbi Decoder for WLAN Applications,” IEEE Transactions on Circuits and Systems, vol. 52, no. 6, JUNE 2005.

[5] A. J. Viterbi, “Error bounds for convolutional codes and asymptotically optimum decoding algorithm,” IEEE Trans. Inf. Theory, vol. IT-13, no. 2, pp. 260–269, Apr. 1967.

[6] J. G. D. Forney, “The Viterbi algorithm,” Proc. IEEE, vol. 61, no. 3, pp.268–278, Mar. 1973.

[7] W.M. Chan; Herming Chiueh, “A block-level optimization of comprehensive thermal-aware power management for SoC integration in nano-scale CMOS technology,” Circuits and Systems, 2005. 48th Midwest Symposium on, August 7-10, 2005.

[8] G. Fettweis and H. Meyr, “A 100 MBit/s Viterbi decoder chip: Novel architecture and its relization,” in Proc. IEEE Int. Conf. Communications (ICC), vol. 2, Aug. 1990, pp. 463–467.

[12] S. Sridharan and L. R. Carley, “A 110 MHz 350 mW 0.6 m CMOS 16-State Generalized-Target Viterbi Detector for Disk Drive Read Channels,” IEEE TRANSACTIONS ON SOLID-STATE CIRCUITS, VOL. 35, NO. 3, MARCH 2000.

[13] V. S. Gierenz, O. Weiß, T. G. Noll, I. Carew, J. Ashley and R. Karabed, “A 550 Mb/s Radix-4 Bit-Level Pipelined 16-State 0.25-μm CMOS Viterbi Decoder,”

Application-Specific Systems, Architectures, and Processors, 2000. Proceedings.

IEEE International Conference on.

[14] N. Bruels, E. Sicheneder, M. Loew, A. Schackow, J. Gliese and C. Sauer, “A 2.8 Gb/s, 32-State, Radix-4 Viterbi Decoder Add-Compare-Select Unit,” 2004 Symposium On VLSl Circuits Digest of Technical Papers.

[15] M. Anders, S. Mathew, R. Krishnamurthy and S. Borkar, “A 64-state 2GHz SOOMbps 40mW Viterbi Accelerator in 90nm CMOS,” 2004 Symposium On VLSl Circuits Digest of Technical Papers.

[16] J.M. Rabaey, “Digital Integrated Circuits: A Design Perspective,” Prentice Hall, New Jersey, 1996.

[17] G. Gerosa et al., “A 2.2 W, 80 MHz superscalar RISC microprocessor,” IEEE J.Solide-State Circuits, vol. 29, DEC 1994.

[18] V. De, Y. Ye, A. Keshavarzi, S. Narendra, J. Kao, D. Somasekhar, R. Nair, and S. Borkar, “Techniques for leakage power reduction,” in Design of High-Performance Microprocessor Circuits, A. Chandrakasan, W. Bowhill, and F. Fox, Eds. Piscataway, NJ: IEEE, 2001, ch. 3, pp. 52–55.

[19] Ding Li, P. Mazumder, N. Srinivas, “A dual-rail static edge-triggered latch”.

ISCAS’01, pp.645-648 vol. 2

[20] S. H. Rasouli, A. Khademzadeh, A. Afzali-Kusha, and M. Nourani, “Low power single- and double-edge-triggered flip-flops for high speed applications,” IEE Proc.Circuits Devices Syst. Vol. 152, no. 2, April 2005

[21] S.-D. Shin and B.-S. Kong, “Variable sampling window flip-flop for low power high-speed VLSI,” IEE Proc.Circuits Devices Syst. Vol. 152, no. 3, June 2005.

[22] H. Zhang, P. Mazumber, “Design of a new sense amplifier flip-flop with improved power-delay product,” Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on 23-26 May 2005 Page(s):1262 - 1265 Vol. 2

[23] H. Partovi, et al., “Flow-Through Latch and Edge-Triggered Flip-Flop Hybrid Elements,” IEEE biteniatiorzal Solid-State Circuits Corlfereiice, pp. 138-139, Feb. 1996.

[24] Chi-Ken Tsai , Po-Tsang Huang and Wei Hwang, "Low Power Pulsed Edge-Triggered Latches Design", 16th VLSI Design/CAD Symposium, 2005 [25] K. Usami and M. Horowitz, "Clustered Voltage Scaling Technique for

Low-Power Design," in International Symposium on Low Power Electronics

and Design, 1995.

[26] D. E. Lackey, P. S. Zuchowski, T. R. Bednar, D. W. Stout, S. W. Gould, and J.

M. Cohn, "Managing Power and Performance for System-on-Chip Designs Using Voltage Islands," in International Conference on Computer Aided Design, 2002.

[27] R. Puri, L. Stok, J. Cohn, D. Kung, D. Pan, D. Sylvester, A. Srivastava, and S.

Kulkarni, "Pushing ASIC Performance in a Power Envelope," in 40th Design

Automation Conference, 2003.

相關文件