• 沒有找到結果。

Joint Shield Insertion and Bus Encoding Flow for Power Minimization

Chapter 4 JOINT SHIELD INSERTION AND BUS

5.3 Joint Shield Insertion and Bus Encoding Scheme for Power

5.3.2 Joint Shield Insertion and Bus Encoding Flow for Power Minimization

Figure 49 illustrates our overall joint shield insertion and bus encoding flow for the bus power minimization. There are four steps in our proposed flow. At Step (1), with the initial bus structure (i.e., inserted m − n extra signal wires to the original bus) and the given bus parameters, we utilize the flexible bus encoding scheme to find the valid code set with minimum total transition power (see Section 3.2 for details). Then, we will check whether the code set covers all data patterns. If not, we conduct Step (3) to generate another bus structure (see Section 4.3 for details) and then redo from the Step (1). Otherwise, the code set will be output to Step (2) (see Section 5.2 for details)

set with minimal total transition power from the outputted code set. Following, the found minimum power valid code set will be recorded together with the corresponding bus structure, and then we will check if all bus structures are tried (i.e., all m − n extra signal wires substituted by shielding wires). If not, we conduct Step (3) to generate another bus structure and then redo from the Step (1). Otherwise, the best valid code set (with minimum total transition power) will be output to map to the data patterns with the corresponding bus structure.

(4). Save the low power valid code set and bus structure (4). Save the low power valid

code set and bus structure (2). Flexible bus encoding for power minimization (2). Flexible bus encoding

for power minimization Bus parameters, working frequency, and constraints Bus parameters, working frequency, and constraints

(1). Flexible bus encoding for delay constraint (1). Flexible bus encoding

for delay constraint

Does the valid code set cover all data pattern?

Does the valid code set cover all data pattern?

Output the valid code set with minimum power and the corresponding bus structure Output the valid code set with

minimum power and the corresponding bus structure

Figure 49: The proposed joint shield insertion and bus encoding flow for bus power

5.3.3 Simulation Results

Here, we present the simulation results of reducing the power consumption of the on-chip bus subject to the delay constraint by using the proposed joint shield insertion and bus encoding scheme.

In the following, the length, width, height, pitch of signal wires, and Power/Ground-to-signal pitch are 2000μm, 2μm, 2μm, 4μm, and 13μm, respectively.

The supply voltage is 1.2V. We vary the bus working frequency from 0.5 to 5 GHz, and the simulation results of a 6-bit bus with wire overheads (m − n) varying from 1 to

5 are shown in Figures 50, 51, and 52.

From Figures 50, 51, and 52, we observe that when the wire overhead m − n is larger than 3, the average power reduction of the joint shield insertion and bus encoding scheme could be more than that of the flexible bus encoding scheme. This is because a wider bus has more significant inductance effects than a narrower one, and the inductance coupling can be effectively minimized by the optimized shield insertion technique. Therefore, for tight wire overhead constraint (≤ 3 wires), we could obtain very well average power minimization by using the flexible bus encoding scheme. However, for the wire overhead constraint larger than 3, the joint

than that of the flexible bus encoding scheme.

In addition, from Figures 50, 51, and 52, we observe that as the working frequency increasing, our proposed joint shield insertion and bus encoding scheme could also obtain better average power reduction than that of the flexible bus encoding scheme. This is due to the significant inductance effects at high working frequency.

Hence, by applying the optimized shield insertion technique first for the inductance-dominated bus, then we could obtain a better valid code set for the bus power reduction by flexible bus encoding. Therefore, our proposed scheme is very suitable for minimizing the power consumption of the inductance-dominated bus.

0.5 GHz

10.00%

12.00%

14.00%

16.00%

18.00%

20.00%

22.00%

24.00%

26.00%

m=7 m=8 m=9 m=10 m=11

average power reduction(%)

Joint shielding and coding Flexible bus coding

Figure 50: Average power reduction of the flexible bus encoding scheme and the joint

1 GHz

Figure 51: Average power reduction of the flexible bus encoding scheme and the joint shield insertion and bus coding scheme for 1 GHz working frequency.

5 GHz

Figure 52: Average power reduction of the flexible bus encoding scheme and the joint

5.3.4 Summary

In this section, by incorporating the optimized shield insertion technique with our flexible bus encoding scheme, we proposed the joint shield insertion and bus encoding scheme for bus power minimization under a given delay constraint. For loose wire overhead constraints and high working frequency, simulation results show that the new scheme can gain more average power reduction than that gained by using the flexible bus encoding only. Therefore, we can conclude that the joint shield insertion and bus encoding scheme could outperform the flexible bus encoding scheme for inductance-dominated buses.

Chapter 6

CONCLUSIONS & FUTURE WORKS

In this dissertation, we first show that the worst-case switching pattern that incurs the longest bus delay while considering the RLC effect is quite different from that while considering RC effect alone. It implies that the existing encoding schemes based on the RC model may not improve or possibly worsen the delay when the inductance effects become dominant. Then, we proposed several bus encoding schemes to minimize the bus coupling delay and the power dissipation. Simulation results are also given to show the performance of our proposed schemes.

In Chapter 1, while considering the RLC circuit model for the bus structure, our finding show that the worst-case switching pattern could be very different from that of only considering RC effects. First, when the inductance effect dominates, the

simultaneously switch in the same direction. Furthermore, we indicate that while considering the RLC effects of interconnects, the worst-case switching pattern varies from different levels of interconnect and different working frequencies. Hence, as inductance cannot be neglected in today’s high-speed circuit design, it is very important to consider the RLC effects with different bus parameters to develop the proper encoding schemes for bus delay reduction.

In Chapter 2, we proposed a bus-invert method to reduce the worst-case on-chip bus delay with the dominance of the inductance coupling effect. Simulation results have shown that our encoding method can significantly reduce the worst coupling delay of the buses with strong inductive coupling.

From our simulation results, we observe that the worst-case switching pattern could vary with given design parameters considering the RLC effects of interconnects.

However, the proposed bus-invert method can be utilized to reduce the bus coupling delay only when the inductance effects dominate. Therefore, in Chapter 3, we proposed a flexible encoding scheme for on-chip buses that can consider the given parameters to reduce the LC coupling delay. In addition, with some modification, this new encoding scheme can be utilized to predict and lengthen the signal propagation length of a bus under the given constraints. Simulation results are also given to support our claims.

In Chapter 4, to further reduce the coupling noise, we proposed a joint shield insertion and bus encoding scheme for global bus design in nanometer technologies.

With the user-given bus parameters, the working frequency, the scheme can effectively reduce the LC coupling effects and hence, minimize the bus coupling delay.

Simulation results show that the proposed scheme can significantly reduce the coupling delay.

Recently, the power consumption of on-chip interconnect is another crucial issue in high performance circuit designs. In Chapter 5, we first proposed a flexible encoding scheme to minimize the power consumption of buses under given delay constraints. To further improve the performance of the encoding scheme (further reduce the power consumption), we also utilized the joint shield insertion and bus encoding method to solve the problem. With the user-given bus parameters, the working frequency, and the delay constraint, both schemes can minimize the bus power consumption subject to the delay constraint by effectively reducing the LC coupling effects. Simulation results show that the proposed schemes can significantly reduce the coupling delay and the power consumption of a bus according to the delay constraint. In addition, the joint shield insertion and bus encoding method can gain more power reduction than that gained by only using the flexible bus encoding scheme.

In the future, we intend to incorporate the repeater insertion technique in our joint shield insertion and bus encoding scheme to further reduce the bus delay and power consumption. In addition, the ground bounce effect can also be considered and minimized in our future work. Consider the switching probabilities of the bus wires, we can also try to generate a specific code set for the bus used in ASIC to optimize the bus power and delay. If those issues can be considered, our encoding scheme will be more powerful.

Bibliography

[1] Semiconductor Industry Association, National Technology Roadmap for Semiconductors, 1997.

[2] Semiconductor Industry Association, International Technology Roadmap for Semiconductors, 2003.

[3] L. Trevillyan, D. Kung, R. Puri, L. N. Reddy, and M. A. Kazda, “An Integrated Environment for Technology Closure of Deep-Submicron IC Designs,” IEEE Design and Test of Computers, vol. 21, issue 1, pp. 14−22, Jan. −Feb., 2004.

[4] P. P. Sotiriadis and A. P. Chandrakasan, ”Bus Energy Reduction by Transition Pattern Coding Using a Detailed Deep Submicrometer Bus Model,” IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications, vol. 50, issue 10, pp. 1280−1295, Oct., 2003.

[5] M. A. Elgamel and M. A. Bayoumi, “Interconnect Noise Analysis and Optimization in Deep Submicron Technology,” IEEE Circuits and Systems Magazine, vol. 3, issue 4, pp. 6−17, 2003.

[6] M. H. Chowdhury, Y. I. Ismail, C. V. Kashyap, and B. L. Krauter, “Performance Analysis of Deep Sub micron VLSI Circuits in the Presence of Self and Mutual Inductance,” IEEE International Symposium on Circuits and Systems, vol. 4, pp.

197−200, 2002.

[7] Y. I. Ismail, “On-Chip Inductance Cons and Pros,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 10, issue 6, pp. 685−694, Dec.

2002.

[8] L. He and K. M. Lepak, “Simultaneous Shield Insertion and Net ordering for Capacitive and Inductive Coupling Minimization,” International Symposium on Physical Design, pp. 55−60, 2000.

[9] K. Hirose and H. Yasuura, “A bus delay reduction technique considering crosstalk,” Proceedings of the Design, Automation and Test in Europe Conference and Exhibition, pp. 441−445, March 2000.

[10] R. Escovar and R. Suaya, “Optimal Design of Clock Trees for Multigigahertz Applications,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 23, issue 3, pp. 329−345, March, 2004.

[11] K. Nabors and J. White, “FastCap: A Multipole Accelerated 3-D capacitance extraction program,” IEEE Trans. Computer-Aided Design, vol. 10, No. 11, pp.

1447−1459, Nov. 1991.

[12] M. Kamon, M. J. Tsuk, and J. K. White, “FastHenry: a Multipole-accelerated 3D Inductance Extraction Program,” IEEE Trans. Computer-Aided Design, pp.

1750−1758, Sept. 1994.

[13] H. Kriplani, F. N. Najm, and I. N. Hajj, “Pattern Independent Maximum Current Estimation in Power and Ground Buses of CMOS VLSI Circuits: Algorithms, Signal Correlations, and Their Resolution,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 14, issue 8, pp.

998−1012, Aug. 1995.

[14] J. Chen and L. He, “Determination of Worst-Case Crosstalk Noise for Non-Switching Victims in GHz+ Interconnects,” Proceedings of Asia and South Pacific Design Automation Conference, pp. 162−167, Jan., 2003.

[15] C. K. Cheng, J. Lillis, S. Lin, and N. Chang, Interconnect Analysis and Synthesis, John Wiley & Sons, Inc., 2000.

[16] D. K. Cheng, Filed and Wave Electromagnetics, 2nd Ed., Addison-Wesley, 1989.

[17] M. W. Beattie and L. T. Pileggi, “Inductance 101: modeling and extraction,”

Proceedings of Design Automation Conference, pp. 323−328, 2001.

[18] Y. Cao, X. Huang, N. H. Chang, S. Lin, O. S. Nakagawa, W. Xie, D. Sylvester, and C. Hu, “Effective On-Chip Inductance Modeling for Multiple Signal Lines and Application to Repeater Insertion,” International Symposium on Quality Electronic Design, pp. 185−190, Mar. 2001.

[19] Y. I. Ismail, E. G. Friedman, and J. L. Neves,” Figures of Merit to Characterize the Importance of On-Chip Inductance,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 7, issue 4, pp. 442−449, Dec. 1999.

[20] M. Stan and W. Burleson, “Bus-invert coding for low-power I/O,” IEEE Transactions on Very Large Scale Integrated Systems, vol. 3, issue 2, pp. 49−58, Mar. 1995.

[21] M. Stan and W. Burleson, “Low-power encodings for global communication in CMOS VLSI,” IEEE Transactions on Very Large Scale Integrated Systems, vol.

5, issue 12, pp. 444−455, Dec. 1997.

[22] Y. Shin, S. I. Chae, and K. Choi, ”Partial bus-invert coding for power optimization of system level bus,” Proceedings of the International Symposium on Low Power Electronics and Design, pp. 127−129, 1998.

[23] Y. Zhang, J. Lach, K. Skadron, M. Stan, “Odd/even bus invert with two-phase transfer for buses with coupling,” Proceedings of the International Symposium on Low Power Electronics and Design, pp. 80−83, 2002.

[24] T. Lindkvist, J. Lofvenberg, and O. Gustafsson, ”Deep submicron bus invert coding,” Proceedings of the Nordic Signal Processing Symposium, pp. 133−136, June 2004.

[25] H. Kaul, D. Sylvester, M. A. Anders, and R. K. Krishnamurthy, “Design and analysis of spatial encoding circuits for peak power reduction in on-chip buses,”

1225−1238, Nov. 2005.

[26] H. S. Deogun, R. R. Rao, D. Sylvester, and D. Blaauw, “Leakage-and crosstalk-aware bus encoding for total power reduction,” IEEE/ACM Design Automation Conference, pp. 779−782, June, 2004.

[27] S. R. Sridhara and N. R. Shanbhag, “A low power bus design using joint repeater insertion and coding,” Proceedings of the International Symposium on Low Power Electronics and Design, pp. 99−102, 2005.

[28] P. P. Sotiriadis and A. P. Chandrakasan, “Reducing bus delay in submicron technology using coding,” Proceedings of the Asia and South Pacific Design Automation Conference, pp. 109−114, Feb. 2001.

[29] B. Victor and K. Keutzer, “Bus encoding to prevent crosstalk delay,”

International Conference on Computer Aided Design, pp. 57-63, Nov. 2001.

[30] K. H. Baek, K. W. Kim, and S. M. Kang, “A low energy encoding technique for reduction of coupling effects in SOC interconnects,” Proceedings of the 43rd IEEE Midwest Symposium on Circuits and Systems, pp. 80−83, Aug. 2000.

[31] M. Cha, C. G. Lyuh, and T. Kim, “Resource-constrained low-power bus encoding with cosstalk delay elimination,” Proceedings of the Asia and South Pacific Design Automation Conference, pp. 835−838, Jan. 2004.

[32] M. Lampropoulos, B. M. Al-Hashimi, and P. Rosinger, “Minimization of crosstalk noise, delay and power using a modified bus invert technique,”

Proceedings of the Design, Automation and Test in Europe Conference and Exhibition, pp. 1372−1373, Feb. 2004.

[33] D. Chinnery and K. Keutzer, Closing the Gap Between ASIC & Custom - Tools and Techniques for High-Performance ASIC Design. Kluwer, 2002.

[34] R. Ho, K.W. Mai, and M. A. Horowitz, “The Future of Wire,” Proceedings of the IEEE, vol. 89, issue 4, pp. 490−504, April 2001.

[35] J. Cong, “An Interconnect-Centric Design Flow for Nanometer Technologies,”

Proceedings of the IEEE, vol. 89, issue 4, pp. 505−528, April 2001.

[36] L. O. Chua, C. A. Desoer, and E. S. Kuh, Linear and nonlinear circuits, McGraw-Hill Inc, 1987.

[37] D. Matzke, “Will physical scalability sabotage performance gain?” IEEE Computer, pp. 37−39, Sept. 1997.

[38] Y. Cao, C. Hu, X. Huang, A.B. Kahng, S. Muddu, D. Stroobandt, and D.

Sylvester, “Effects of global interconnect optimizations on performance estimation of deep submicron design,” IEEE/ACM International Conference on Computer Aided Design, pp. 56 – 61, Nov. 2000.

[39] M. A. Elgamel, A. Kumar, and M. A. Bayoumi, “Efficient shield insertion for inductive noise reduction in nanometer technologies,” IEEE Transaction on

Very Large Scale Integration Systems, vol. 13, issue 3, pp. 401−405, Mar. 2005.

[40] H. Kual, D. Sylvester, and D. Blaauw, “Clock net optimization using active shielding,” Proceedings of the 29th European Solid-State Conference, pp.

265−268, Sept. 2003.

[41] J. Cong, L. He, C. K. Koh, and Z. Pan, “Interconnecting sizing and spacing with consideration of coupling capacitance,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, pp.1164-1169, Sept. 2001.

[42] M. A. El-Moursy and E. G. Friedman, “Optimum wire sizing of RLC interconnect with repeaters,” Proceedings of the IEEE Great Lakes Symposium on VLSI, pp. 27−32, April 2003.

[43] Y. I. Ismail, E. G. Friedman, and J. L. Neves, “Repeater insertion in tree structured inductive interconnect,” IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, vol. 48, issue 5, pp. 471−481, May 2001.

[44] L. P. P. P. van Ginneken, ”Buffer Placement in distributed RC-tree networks for minimal Elmore delay,” Proceedings of the International Symposium on Circuits and Systems, pp. 865−868, 1990.

[45] C. J. Alpert, A. Devgan, and S. T. Quay, “Buffer insertion for noise and delay optimization,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, pp.1633−1645, Nov. 1999.

[46] A. Nalamalpu, S. Srinivasan, and W. Burleson, “Boosters for driving long on-chip interconnects: design issues, interconnect synthesis and comparison with repeaters,” IEEE Transactions on Computer-Aided Design, vol. 21, issue 1, pp. 50−62, Jan. 2002.

[47] H. Zhang, V. George, and J. Rabaey, “Low-swing on-chip signaling techniques:

effectiveness and robustness,” IEEE Transactions on Very Large Scale Integration Systems, vol. 8, issue 3, pp. 264−272, June 2000.

[48] Y. Massoud, J. Kawa, D. MacMillen, and J. White, “Modeling and analysis of differential signaling for minimizing inductive crosstalk,” Proceedings of the Design Automation Conference, pp. 804−809, 2001.

[49] Y. I. Ismail and E.G. Friedman, ”Effects of inductance on the propagation delay and repeater insertion in VLSI circuits: a summary,” IEEE Circuits and Systems Magazine, vol. 3, issue 1, pp. 24−28, 2003.

[50] M. H. Chowdhury, Y. I. Ismail, C. V. Kashyap, and B. L. Krauter, “Performance analysis of deep sub micron VLSI circuits in the presence of self and mutual inductance,” IEEE Internstional Symposium on Circuits and Systems, vol. 4, pp.

IV-197−IV-200, May 2002.

Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.

16, pp. 1418−1430, Dec. 1997.

[52] G. Zhong, C. K. Koh, and K. Roy, “A Twisted-Bundle Layout Structure for minimizing Inductive Coupling Noise,” IEEE International Conference on Computer Aided Design, pp. 406−411, 2000.

[53] H. Kual, D. Sylvester, and D. Blaauw, “Performance optimization of critical nets through active shielding,” IEEE Transactions on Circuits and Systems, vol.

51, issue 12, pp. 2417−2435, 2004.

[54] H. Kual, D. Sylvester, and D. Blaauw, “Active shielding of RLC global interconnects,” IEEE European Solid-State Circuits Conference, pp. 265−268, 2003.

[55] D. Deutsch, G. Kopcsay, P. Restle, H. Smith, G. Katopis, W. Becker, P. Coteus, C. Surovic, B. Rubin, R. Dunne, T. Gallo, K. Jenkins, L. Terman, R, Dennard, G.

Sai-Halasz, B. Krauter, and D. Knebel, “When are transmission-line effects important for on-chip interconnects?,” IEEE Transactions on Microwave Theory Technique, vol. 45, pp. 1836−1846, Oct. 1997.

[56] L. Pileggi, “Coping with RC(L) interconnect design headaches,” Proceedings of International Conference on Computer Aided Design, Nov. 1995.

[57] V. Soteriou and L. S. Peh, ”Design-space exploration of power-aware on/off interconnection networks,” Proceedings of the IEEE International Conference on Computer Design, pp. 510−517, 2004.

[58] S. R. Sridhara, A. Ahmed, and N. R. Shanbhag, “Area and energy-efficient crosstalk avoidance codes for on-chip buses,” Proceedings of the IEEE International Conference on Computer Design, pp. 12−17, 2004.

[59] P. Subrahmanya, R. Manimegalai, V. Kamakoti, and Madhu Mutyam, “A bus encoding technique for power and cross-talk minimization,” Proceedings of the 17th International Conference VLSI Dessign, pp. 443−448, 2004.

[60] P. P. Sotiriadis and A. Chandrakasan, “Low power bus coding techniques considering inter-wire capacitance,” Proceedings of the IEEE Custom Integrated Circuits Conference, pp. 507−510, 2000.

作 者 簡 介 About the Author

姓 名 中文:涂尚瑋 英文:Shang-Wei Tu

基本資料 生日:1977/04/10 籍貫:台灣省台南市

學 歷 國立交通大學電子研究所博士班 2001/09 ~ 2006/09 國立交通大學電子研究所碩士班 1999/09 ~ 2001/06 國立交通大學電子工程學系 1995/09 ~ 1999/06

經 歷 數位電路與系統助教 2006 九十四年度交通大學電子所年度論文獎佳作 2005

計算機輔助設計特論助教 2003 ~ 2005 交通大學電子研究所博士班甄試第一名 2001

程式語言助教 2000 研究所學期成績優異獎 1999 凌陽科技公司暑期實習 1999

教育部大學校院積體電路電腦輔助設計軟體製作競賽 佳作 1999

教育部大學校院積體電路電腦輔助設計軟體製作競賽 入圍 1999

研究領域 Circuit Modeling Computer-Aided Design Algorithms

專 長 EDA Algorithms

Hardware Description Language: Verilog Programming Language: C/C++

著 作 目 錄 涂尚瑋

Publication List Shang-Wei Tu

依新法記點 國外期刊(共 4 點)

(2 點,長文) 1. S. W. Tu, W. Z. Shen, Y. W. Chang, T. C. Chen, and J. Y. Jou,

"Inductance modeling for on-chip interconnects," Journal of Analog Integrated Circuits and Signal Processing, pp. 65-78, vol.

35, No. 1, April, 2003. (Invited paper for ISCAS’2002 special issue on analog and mixed signal circuits.)

(2 點,短文) 2. Shang-Wei Tu, Yao-Wen Chang, and Jing-Yang Jou, “RLC coupling-aware simulation and on-chip bus encoding for delay reduction,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2005. (Accepted)

國際會議(至多採計一篇)

國際會議(至多採計一篇)