Overall Algorithm - 在佈局階段時同時對緩衝器與正反器做放置規畫以及電壓下降的最小化

The simulated annealing process begins from a random feasible Γ. We insert the buffer and flip-flop according to the method described in Section 3. Then we per-turbs the floorplan using the three perturbations. After each move, buffers and flip-flops are planned according to the new floorplan. The process terminates when the solution is frozen, the temperature is too low, or the runtime is too long.

The flow of our algorithm is summarized in Figure 4.1. In line 1, we first get a initial floorplan by random assign the B*-tree. In lines 3-16, we perturbs the floorplan from one to another until the solution is converged or cool down.

Figure 4.1: simultaneous buffer / flip-flop station planning and voltage drop.

Chapter 5 Experimental Results

We implemented our approach in the C++ Programming language and the platform is AMD Opteron (tm) 2.8G with 2.0GB memory. We experiment with our approach on MCNC[1] circuit banchmark. Table 5.1 lists the technology file and buffer library used in our experiments that are based on 0.18-µm in the NTRS’97 roadma[2]. The intrinsic delay and input capacitance of a flip flop is 10% that buffer has. Our IR-drop constraint is 5% of the supply voltage. Thus the IR-drop constraints are Vmin = 1.71 for the power and Vmax = 0.9 for the ground. We give each circuit two power pads and randomly assigned the peak current on each P/G pin of the modules as [15] did. The current of buffer and flip flop are assigned as the proportion of its area to the smallest module area in each circuits. The vertical and horizontal power wire pitches are both 600µm.

The first experiment compare the results of planning buffer block without IR drop consideration and our methodology that insert buffer block and also consider IR-drop. In this experiment, the two-terminal nets obtained by splitting from mul-titerminal nets and the timing requirement of each net are generated by [9] from 1.05-1.20D_opt as the [11] did. The experimental result are summarized in Table 5.2 . The first column shows the circuit name and the algorithm used. The second column shows the number of nets meeting timing requirements (# nets meet) and

Table 5.1: Parameter of 0.18-µm Technology in the NRTS’97 RoadMap[11].

Parameter Description (unit) Value

r wire sheet resistance (Ω/¤) 0.068

r_w wire unit-length resistance of 0.9 µm width (Ω/µm) 0.075

ω wire width (µm) 0.9

c_w wire unit-length capacitance of 0.9 µm width (Ω/µm) 0.118

C^L load capacitance (fF) 23.4

R^D driver resistance (Ω) 180

D_b intrinsic buffer delay (ps) 36.4

C_b buffer input capacitance (fF) 23.4

R_b buffer output resistance (Ω) 180

A_b buffer size (µm²) 400

that of total nets in a circuit ( Tot. # nets). The third column gives the percentages of nets meeting the timing constraints. Column4 lists the number of buffers inserted (# buffers). Column 5 gives the worst voltage of modules in each circuit. Column 6 gives the percentages of extra areas over the given floorplans for buffer insertion.

The result shows that our methodology have almost the equal result on the timing requirement and the area overhead and also does not violate any IR-drop constraint.

In the second experiment, unlike the first experiment, the timing constraint for every net are given the same constraint to reflect that some nets need pipelining . Because the ami33 circuit is much smaller than others, its timing constraint is given as half value of other circuits. We compare our methodology ”simultaneous buffer / flip-flop station planning and voltage drop minimization in floorplan design” called method A (M. A) to the case that does not consider IR-drop called method B (M. B) and the case that does not consider latency and throughput called method (M. C).

The experimental result are summarized in Table 5.3. Column 4 lists the number of nets that only needs buffer (B. net). Column 5 lists the number of nest that are pipelined (FF net). Column 6 lists the number of buffer inserted (# B). Column

Table 5.2: Compare the result of planning buffer block and the result of planning buffer block considering IR-drop, where column 2 is the number of nets meet timing requirement / number of total nets, column 3 is the percentage representation of column 2, column 4 is the number of buffer inserted, column 5 is the smallest voltage of the P/G pin among the modules in each circuit, column 6 is the area overhead caused by buffer inserted.

# buffers worst voltage Extra area (%) apte

7 lists the number of flip-flops inserted (# FF). Column 8, 9 , respectively, are the worst latency and system throughput.

From the results of M. A, M. B, and M. C, it shows that our methodology can find a path with minimum latency if there exist any one (each percentage of nets meet timing requirement is high than 98%). The results of M. A and M. B shows that M. A does not have any IR-drop violation thought M. B have less area overhead.

The IR-drop violation and area overhead is a tradeoff between M. A and M. B. The Results M. C have almost equal overhead to M. A but have less system throughput and higher latency.

Table 5.3: Compare the result of planning buffer / flip flop considering IR drop (M.

A) and without IR-drop consideration (M. B) and without latency and throughput consideration (M. C), where column 4 is the number of nets that need only buffer to satisfy its timing requirement, column 5 is the number of the nets have to be pipelined, column 6 is the number of buffer inserted, column 7 is the number of flip flop inserted, column 8 is the largest latency of nets, column 9 is the largest throughput of cycles.

Chapter 6 Conclusion

In this thesis, we propose a methodology to pipeline interconnect in floorplan to estimate the system latency and throughput to avoid extreme high latency global net. Also we consider the IR drop during the planning of buffers and flip flops. The experimental results shows that our methodology is effective. As the size of chip getting larger, and size of buffer getting smaller, we expect the methodology will become more important in the future.

Bibliography

[1] “www.cse.ucsc.edu/research/surf/gsrc/mcncbench.html”.

[2] “National Technoloogy Roadmap for Semiconductors”. ed: Semiconductor In-dustry Assoc., 1997.

[3] C. J. Alpert and A. Devgan. “Wire segmenting for improved buffer insertion”.

In Proc. of DAC, pages 588–593. June 1997.

[4] Y. C. Chang, Y. W. Chang, G. M. Wu, and S. W. Wu. “B*-Trees : A new representation for non-slicing floorplans”. In Proc. of DAC, pages 458–463.

April 2000.

[5] Y. H. Cheng and Y. W. Chang. “Integrating buffer planning with floorplanning for simultaneous multi-objective optimization”. In Proc. of ASPDAC, pages 431–434. January 2003.

[6] P. Cocchini. “A methodology for optimal repeater insertion in pipelined inter-connects”. In IEEE TCAD, volume 32, No 12, pages 1613–1624. Dec 2003.

[7] J. Cong. “Challenges and opportunities for design innovations in nanometer technologies”. In SRC Design Sciences Concept Paper. San Jose, CA: Semicon-ductor Research Corp., 1997.

[8] J. Cong. “Post-Placement Voltage Island Generation”. In An interconnect-centric design flow for nanometer technologies, volume 89, pages 505–528. Apr 2001.

[9] J. Cong, T. Kong, and D. Z. Pan. “Buffer block planning for interconnect-driven floorplanning”. In Proc. of ICCAD, pages 358–363. Nov 1999.

[10] A. Jahanian and M. Saheb Zamani. “Multi-level buffer block planning and buffer insertion for large design circuits”. In International Symposium on Very Large Scale Integrated Circuits, pages 411–415. 2005.

[11] Iris H. R. Jiang, Y. W. Chang, J. Y. Jou, and K. Y. Chao. “Simultaneous floorplan and buffer-block optimization”. In IEEE TCAD, volume 23, No.5.

May 2004.

[12] J. Lillis, C. K. Cheng, and T. T. Y. Lin. “Optimal wire sizing and buffer insertion for low power and a generalized delay model”. In Proc. ICCAD, pages 138–143. Nov 1995.

[13] S. Lin and N. Chang. “Challenges in power-ground integrity”. In Proc. of ICCD, pages 651–654. 2006.

[14] V. Litovski and M. Zwolinski. “VLSI Circuit Simulation and Optimization”.

Champman & Hall, 1997.

[15] C. W. Liu and Y. W. Chang. “Floorplan and power/ground network co-synthesis for fast design convergence”. In Proc. of ISPD, pages 86–93. 2006.

[16] T. Okamoto and J. Cong. “Buffered Steiner tree construction with wire sizing for low power and a generalized elay model”. In Proc. of ICCAD, pages 44–49.

Nov 1996.

[17] P. Sarkar, V. Sundararaman, and C. K. Koh. “Routability-driven repeater block planning for interconnct-centric floorplanning”. In Proc. of ISPD, pages 186–191. Apr 2000.

[18] X. Tang and D. F. Wong. “Planning buffer locations by network flows”. In Proc. of ISPD, pages 180–185. Apr 2000.

[19] L. P. P. P. van Ginneken. “Buffer placement in distributed RC-tree net works for minimal Elemore delay”. In IEEE ISCS, pages 865–868. 1990.

[20] L. Wang. “Throughput-Aware Floorplanning by Dynamically Considering Mul-tiple Critical Cycles”. Master Thesis, Department of Electronics Engineering, National Chiao Tung University, 2007.

[21] G. M. Wu, Y. C. Chang, and Y. W. Chang. “Rectilinear Block Placement Using B*-Trees”. In ACM TODAE. April 2003.

[22] J. S. Yim, S. O. Bae, and C. M. Kyung. “A floorplan-based planning method-ology for power and clock distribution in ASICS”. In Proc. of DAC, pages 766–771. 1999.

作者簡歷

潘信華，民國七十一年七月出生於台北市。民國九十四年六月畢業於輔仁大學電子工程學系，並於同年九月進入國立交通大學電子研究所就讀，從事 VLSI 實體設計方面相關研究。民國九十七年一月取得碩士學位，碩士論文題目為『在

佈局階段同時對緩衝器與正反器做放置規畫以及電壓下降的最小化』。

在文檔中在佈局階段時同時對緩衝器與正反器做放置規畫以及電壓下降的最小化 (頁 41-52)