• 沒有找到結果。

The scan-chain ordering problem is formulated as TSP problem and modified for 3D IC designs considering TSV constraints. A fast scan-chain ordering algorithm is developed, consisting of two stages: (1) initial solution computation; and (2) local refinement and constraint solving. To avoid high complexity of 3D optimization, we convert such a problem into a TSP problem and use a state-of-the-art algorithm, a multiple fragment heuristic, combined with a dynamic closest-pair data structure named FastPair to quickly derive a satisfactory initial solution. Two techniques, 3D planarization and 3D relaxation are also proposed to minimize the wire cost, (power cost or combined cost) and to relax the total number of TSVs in use, respectively.

Experimental results show that the proposed algorithm can achieve comparable (<3 percent difference) or even better performance than that from a GA method, resulting in run time speeds at least two-orders faster on all ISCAS’89 benchmark circuits when considering TSV constraints. Moreover, our algorithm is shown to successfully scale to multiple scan chains for large designs from [23]. Therefore, the proposed algorithm can be practically used for scan-chain ordering of 3D IC designs.

As a result, the contributions of this work can be summarized as:

• Formulate scan-chain ordering considering TSV constraints into a modified TSP problem.

• Propose a greedy algorithm for scan-chain ordering of 3D-IC designs to simultaneously minimize wire and power costs.

• Demonstrate that the proposed algorithm can be practically used while supporting multiple scan chains.

To achieve high-performance computing on embedded systems, three-dimensional (3D) multi-core processors have become a promising alternative where energy efficiency is crucial to its success. Many heuristics applying Dynamic Voltage and Frequency Scaling (DVFS) techniques were proposed for energy minimization. However, most of the previous works were built upon a fixed task-to-core mapping where many slack spaces can be further improved. Therefore, in this work, we propose two dynamic remapping strategies to enhance an energy-aware task-scheduling algorithm considering transmission cost.

The other issue in this work, we apply two dynamic task-to-core remapping strategies, DR and IDR, on top of a baseline task-scheduling algorithm from [41].

Experimental results show the solutions from the two task-to-core remapping strategies are more close to the ILP solution where IDR only has 2.54 percent difference. Besides, two proposed methods can run 300X to 11,000X faster than the ILP method. Our experiments also show that the energy-saving rate of the IDR method can achieve 16 percent higher than that of the baseline algorithm on average.

As a result, the scheduling algorithm with dynamic task-to-core remapping can result in more energy efficient scheduling than that with a fixed task-to-core mapping.

Based on such result, the scheduling algorithm with dynamic task-to-core remapping can result in more energy efficient scheduling than that with a fixed task-to-core mapping. We also implemented an ILP-based method for an optimal solution without considering the transmission costs between cores. The experimental results show IDR strategy can achieve comparable performance as the ILP and the energy-saving difference between IDR and ILP is only −2.54 percent on average.

However, in our future work, we would like to further validate our IDR strategy considering transmission costs between cores. Hence, the problem is formulated into Quadratic Programming (QP) and compared to the IDR solution considering the transmission costs between cores. However, the scalability problem of QP-based method is more sever with transmission costs. Therefore, scheduling with a larger task graph is implemented into Genetic Algorithm (GA) and also compared to our IDR solution.

References

[1] J. W. Joyner, P. Zarkesh-Ha, J. A. Davis, and J. D. Meindl, “A three-dimensional stochastic wire-length distribution for variable separation of strata,” in Interconnect Technology Conference, 2000. Proceedings of the IEEE 2000 International, 2000, pp.

126 –128.

[2] J. Joyner and J. Meindl, “Opportunities for reduced power dissipation using three-dimensional integration,” in Interconnect Technology Conference, 2002.

Proceedings of the IEEE 2002 International, 2002, pp. 148 – 150.

[3] C. Ababei, Y. Feng, B. Goplen, H. Mogal, T. Zhang, K. Bazargan, and S.

Sapatnekar, “Placement and routing in 3d integrated circuits,” Design Test of Computers, IEEE, vol. 22, no. 6, pp. 520 – 531, nov.-dec. 2005.

[4] W. R. Davis, J. Wilson, S. Mick, J. Xu, H. Hua, C. Mineo, A. M. Sule, M. Steer, and P. D. Franzon, “Demystifying 3d ics: the pros and cons of going vertical,” Design Test of Computers, IEEE, vol. 22, no. 6, pp. 498 – 510, nov.-dec. 2005.

[5] Y. Xie and Y. Ma, “Design space exploration for 3d integrated circuits,” in Solid-State and Integrated-Circuit Technology, 2008. ICSICT 2008. 9th International Conference on, 20-23 2008, pp. 2317 –2320.

[6] B. Noia, K. Chakrabarty, and E. J. Marinissen, “Optimization methods for post-bond die-internal/external testing in 3d stacked ics,” in INTERNATIONAL TEST CONFERENCE (ITC), 2010.

[7] M. Hirech, J. Beausang, and X. Gu, “A new approach to scan chain reordering using physical design information,” in Test Conference, 1998. Proceedings., International, 18-23 1998, pp. 348 –355.

[8] S. Makar, “A layout-based approach for ordering scan chain flip-flops,” in Test Conference, 1998. Proceedings., International, 18-23 1998, pp. 341 –347.

[9] P. Gupta, A. Kahng, and S. Mantik, “Routing-aware scan chain ordering,” in Design Automation Conference, 2003. Proceedings of the ASP-DAC 2003. Asia and South Pacific, 21-24 2003, pp. 857 – 862.

[10] S. Remersaro and X. et al., “Preferred fill a scalable method to reduce capture power for scan based designs,” in Proc. Int. Test Conf., 2006.

[11] S. R., R. Oruganti, and N. Touba, “Static compaction techniques to control scan vector power dissipation,” in VLSI Test Symposium, 2000. Proceedings. 18th IEEE, 2000, pp. 35 –40.

[12] P.Rosinger and B.-H. et al., “Scan architecture with mutually exclusive scan segment activation for shift and capture power reduction,” Proc. Transactions on Computer-Aided Design(TCAD), pp. 1142–1153, 2004.

[13] Y. Bonhomme, P. Girard, L. Guiller, C. Landrault, S. Pravossoudovitch, and A.

Virazel, “Design of routing-constrained low power scan chains,” in DATE’04:

Proceedings of the conference on Design, automation and test in Europe. Washington, DC, USA: IEEE Computer Society, 2004, p. 10062.

[14] X.-L. Huang and J.-l. Huang, “A routability constrained scan chain ordering technique for test power reduction,” in Design Automation, 2006. Asia and South Pacific Conference on, 24-27 2006, p. 5 pp.

[15] B. Paul, R. Mukhopadhyay, and I. Gupta, “Genetic algorithm based scan chain optimization and test power reduction using physical information,” in TENCON 2006.

2006 IEEE Region 10 Conference, 14-17 2006, pp. 1 –4.

[16] Y.-Z. Wu and M.-T. Chao, “Scan-chain reordering for minimizing scan-shift power based on non-specified test cubes,” in VLSI Test Symposium, 2008. VTS 2008.

26th IEEE, april 2008, pp. 147 –154.

[17] X. Wu, P. Falkenstern, K. Chakrabarty, and Y. Xie, “Scan-chain design and optimization for three-dimensional integrated circuits,” J. Emerg. Technol. Comput.

Syst., vol. 5, no. 2, pp. 1–26, 2009.

[18] D. Lewis and H.-H. Lee, “A scan island based design enabling prebond testability in die-stacked microprocessors,” in Test Conference, 2007. ITC 2007, IEEE International, oct. 2007, pp. 1 –8.

[19] A. Kumar, S. Reddy, I. Pomeranz, and B. Becker, “Hyper-graph based partitioning to reduce dft cost for pre-bond 3d-ic testing,” in Design, Automation Test in Europe Conference Exhibition (DATE), 2011, March 2011, pp. 1 –6.

[20] J. L. Bentley, “Experiments on traveling salesman heuristics,” in SODA ’90:

Proceedings of the first annual ACM-SIAM symposium on Discrete algorithms.

Philadelphia, PA, USA: Society for Industrial and Applied Mathematics, 1990, pp.

91–99.

[21] D. Eppstein, “Fast hierarchical clustering and other applications of dynamic closest pairs,” J. Exp. Algorithmics, vol. 5, p. 1, 2000.

[22] C. Giri, S. K. Roy, B. Banerjee, and H. Rahaman, “Scan chain design targeting dual power and delay optimization for 3d integrated circuit,” Advances in Computing, Control, and Telecommunication Technologies, International Conference on, vol. 0, pp. 845–849, 2009.

[23] “Taiwan integrated circuit computer-aided design (ic/cad) contest, http : //cad contest.cs.nctu.edu.tw/cad11/,” 2011.

[24] 'International Technology Roadmap for Semiconductors (ITRS)', http://www.itrs.net/

[25] Kahle, J. and Day, M. and Hofstee, H. and johns, C. and Maeurer, T.and Shippy, D.: 'Introduction to the Cell multiprocessor', IBM Journal of Research and Development, July, 2005, 49, (4/5), pp. 589-604

[26] Hung, W.-L. and Link, G. M. and Xie, Y. and Vijaykrishnan, N. and Irwin, M. J.:

'Interconnect and Thermal-aware Floorplanning for 3D Microprocessors'. Proc. Int.

Symposium on Quality Electronic Design, Washington, DC, USA, 2006, pp. 98-104 [27] Puttaswamy, K.: 'Thermal Analysis of a 3D Die-Stacked High-Performance Microprocessor'. Proc. GLSVLSI, 2006

[28] Bakir, M.S. and King, C. and Sekar, D. and Thacker, H. and Dang, B. and Huang, G. and Naeemi, A. and Meindl, J.D.: '3D heterogeneous integrated systems: liquid cooling, power delivery, and implementation'. Proc. IEEE Custom Integrated Circuits Conference, 2008

[29] Cong, J. and Wei, J. and Zhang, Y.: 'A thermal-driven _oorplanning algorithm for 3D ICs'. Proc. IEEE Int. Conf. on Computer-Aided Design, 2004

[30] Cong, J. and Zhang, Y.: 'Thermal via planning for 3-D ICs'. Proc. IEEE Int. Conf.

on Computer-Aided Design, 2005

[31] Raje, S. and Sarrafzadeh, M.: 'Variable voltage scheduling'. Proc. Int. symposium on Low power design, Dana Point, California, United States, 1995, pp. 9-14

[31] Zhou, X. and Xu, Y. nd Du, Y. and Zhang, Y. and Yang, J.: 'Thermal management for 3D processors via task scheduling'. Proc. Int. Conf. on Parallel Processing, 2008

[32] Sun, C. and Shang, L. and Dick, R.P.: 'Three-dimensional multi-processor system-on-chip thermal optimization'. Proc. Int. Conf. Hardware/Software Codesign and System Synthesis, 2007

[33] Chandrakasan, A. and Brodersen, R.: 'Low Power Digital CMOS Design'.

Kluwer Academic Publishers, 1995

[34] Benini, L. and Bogliolo, A. and Micheli, G. D.: 'A survey of design techniques for system-level dynamic power management', IEEE Transactions on VLSI systems, June, 2000, pp. 299-316

[35] Hong, I. and Kirovski, D. and Qu, G. and Potkonjak, M. and Srivastava, M.:

'Power optimization of variable voltage core-based systems'. Proc. IEEE Design Automation Conference, 1998

[36] Yao, F., Demers, A. and Shenker, S.: 'A scheduling model for reduced CPU energy'. Proc. Int. Foundations of Computer Science, Oct 1995, pp. 374-382

[37] Ishihara, T. and Yasuura, H.: 'Voltage scheduling problem for dynamically variable voltge processors'. Proc. Int. Symposium on Low Power Electronics and Design, 1998

[38] Zhang Y. and Hu X. and Chen D.: 'Task scheduling and voltage selection for energy minimization'. Proc. IEEE Design Automation Conference, 2002

[39] Lin, Y.-R., Hwang, C.-T. and Wu, Allen C.-H.: 'Scheduling techniques for variable voltage low power designs', ACM Trans. Des. Autom. Electron. Syst., 1997,

2, (2), pp. 81-97

[40] Wu, C.-B., Lin, Y.-L.: 'Energy-Ef_cient Task Scheduling for DVFS-Multi-Core 3D IC'. Master thesis, National Tsing Hua University, 2010

[41] Gajski, D. D. and Dutt, N. D. and Wu, A. C.-H. and Lin, S. Y.-L.:

'Scheduling','High-Level Synthesis: Introduction to Chip and System Design'(Springer, 1992, 1st edn.), pp. 213-258

[42] Adam, T. L. and Chandy, K. M. and Dickson, J. R.: 'A comparison of list schedules for parallel processing systems'. ACM, Commun. 1974, 17, (12), pp.

685-690

[43] 'Standard Task Graph Set, Kasahara Laboratory Department of Electrical, Electronics and Computer Enginnering, Waseda University', http://www.kasahara.elec.waseda.ac.jp/schedule/index.html

[44] Chandrakasan, A. P. and Potkonjak, M. and Mehra, R. and Rabaey, J. and Brodersen, R.: 'Optimizing power using transformations', IEEE Trans. on Computer-Aided Design of Integrated Circuits and Systems., Jan, 1995, 14, (1), pp.

12-31

國科會補助計畫衍生研發成果推廣資料表

日期:2011/10/30

國科會補助計畫

計畫名稱: 子計畫五:應用在驗證與測試3D IC整合過程中以計算智慧為基礎的測試向量 產生方法(2/2)

計畫主持人: 溫宏斌

計畫編號: 99-2220-E-009-039- 學門領域: 晶片科技計畫--整合型學術研究 計畫

無研發成果推廣資料

99 年度專題研究計畫研究成果彙整表

其他成果

2. W.T. Chen, C.C. Chiang, 3D-MFH, University Booth, IEEE/ACM Design Automation Conference, 2011.

國科會補助專題研究計畫成果報告自評表

3.目前有四篇論文發表在國內會議中 (The 22nd VLSI Design / CAD Symposium 2 篇,2011 VLSI Test Technology Workshop)

培養學生在三維積體電路這一領域的專業知識,以期畢業後可為業界注入新血。

相關文件