Stochastic Collocation Based Statistical Expression Generator

3.3 Statistical Electro-Thermal Analyzer

3.3.2 Stochastic Collocation Based Statistical Expression Generator

Smolyak Sparse Grid Formulation

The primary advantage of Smolyak sparse grid formulation is to construct an interpolating polynomial of the multivariate function u ∈ C^r by using much less samples of the desired function than those of the full tensor product interpolation formula and the Monte Carlo method but still maintains an acceptable error bound [96, 97]. Here, C^ris the set of all functions which have continuous derivatives of all orders up to r. With the stochastic collocation technique, the statistical expression of the on-chip temperature distribution can be efficiently constructed.

The difference between Monte Carlo method and Smolyak sparse grid formulation is that the Monte Carlo method randomly generates the samples of random variables and, hence, requires a large number of samples for achieving an accurate estimate. On contrary to the Monte Carlo Method, the Smolyak sparse grid technique uses the roots of H-PCs or the extrema of Cheby-shev polynomial [97] to generate the samples of random variables and employs these fewer samples to effectively interpolate the desired solution. For a two-dimensional random variable, its possible sample sets of the Monte Carlo method and the Smolyak sparse grid formulation are illustrated in Figure 3.11.

According to the Smolyak sparse grid formulation, the on-chip temperature distribution can

Sampling Points of Monte Carlo

Sampling Points of Smolyak Sparse Grid

ξ

₁

ξ

₂

ξ

₁

ξ

₂

Figure 3.11: The number of sampling random variables comparison between the Monte Carlo method and the Smolyak sparse grid formulation. Here, the samples of Smolyak sparse grid are adopted for achieving a level two approximation.

be explicitly approximated as follows [96, 97].

Tb_q^N^KL(r, ξ)= X

q−N_KL+1≤|i|≤q

(−1)^q−|i| NKL−1 q − |i|

Qⁱ¹(T ) ⊗ · · · ⊗ Qⁱ^NKL(T ) . (3.39)

Here, NKL = Nt_ox + NL is the number of random variables in ξ, q = NKL + l, l ≥ 1 is the formulation level, and |i| = i1+ · · · + in + · · · + iN_KL. With level in ≥ 1, Qⁱⁿ is an interpolating polynomial of T (r, ξ) by only utilizing the random variable ξn, and ⊗ is the functional cross product. The level in is the index to decide the number of samples (mi_n) for the interpolating polynomial Qⁱⁿ. As suggested by [97], the relation between mi_n and in is that m₁ = 1 and mi_n = 2ⁱⁿ⁻¹+ 1 for in > 1.

From (3.39), only the corresponding temperature values of a small set of samples for ξ [97]

need to be known. This set is called the sparse grid and is equal to [97]

H(q, NKL)= [

q−N_KL+1≤|i|≤q

~ⁱ¹ × · · · × ~ⁱⁿ × · · · × ~ⁱ^NKL, (3.40)

where ~ⁱⁿ =nξ_i¹

n, · · · , ξ^m_i ⁱⁿ

ois the set of sample points used by Qⁱⁿ(T ), and the operator ‘×’ is the cross product of sets. The number of sample points from the Smolyak sparse grid formulation

Cdet· O

N^l_KL/l!

. Here, Cdetis the runtime complexity for performing the deterministic electro-thermal simulation once.

For a function having bounded derivatives up to order r, the Smolyak sparse grid formulation ensures a error bound, |El| = cN_KL,r· N_H^−r · log NH(r+1)(NKL−1)

[97]. Here, NH is the number of sample points in H (q, NKL), and cN_KL,r is a constant that only depends on NKL and r. In our experience, the accurate estimation of thermal yield profile can be obtained by setting the level lto be 1. Therefore, the number of sample points in the Smolyak sparse grid formulation can be much less than that of the Monte Carlo method.

An example with NKL = 2 and q = NKL+ 1 = 3 is given to illustrate the Smolyak sparse grid formulation. Since q − NKL + 1 ≤ |i| ≤ q, we have i1 = 1, i2 = 1 for |i| = 2, and i₁ = 1, i2 = 2 or i1 = 2, i2 = 1 for |i| = 3. Therefore, the numbers of sample values for random variables ξ₁ and ξ₂ are mi₁=1 = 1, mi₂=1 = 1 for |i| = 2, and mi₁=1 = 1, mi₂=2 = 3 or mi₁=2 = 3, mi₂=1 = 1 for |i| = 3, respectively. According to various values of i1 and i₂, the interpolating polynomial forms by individually utilizing each random variable at different levels can be determined. After that, the interpolating polynomial forms corresponding to ξ^T = [ξ₁, ξ₂] at different combined levels (i1, i₂) can be constructed by the functional cross product distribution excited by the point that belongs to the following sample set of ξ needs to be known.

Given ~¹ = {p¹₀} and ~² = {p²₀, p²₁, p²₂}, we have The sampling values of ~ⁱ for each level i must be properly decided. Adopting the roots of H-PCs with its order being corresponding to the level i can achieve the most accurate result as

Algorithm Temperature Profile Calculation for a Sample Point Input: A sampling point ξ^j, initial temperature T_ξⁱⁿⁱ_j and pd(r) Output: Temperature profile T (r, ξ^j)

1 Begin

†Any deterministic thermal simulators can be used to execute Line 9.

Here, the simulator stated in Chapter 2 is adopted.

Figure 3.12: Deterministic electro-thermal analysis for each sampling point, ξ^j, in sparse grid.

pleak, pd and p are the leakage, dynamic and total power density profiles for each sampling point, respectively.

ξ is a normal random vector [98]. Choosing the extrema of the Chebyshev polynomial with its order being corresponding to the level i can achieve the nested sparse grid structure, i.e. ~ⁱ ⊂ ~^k for i < k, for any levels and the acceptable accuracy [97]. In this work, we select the roots of H-PCs as the sampling values since the result is shown to be very accurate by using the low level approximation, and the nested sparse grid structure is still preserved for q= N^KL+ 1⁶. Temperature Profile Calculation for a Given Sample Point

After the sparse grid H (q, NKL) of ξ is obtained, the samples of channel length and oxide thickness in the m-th parameter modeling grid corresponding to the j-th sample point, ξ^j, of H (q, NKL) can be obtained by equations (3.15) and (3.16). Hence, the deterministic power den-sity profile corresponding to ξ^j can be obtained. With the deterministic power density profile,

6If the high order approximation is needed for the accuracy, we suggest to use the extrema of the Chebyshev polynomial because the nested sparse grid structure is preserved for any levels; hence, the number of sample points can be much less.

we have the following deterministic steady-state heat transfer equation

κ∇²T(r, ξ^j)= −p(r, ξ^j, T ), (3.42)

subject to the following boundary condition κ∂T (rb_s, ξ^j)

∂~nb_s

+ hb_sT(rb_s, ξ^j)= fb_s(rb_s). (3.43)

Here, p(r, ξ^j, T ) and T (r, ξ^j) are the deterministic power density and temperature profiles with respect to ξ^j, respectively. Since the power density profile in equation (3.42) is temperature dependent, a deterministic electro-thermal analysis procedure summarized in Figure 3.12 is built to obtain each T (r, ξ^j).

Temperature Profile Construction by Using Polynomial Interpolation

Instead of directly using equation (3.39) to obtain Qⁱ¹(T ) ⊗ · · · ⊗ Qⁱ^NKL(T ) for each different

|i| = i1 + · · · + iN_KL, we take the advantage of nested sparse grid structure and then perform the Newton interpolating method [98] to globally interpolate T (r, ξ).⁷ Based on the Newton interpolating formula, the approximated on-chip temperature at a specified position of the die, T(r^∗, ξ), can be expressed as

Tb(r^∗, ξ) =

j=N^H−1

j=0

ˆuj(r^∗)φj(ξ). (3.44)

Here, each φj(ξ) is an interpolating polynomial with respect to the j-th sampling vector ξ^j, and the form of each φj(ξ) can be found in [98]. NH = |H(q, NKL)| and |H (q, NKL)| is the number of the sampling vectors in sparse grid. Each ˆuj(r^∗) is an unknown coefficient which needs to be determined.

Based on the basic idea of interpolation that the approximation function must match each known data, the interpolated polynomial in (3.44) satisfies the following equation for each ξⁿ.

j=N^H−1

j=0

ˆuj(r^∗)φj(ξⁿ)= T(r^∗, ξⁿ). (3.45)

7For the sparse grid that does not preserve the nested structure, the Newton interpolating method can also be applied to obtain each Qⁱ¹(T ) ⊗ · · · ⊗ Qⁱ^NKL(T ).

Algorithm Stochastic Collocation Based Electro-thermal Analysis

Input: Geometries of the die; spatial correlation models of device channel length and oxide thickness; design informations such as .def,

.lef, and .lib files; package structure and leakage power models Output: Mean profile, variance profile, and the Smolyak sparse grid

interpolation formula, bT(r, ξ), of on-chip temperature distribution 1 Begin

2 Set thermal parameters and the initial average mean temperature, µⁱⁿⁱ_T , of the die by 1-D thermal model;

3 For m ← 1 to Ng

4 Obtain gL_m and gt_oxm of Lmand toxm by the KL expansion, respectively;

5 EndFor

6 Generate the Smolyak sparse grid, H (q, NKL), for the KL expanded random variables.

7 For n ← 0 to |H (q, NKL)| − 1

8 Obtain T (r, ξⁿ) by using the algorithm shown in Figure 3.12.

9 EndFor

10 Solve equation (3.46) to obtain the Newton interpolation formula in equation (3.44), and calculate the mean and variance profiles.

11 End

Figure 3.13: Stochastic Collocation Based Statistical Expression Generating Algorithm.

With the property of φj(ξ) described in [98], equation (3.45) can be rewritten as the following matrix form for finding each ˆuj(r^∗) at the chip position r^∗.

Each ˆuj(r^∗) can be calculated by using the forward substitution. After each ˆuj(r^∗) is calculated, the mean and variance profiles of the temperature distribution can be estimated as

The algorithm of the developed stochastic collocation based statistical expression generator is shown in Figure 3.13.

Gates Placement

Power density in a grid is obtained by summing the product of the accumulated area of same type of gates and the power value of that type with updated

Figure 3.14: Implementation of solving the deterministic heat transfer equations.

在文檔中超大型積體電路的熱分析技術 (頁 91-97)