An efficient heterogeneous tree multiplexer synthesis technique

(1)

Abstract—In this paper, a novel strategy for designing the heterogeneous tree multiplexer is proposed.The authors build the multiplexer delay model by curve ﬁtting and then formulate the heterogeneous tree multi-plexer design problem as a special type of optimization problem called mixed-integer nonlinear programming (MINLP).A new design parameter, the switch size in each stage, is introduced to improve the speed of the heterogeneous tree multiplexer.The proposed strategy can determine the multiplexer architecture and the switch size in each stage simultaneously. Three optimization methods are provided to synthesize the heterogeneous tree multiplexer according to the design speciﬁcations.

Index Terms—Capacitance, modeling, optimization, simulation, transis-tor sizing.

I. INTRODUCTION

High fan-in multiplexers are used extensively in many applica-tions, including the column decoders of memories [6], [8] and the resistor-chain digital-to-analog converters [7]. Traditionally, high fan-in multiplexers have been designed usfan-ing the uniform approach [9], in which all switches are interconnected with the output node in common, or the binary tree structure [10], which is a tree-like multi-stage structure. However, a more general architecture called the het-erogeneous tree multiplexer was recently proposed [1]. This new architecture has better speed performance than the traditional archi-tectures. Design strategies based on different kinds of switch topol-ogy, such as transmission gate and tri-state buffer, are also proposed in [1]–[3].

Alioto and Palumbo [2] presented a strategy for designing a hetero-geneous tree multiplexer, which uses minimally sized tri-state buffers as switches. It determines the optimum number of stages and the optimum number of switches per group in each stage such that the multiplexer delay is minimal. In [2], the optimum number of switches per group in each stage depends on the technology adopted. In the ex-ample presented in [2], the optimum number of switches per group in each stage is almost four, which is a power of two, as desired. However, changing the technology adopted may cause the number of switches per group in each stage not to be a power of two. Another assumption made by [2] is that the multiplexer delay is roughly independent of the gate sizes. However, the heterogeneous tree multiplexer delay can be improved by suitably selecting the multiplexer architecture, including the number of stages and the number of switches per group in each stage, at the same time as the switch size in each stage. This improvement will be experimentally demonstrated.

This paper introduces a new method that has several important ad-vantages over previous methods. The delay formula of the

hetero-Manuscriptreceived June 18, 2004; revised September 5, 2004 and November 10, 2004. This work was supported in part by the R.O.C. National Science Council under GrantNSC-91-2218-E-009-013. This paper was recom-mended by Associate Editor S. Sapatnekar.

The authors are with the Department of Electronics Engineering, National Chiao Tung University, Hsinchu, Taiwan 300, R.O.C. (e-mail: [email protected]. nctu.edu.tw; [email protected]; [email protected]).

Digital Object Identiﬁer 10.1109/TCAD.2005.852032

design parameters can always be obtained directly and no rounding is required. Another feature of the proposed approach is that the proper multiplexer architecture and the switch size in each stage can be determined simultaneously, useful in reducing the multiplexer delay. In [2], only the multiplexer architecture that minimizes the multiplexer delay could be determined. This paper provides three optimization methods—delay minimization, delay minimization under area con-straint, and area minimization under delay constraint— to help the designers determine the multiplexer architecture and the switch size in each stage according to the design speciﬁcations.

The restof this paper is organized as follows. Section II de-ﬁnes the problem, Section III describes the design strategies, Sec-tion IV presents the experimental results, and SecSec-tion V draws the conclusions.

II. PROBLEMFORMULATION

The structure of the heterogeneous tree multiplexer is shown in Fig. 1. The multiplexer has N inputs and is partitioned into k stages. In the ith stage, the switches are combined into groups of S_ielements with a common output node, which represents an input to the subse-quent stage. This paper considers the case in which the switches are implemented using a tri-state buffer (Fig. 2).

The ﬁrststage includes N/S₁groups of tri-state buffers and N/S₁ outputs. The second stage includes N/S₁S₂ outputs. The kth stage includes N/S₁S2. . . Sk outputs. The last stage has a single output that represents the multiplexer output, so the following constraint can be derived

k

i=1

S_i= N. (1)

In this paper, itwill be assumed thatthe delay of the address decoder of the multiplexer can be neglected. The delay of the address decoder can be reduced by careful circuitdesign, for example, using superbuffer or bipolar complimentary metal– oxide–semiconductor (BiCMOS) drivers [9], [10].

The delay of the heterogeneous tree multiplexer depends strongly on the number of stages and the number of switches per group in each stage, so the problem as defined in previous approaches [1], [2] is to determine the optimal number of stages and the optimal number of switches per group in each stage such that the multiplexer delay is minimal. In [1]–[3], the switch size is kept constant and assumed to be minimal. Here, another parameter— the switch size in each stage—is introduced. This paper will consider the multiplexer architecture, which includes the number of stages and the number of switches per group in each stage, at the same time as the switch size in each stage. The problem will be defined as follows. The problem is to determine the multiplexer architecture and the switch size in each stage such that the multiplexer meets the design specifications. Three optimiza-tion methods—delay minimizaoptimiza-tion, delay minimizaoptimiza-tion under area 0278-0070/$20.00 © 2005 IEEE

(2)

Fig. 1. Heterogeneous tree multiplexer architecture.

Fig. 2. Tri-state buffer.

constraint, and area minimization under delay constraint—are pro-vided to synthesize automatically the heterogeneous tree multiplexer such that the multiplexer meets the design speciﬁcations.

III. PROPOSEDDESIGNSTRATEGY

This section introduces the proposed design strategy. The delay model to be used is ﬁrst considered and then the concept of MINLP is introduced. Next, the path delay model and the load capacitance model in each stage of the heterogeneous tree multiplexer are constructed, and then the delay model of the heterogeneous tree multiplexer is derived. Finally, the objective functions and the constraint functions of the MINLP for the design problem of the heterogeneous tree multiplexer are proposed.

It is easy to show by exhaustive search that there exists a stage number k such that the heterogeneous tree multiplexer meets the design specifications. Since k is a small integer, the proposed strategy will find the optimal value of k by exhaustive search. That is, every possible value of k will be tried in solving MINLP and then the optimal value of k will be selected such that the heterogeneous tree multiplexer meets the design specifications.

A. Delay Modeling Approach

Mostdelay modeling approaches, such as those used for standard cell characterization, estimate the delay of a gate for a given input

transition time and output load capacitance, with the sizes of the transistors inside the gate being kept constant. However, in order to improve the speed performance of the heterogeneous tree multiplexer, the authors would like to consider the effects of varying the transistor sizes and ﬁnd out a strategy to determine appropriate gate size for each stage of the heterogeneous tree multiplexer. The delay model applied in this paper is the convex delay model proposed in [4] and [5]. This model allows us to capture the effects of varying the transistor sizes on the gate delay. The general form of the delay model is given by

Delay = m j=1 P_j× n i=1 x_i + c_ijβij+ C. (2)

Here, the xi’s are characterization variables and the cij’s, βij’s, C, and P_j’s are referred to collectively as characterization constants. m is the parameter used to increase the characterization flexibility and nis the number of characterization variables. The parameter is set to either −1 or 1, depending on the variable. For example, the fall delay increases as the output load capacitance C_L, the positive-channel metal–oxide–semiconductor (PMOS) transistor width W_p, and the input time constant τ are increased, and decreases as the negative-channel metal–oxide–semiconductor (NMOS) transistor width W_nis increased, implying that the value of for the first three variables must be 1 and that for W_nshould be−1. The problem of characterization is to determine appropriate values for the characterization constants. Due to the curve-fitting nature of the characterization procedure, it is not possible to describe direct physical meanings of the characterization constants.

A two-step methodology is used to complete the characterization. In the ﬁrst step, a number of circuit simulations are performed to collect the experimental data using the HSPICE circuit simulator. In the second step, a least-squares procedure is used to ﬁt the data points to (2). The characterization constants are determined by solving the following nonlinear program that minimizes the sum of the squares of the errors over all data points.

N

i=0

(3)

subjectto c_Li f_i(x) c_Ui, i = 1, 2, . . . , m b_Lj g_j(x) b_Uj, j = 1, 2, . . . , p x_Lk x_k x_Uk, k = 1, 2, . . . , n xe∈ Z, xe⊆ xk

where f₀(x)is the objective function, f_i(x)are the nonlinear con-straint functions, g_j(x)are the linear constraint functions, x_kare the variables, c_Ui, b_Uj, and x_Ukare the upper bounds, and c_Li, b_Lj, and x_Lkare the lower bounds.

C. Design Strategy I

This section presents design strategy I, which only determines the multiplexer architecture and keeps the switch size minimal.

1) Delay of Heterogeneous Tree Multiplexer: The switch delay model associated with design strategy I is deﬁned as follows. As the equation shows, the switch delay depends only on the load capacitance of the switch. Switch Delay = m j=1 P_j(C + c_j)βj _{+ q} ₍₄₎

where C is the load capacitance of the switch and P_j, c_j, β_j, and q are the characterization constants.

The path delay of the multiplexer is the sum of the switch delay in each stage of the heterogeneous tree multiplexer, so the path delay model of the heterogeneous tree multiplexer can be obtained as

Path Delay = k i=1 _m j=1 P_j(C_i+ c_j)βj _{+ q} (5)

where C_iis the load capacitance of stage i and P_j, c_j, β_j, and q are the characterization constants.

The switch associated with stage i drives the output capacitance Coutof the other (Si− 1)OFFswitches that belong to the same group, as well as the input capacitance C_inof the switch in the subsequent stage (see Fig. 3). Hence, the load capacitance of stage i is deﬁned as

C_i= (S_i− 1)C_out+ C_in (6)

where C_outis the output capacitance of the switch and C_inis the input capacitance of the switch.

2) Design Optimization: This section presents the objective func-tions and constraint funcfunc-tions of the MINLP for three different opti-mization methods.

Substituting (6) into (5) yields the multiplexer delay equation k i=1 _m j=1 Pj(((Si− 1)Cout+ Cin) + cj)βj + q . (7)

The area of the multiplexer is deﬁned as the number of switches in the multiplexer N + _N S₁ + _N S₁S₂ +· · · + N S₁S₂· · · S_k−1 . (8)

The basic constraints are as

Si≥ 2 and Si∈ Z, i = 1, 2, . . . , k. (9) • Optimization Method I: The multiplexer architecture whose delay is minimal can be obtained using this optimization method. The objective function of MINLP is deﬁned by (7), and the constraint functions of MINLP are deﬁned by (1) and (9).

• Optimization Method II: The multiplexer architecture whose delay is minimal under area constraint can be obtained using this optimization method. The objective function of MINLP is deﬁned by (7), and the constraint functions of MINLP are deﬁned by (1), (8), and (9).

• Optimization Method III: The multiplexer architecture whose area is minimal under delay constraint can be obtained using this optimization method. The objective function of MINLP is deﬁned by (8), and the constraint functions of MINLP are deﬁned by (1), (7), and (9).

D. Design Strategy II

This section presents the design strategy II, by which the multiplexer architecture and the switch size in each stage of the multiplexer are determined simultaneously. The switch size can be different in each stage of the heterogeneous tree multiplexer.

1) Delay of Heterogeneous Tree Multiplexer: The switch delay model associated with design strategy II is deﬁned as follows. The switch delay depends on both the load capacitance and the switch size as Switch Delay = m j=1 P_j(W−1+ c_1j)β1j_{(C + c} 2j)β2j+ q (10)

where W is the NMOS transistor width of the switch, C is the load capacitance of the switch, and P_j, c_1j, β_1j, c_2j, β_2j, and q are the characterization constants.

The path delay model of the heterogeneous tree multiplexer can be obtained as Path Delay = k i=1 _m j=1 P_jW_i−1+ c_1jβ1j(C_i+ c_2j)β2j_{+ q} (11) where W_iis the NMOS transistor width of the switch in stage i, C_iis the load capacitance of stage i, and P_j, c_1j, β_1j, c_2j, β_2j, and q are the characterization constants.

(4)

The load capacitance of stage i is deﬁned as

C_i= (S_i− 1)C_out_i+ C_in_i+1 (12) where C_out_i is the output capacitance of the switch in stage i and C_in_i+1is the input capacitance of the switch in stage i + 1.

The gate capacitance and the source/drain capacitance of the metal– oxide–semiconductor ﬁeld-effect transistor (MOSFET) are proportional to the channel width of the MOSFET, so the input/output capacitance of the tri-state buffer is deﬁned as

C_out_i= aW_i+ b (13)

C_in_i+1= cW_i+1+ d (14)

where a, b, c, and d are the characterization constants and W_i and Wi+1are the NMOS transistor width of switches in stages i and i + 1, respectively.

2) Design Optimization: This section presents the objective func-tions and constraint funcfunc-tions of the MINLP associated with three different optimization methods.

Substituting (12) into (11) yields the multiplexer delay equation k i=1 _m j=1 P_jW_i−1+ c_1jβ1j(((S_i− 1)(aW_i+ b) + (cW_i+1+ d)) + c_2j)β2j_{+ q} . (15)

The area of the multiplexer is deﬁned as the sum of the NMOS transistor widths of the switches in the multiplexer given as

N W1+ _N S₁ W2+ _N S₁S₂ W3+· · · + N S₁S₂· · · S_k−1 Wk. (16) In the experiment, the range of NMOS transistor widths is 0.3–3 µm. In order to obtain reasonable design parameters, 10Wimust be a positive integer. So the constraints shown in (17) at the bottom of the page were obtained.

• Optimization Method I: This optimization method can determine the multiplexer architecture and the switch size in each stage such that the multiplexer delay is minimal. The objective function of MINLP is deﬁned by (15), and the constraint functions of MINLP are deﬁned by (1) and (17).

• Optimization Method II: This optimization method can deter-mine the multiplexer architecture and the switch size in each stage such that the multiplexer delay is minimal under area constraint. The objective function of MINLP is deﬁned by (15), and the constraint functions of MINLP are deﬁned by (1), (16), and (17).

• Optimization Method III: This optimization method can deter-mine the multiplexer architecture and the switch size in each stage such that the multiplexer area is minimal under delay constraint. The objective function of MINLP is deﬁned by (16), and the constraint functions of MINLP are deﬁned by (1), (15), and (17).

TABLE I

DELAYMODELPARAMETERSASSOCIATEDWITHDESIGNSTRATEGYI

TABLE II

SWITCHINPUT/OUTPUTCAPACITANCEASSOCIATED WITHDESIGNSTRATEGYI

IV. EXPERIMENTALRESULTS

This section presents the experimental results obtained using the proposed approach. A 256-input multiplexer is designed and simulated using the 0.18-µm CMOS process. In this experiment, the parameter m is setto 1. The outputloading of the 256-inputmultiplexer is 0.003 pF. The input signal for switch characterization is a rising signal, which rises from 0 to 1.8 V during 0.1 ns. The input signals for the simulation of the multiplexer path delay are a rising signal, which rises from 0 to 1.8 V during 0.1 ns, and a falling signal, which falls from 1.8 to 0 V during 0.1 ns. The PMOS/NMOS ratio is 3.57:1, which is set to equalize the rising and falling times. The experiment has two parts. In part A, the multiplexer is designed according to strategy I. In part B, the multiplexer is designed according to strategy II. In each part, three different optimization methods are applied. The ﬁrst is delay minimization, which is the same as the previous approach [2]. The second is delay minimization under area constraint, and the third is area minimization under delay constraint.

An Intel/Pentium-IV 2.4-GHz personal computer (PC) with 512 MB of random access memory (RAM) was used to perform the experiment. The curve-ﬁtting problem was solved using MINOS [11]–[13]. The MINLP problem was solved using BARON [14], [16], [17], which is a global solver and can guarantee that a global optimal solution to the problem is found.

A. Experimental Results Obtained Using Design Strategy I

Table I shows the switch delay model parameters. Table II shows the input and output capacitances of the switch. Table III shows the experimental results of the three optimization methods. Fig. 4 shows the delay/area tradeoff curve. Table IV shows the parameters associated with Fig. 4.

A two-step methodology is used to complete the switch characteri-zation. In the first step, a number of circuit simulations are performed using HSPICE to collect 37 data points. The NMOS width is fixed at 0.3 µm. The load capacitance ranges from 0.004 to 0.04 pF in steps of 0.001 pF. In the second step, a least-squares procedure is used to fit the data points to (4).

As the experimental results in Table III show, the predicted delay agrees well with the simulated result. The maximum error between the estimated result and the HSPICE simulation result is less than 5%. Several similar optimal results using optimization method I were obtained. They are k = 3, (S₁, S₂, S₃) = (4, 8, 8), (8, 8, 4), (8, 4, 8)

(5)

Fig. 4. Area versus delay curve obtained using design strategy I. TABLE IV

PARAMETERSASSOCIATEDWITHFIG. 4

and k = 4, (S1, S2, S3, S4) = (4, 4, 4, 4). The estimated delay for the ﬁrst three results is 0.5119 ns, and the estimated delay for the last result is 0.5208 ns. The HSPICE simulation results of the four optimal solutions are also similar. So k = 4 and S₁= S₂= S₃= S₄= 4were selected, which are the same as the result of the previous approach [2], [3], as the solution of optimization method I. Using op-timization method II, the multiplexer architecture that yields minimal delay under the area constraint of 290 switches is k = 3, S₁= 16and S₂= S₃= 4. According to optimization method III, the multiplexer architecture that yields minimal area under the delay constraint of 0.65 ns is k = 2, S₁= S₂= 16. The experimental results reveal that the proposed approach can effectively determine the multiplexer architecture according to the design speciﬁcations.

B. Experimental Results Obtained Using Design Strategy II

Tables V and VI show the switch delay model parameters and the switch input/output capacitance model parameters, obtained by curve

TABLE V

DELAYMODELPARAMETERSASSOCIATEDWITHDESIGNSTRATEGYII

TABLE VI

SWITCHINPUT/OUTPUTCAPACITANCEMODELPARAMETERS ASSOCIATEDWITHDESIGNSTRATEGYII

ﬁtting, respectively. Table VII shows the experimental results of the three optimization methods. Fig. 5 shows the delay/area tradeoff curve. Table VIII shows the parameters associated with Fig. 5.

A two-step methodology is used to complete the switch characteri-zation. In the ﬁrst step, a number of circuit simulations are performed using HSPICE to collect 1260 data points. The size of NMOS ranges from 0.3 to 10.3 µm in steps of 0.5 µm. The load capacitance ranges from 0.004 to 0.594 pF in steps of 0.01 pF. In the second step, a least-squares procedure is used to ﬁt the data points to (10).

The same two-step methodology is used to complete the character-ization of switch input/output capacitances. In the ﬁrst step, ten data points for each switch input and output capacitances are collected separately using HSPICE. The size of NMOS ranges from 0.3 to 3 µm in steps of 0.3 µm. In the second step, a least-squares procedure is used to ﬁt the data points to (13) and (14) and then the parameters for switch output and input capacitances can be obtained.

Alioto and Palumbo [2] and Alioto et al. [3] use only minimally sized switches and do not consider the transistor sizing problem. In this experiment, the authors try to show that the delay of the heterogeneous tree multiplexer can be improved by considering the multiplexer architecture and the switch size in each stage simultaneously. As the experimental result shown in the ﬁrst row of Table III, the optimal multiplexer delays for rising inputand falling inputare 0.5278 and 0.5338 ns, which are obtained by considering the multiplexer ar-chitecture only and keeping the switch size in each stage minimal. As the experimental result shown in the ﬁrst row of Table VII, the optimal multiplexer delays for rising input and falling input are 0.4595 and 0.4529 ns, which are obtained by considering the multiplexer

(6)

TABLE VII

EXPERIMENTALRESULTS OFDESIGNSTRATEGYII

Fig. 5. Area versus delay curve obtained using design strategy II.

architecture and the switch size in each stage simultaneously. The minimal multiplexer delays for rising input and falling input can be reduced by almost 15% and 18%, respectively, if the multiplexer architecture and the switch size in each stage are suitably selected using the optimization method I of strategy II.

As the experimental results in Table VII show, the predicted delay agrees well with the simulated result. The maximum error between the estimated result and the HSPICE simulation result is less than 5%. In optimization method I, as desired, the minimal multiplexer delay can be obtained by determining the multiplexer architecture and the switch size in each stage simultaneously. Using optimization methods II and III, the multiplexer architecture and the switch size in each stage, meeting the design constraints, can be obtained.

Table IX shows the detail run time of the experiment. “(number)” is used to represent the run time for ﬁnding an infeasible solution, meaning that no solution that satisﬁes the constraints exists. The time required for solving the MINLP problem depends on the MINLP solver. Several MINLP solvers were tried, including minlpBB, glc-Solve, OQNLP, SBB [15], and BARON [16], [17]. After comparing their performance and run time, BARON [16], [17] is used as the solver of the proposed MINLP problem. SBB [15] is also a good choice for solving the MINLP problem although it does not guarantee that a global optimal solution is found. In the experiment, SBB [15] always yields the same result as BARON [16], [17]. They both provide very good optimization results in a reasonable time. The optimization result using design strategy I can be obtained in a short time. The run

TABLE VIII

PARAMETERSASSOCIATEDWITHFIG. 5

time of design strategy II is larger than design strategy I, but it is also reasonable. As the experimental results for design strategies I and II reveal, the optimal k value is between 2 and 4. The run time for these solutions in Table IX is less than 1 min.

C. Discussion

In this experiment, the PMOS/NMOS ratio is set to balance the rising delay and the falling delay, and a single delay model is applied to predict the rising and falling delays of the multiplexer obtained by HSPICE simulation. In order to extend the proposed approach to handle various PMOS/NMOS ratios, the rising and falling delays need to be modeled separately. The authors propose a method to model the rising and falling delays separately as follows. The switch is separately characterized for rising and falling inputs, and the delay model parameters for rising and falling inputs are separately obtained. To obtain the rising delay model of the multiplexer, the switch rising delay model is used in the odd stages of the multiplexer and the switch falling delay model is used in the even stages of the multiplexer.

(7)

To obtain the falling delay model of the multiplexer, the switch falling delay model is used in the odd stages of the multiplexer and the switch rising delay model is used in the even stages of the multiplexer. A parameter max _delay is deﬁned to handle separately the rising and falling delays. Two additional constraint functions are deﬁned as

max_delay > the rising delay model of the multiplexer max_delay > the falling delay model of the multiplexer. For optimization methods I and II, the parameter max _delay is used as the objective function instead of the original single delay model. For optimization method III, the parameter max _delay is unnecessary and the delay constraints are set separately for the rising and falling delay models of the multiplexer.

In this experiment, input drivers connected to the first stage of the multiplexer are not considered. When input drivers are connected to the first stage of the multiplexer, the delay due to the input drivers and the input capacitance of the multiplexer can be determined by the following method. First, the delay model of the input driver must be determined by curve fitting or from the cell library databook. Then, the input capacitance of the first stage of the multiplexer, which is the load capacitance of the input driver, is determined by the method proposed in this paper. The delay of the input driver is determined using the delay model of the input driver and the input capacitance of the first stage of the multiplexer, which is the load capacitance of the input driver. The total path delay is the sum of the delay of the input driver and the delay of the multiplexer. The authors use an example to explain it. When minimized-size tri-state buffers are used as input drivers and are connected to the first stage of the multiplexer shown in the first row of Table VII, the total path delay can be determined as follows. The delay model parameters of the minimized-size tri-state buffer are shown in Table I. The load capacitance of the input driver is 0.02196 pF, which is determined using (14). The estimated delay of the input driver is 0.224 ns, which is obtained using (4). The estimated delay of the multiplexer is 0.4669 ns, which is shown in the first row of Table VII. The total estimated delay is 0.6909 ns. The HSPICE simulated result is 0.7119 ns. The error is 3%. The method can also be applied to other types of input driver, for example, inverters, by applying the associated delay model of the input drivers.

In Section IV-B, the simultaneous approach, which simultaneously optimizes multiplexer architecture and switch size, is applied to op-timize the multiplexer. Another alternative is the sequential approach.

Initially, design strategy I is used to determine the multiplexer architec-ture, k and S_i. Then, k and S_iare substituted into the MINLP model of design strategy II and the MINLP problem is solved to determine the transistor size in each stage of a multiplexer. A simple experiment is performed to show the run time and performance of the sequential approach. The authors apply the transistor sizing to the multiplexer shown in the ﬁrst row of Table III. The run time for transistor sizing is 2.47 s. The optimal size W_i= (3, 1.5, 0.9, 0.6). The rising delay and falling delay obtained by HSPICE simulation are 0.4694 and 0.4534 ns, respectively. The experimental results reveal that the result obtained using the sequential approach is slightly worse than the result obtained using the simultaneous approach, which is shown in the ﬁrst row of Table VII, but the run time of the sequential approach is less than the run time of the simultaneous approach.

The proposed method has some advantages over the earlier method [2], [3]. The first is that it offers more flexibility in that it can be applied to design various types of multiplexer. The earlier method [2], [3] can be applied to some special multiplexers, whose inputs are powers of 4 or 2. The fourth row in Table VII presents the results of optimizing a 400-input multiplexer using design strategy II. The second advantage of the method is that it can be applied to various processes because the switch delay model is characterized using HSPICE simulation data. The third advantage of the method is that it can be used to size any specified architecture of a multiplexer. For example, if a 400-input multiplexer with an architecture of k = 4 and S_i= (4, 4, 5, 5) needs to be sized, k and S_iare substituted into the MINLP model of design strategy II and then the MINLP problem is solved. The optimal switch size in each stage can be easily obtained. The optimal size W_i= (3, 1.5, 0.9, 0.6)and the estimated delay of the multiplexer is 0.5088 ns.

V. CONCLUSION

In this paper, a novel design strategy for designing a high fan-in heterogeneous tree multiplexer is proposed. Curve ﬁttfan-ing is used to build the multiplexer delay model, and then the heterogeneous tree multiplexer design problem is formulated as a special type of optimization problem called mixed-integer nonlinear programming (MINLP). The proposed design strategy can help designers deter-mine the multiplexer architecture and the switch size in each stage according to the design requirement, such as delay minimization, delay minimization under area constraint, or area minimization under delay constraint. MINLP enables the optimal choice of design parameters to

(8)

be obtained directly. The proposed approach is applicable to different process technologies. Another important advantage is that the mul-tiplexer delay can be improved by suitably selecting the mulmul-tiplexer architecture and the switch size in each stage simultaneously.

ACKNOWLEDGMENT

The authors would like to thank Prof. T. Westerlund of Abo Akademi University for his constructive suggestions about solving the MINLP problems. The anonymous referees are also appreciated for their valuable comments.

REFERENCES

[1] M.-B. Lin, “On the design of fast large fan-in CMOS multiplexers,” IEEE

Trans. Comput.-Aided Des. Integr. Circuits Syst., vol. 19, no. 8, pp. 963–

967, Aug. 2000.

[2] M. Alioto and G. Palumbo, “Optimized design of high fan-in multiplexers using switches with driving capability,” in Proc. 8th IEEE Int. Conf.

Elec-tronics, Circuits and Systems (ICECS), St. Julians, Malta, 2001, pp. 737–

740.

[3] M. Alioto, G. Di Cataldo, and G. Palumbo, “Optimized design of high fan-in multiplexers using tri-state buffers,” IEEE Trans. Circuits Syst. I,

Fundam. Theory Appl., vol. 49, no. 10, pp. 1500 –1505, Oct. 2002.

[4] M. Ketkar, K. Kasamsetty, and S. Sapatnekar, “Convex delay models for transistor sizing,” in Proc. 37th Design Automation Conf. (DAC), Los Angeles, CA, 2000, pp. 655–660.

[5] K. Kasamsetty, M. Ketkar, and S. Sapatnekar, “A new class of convex functions for delay modeling and its application to the transistor sizing problem,” IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., vol. 19, pp. 779–788, Jul. 2000.

[6] R. T. Howe and C. G. Sodini, Microelectronics: An Integrated Approach. Englewood Cliffs, NJ: Prentice-Hall, 1997.

[7] D. A. Johns and K. Martin, Analog Integrated Circuit Design. New York: Wiley, 1997.

[8] J. Rabaey, Digital Integrated Circuits (A Design Perspective). Engle-wood Cliffs, NJ: Prentice-Hall, 1996.

[9] S. Kang and Y. Leblebici, CMOS Digital Integrated Circuits: Analysis and

Design. New York: McGraw-Hill, 1996.

[10] N. Weste and K. Eshraghian, Principles of CMOS VLSI Design: A System

Perspective, 2nd ed. Reading, MA: Addison-Wesley, 1993.

[11] K. Holmstrom and A. Goran, User’s Guide for Tomlab v4.0.5. San Diego, CA: Tomlab Optimization Inc., Feb. 28, 2003.

[12] K. Holmstrom, A. Goran, and M. Edvall, TOMLAB/MINOS User’s Guide. San Diego, CA: Tomlab Optimization Inc., May 4, 2004.

[13] B. A. Murtagh and M. A. Saunders, MINOS 5.5 User’s Guide, Dept. Op-erations Res., Stanford Univ., Stanford, CA, Rep. SOL 83-20R (Revised Jul. 1998)

[14] A. Brooke, D. Kendrick, A. Meeraus, R. Raman, and R. E. Rosenthal,

GAMS: A User’s Guide. Washington, DC: GAMS Develop. Corp., 1998.

[15] GAMS/SBB Solver Manual, GAMS Develop. Corp., Washington, DC, Apr. 30, 2002.

[16] GAMS/BARON Solver Manual, GAMS Develop. Corp., Washington, DC, Nov. 25, 2003.

[17] N. V. Sahinidis, BARON: Branch and Reduce Optimization Navigator,

User’s Manual. Urbana–Champaign, IL: Dept. Chem. Eng., Univ.

Illi-nois Urbana–Champaign, 2000.

Evaluating the Reliability ofNAND Multiplexing With PRISM

Gethin Norman, David Parker, Marta Kwiatkowska, and Sandeep Shukla

Abstract—Probabilistic-model checking is a formal veriﬁcation tech-nique for analyzing the reliability and performance of systems exhibiting stochastic behavior.In this paper, we demonstrate the applicability of this approach and, in particular, the probabilistic-model-checking tool PRISM to the evaluation of reliability and redundancy of defect-tolerant systems in the ﬁeld of computer-aided design.We illustrate the technique with an example due to von Neumann, namelyNANDmultiplexing.We show how, having constructed a model of a defect-tolerant system incorporating prob-abilistic assumptions about its defects, it is straightforward to compute a range of reliability measures and investigate how they are affected by slight variations in the behavior of the system.This allows a designer to evaluate, for example, the tradeoff between redundancy and reliability in the design.We also highlight errors in analytically computed reliability bounds, recently published for the same case study.

Index Terms—Defect-tolerant architectures, multiplexing, probabilistic-model checking, reliability.

I. INTRODUCTION

Probabilistic-model checking is a formal veriﬁcation technique, which has already been successfully used to analyze the performance and reliability of a wide range of real-life systems, including dynamic power-management schemes [1], embedded systems [2], computer networks, queueing systems, and manufacturing processes. It has also been used to study “quality of service” properties of real-time prob-abilistic communication protocols, such as IEEE 1394 FireWire [3], IEEE 802.3 CSMA/CD [4], Zeroconf [5], IEEE 802.11 wireless local area networks [6], and Bluetooth [7], and to verify both probabilistic security protocols (e.g., [8]) and randomized distributed algorithms (e.g., [9]).

In this paper we present results that demonstrate the advan-tages of using probabilistic-model checking and, in particular, the probabilistic-model-checking tool PRISM [10], to model and analyze defect-tolerant systems. We have chosen to investigate the reliability ofNANDmultiplexing [11], but this approach can be applied to other defect-tolerant systems such as R-fold Modular Redundancy [11] and Reconﬁguration [12], [13]. This work differs from the standard approaches in the literature to analyzing multiplexing in that we evaluate the reliability of speciﬁc cases as opposed to considering the general framework, and hence are not necessarily restricted by the analytical bounds of reliability of, for example, von Neumann [11] and Pippenger [14].

Our results demonstrate that, by applying probabilistic-model checking, it is straightforward to investigate the effect on reliability of slight variations in the behavior of the system’s components, for example the change in reliability as the probability of gate failure

Manuscriptreceived February 2, 2004; revised July 15, 2004. This work was supported by NSF grants CCR-0098335 and CCR-0340740, EPSRC grants GR/N22960 and GR/S11107, FORWARD, SRC, and DARPA/ITO supported PADS projectunder the PAC/C program. This paper was recommended by Associate Editor J. H. Kukula.

G. Norman, D. Parker, and M. Kwiatkowska are with the School of Computer Science, University of Birmingham, Birmingham, B15 2TT, U.K.

S. Shukla is with the Bradley School of Electrical and Computer Engineer-ing, Virginia Polytechnic Institute and State University, Blacksburg, VA 24060 USA.