• 沒有找到結果。

Analysis and design of Banyan and crossbar switches with bypass queues

N/A
N/A
Protected

Academic year: 2021

Share "Analysis and design of Banyan and crossbar switches with bypass queues"

Copied!
10
0
0

加載中.... (立即查看全文)

全文

(1)

computer

-I_.

.

-S

ELSEVIER

Computer Communications 20 (1997) 115- I24

Research

Analysis and design of Banyan and crossbar switches with bypass queues

Jau-Hsiung Huang*, Yo-Song Su

Communications cmd Multimedia hborarory Department of Computer Science and Information Engineering, Nutionul Taiwan Universifi. Taipei, Taiwan (Received 15 December 1994; revised 20 October 1995)

Abstract

Bypass queueing can overcome the problem of head of line blocking within an input queue. We study two different approaches for providing a bypass function in input buffers. They are look-through and look-back service disciplines, both of which can prevent out of sequence situations. Under uniform traffic, if the switch size is large, these two disciplines obtain almost the same throughput. However, under nonuniform traffic, especially a hot-spot traffic pattern, the look-through will achieve a much better throughput than look-back policy. This paper presents the throughput analysis and simulations for crossbar and Banyan switches under uniform and nonuniform traffic. Furthermore, to reduce contention across input queues, the parallel-plane ATM switch with bypass queues is also discussed and analyzed. Keywords: Bypass queueing; Look-through/Look-back policies; Throughput

1. Introduction

Although switches using output queueing obtain better throughput and delay performance than switches using input queueing, the hardware cost is higher and the problem of output contention is still unavoidable [l-5]. On the other hand, input queueing is simple in the hardware aspect, while it suffers from the problem of head of line blocking [6-l 11. This is the phenomenon where a packet in the input buffer is prevented from accessing an available output port because the packets ahead of it in the buffer are blocked. Hence, head of line blocking will reduce the throughput of the switch. This problem can be overcome by allowing other packets in the buffer to be transmitted when the leading packets are blocked. Such an input buffer is called a bypass queue [9,10,12,13].

The problem of output contention confines the throughput of crossbar switches. For Banyan switches, besides output contention, the problem of internal blocking also reduces the throughput. With bypass queueing, when the leading packet is blocked, the packet after it still has the chance to be transmitted. Hence, the throughput can be increased. However, a potential problem of bypass queueing is the out of sequence packets at the output port; thus, a special care should be provided in the design. This will be discussed in detail later.

* Email: [email protected]

0140-3664/97/$17.00 0 1997 Elsevier Science B.V. All rights reserved PZZ: 0140-3664(96)01171-l

As illustrated in Fig. 1, an N x N non-buffered ATM switch with bypass queues consists of N input buffers, N input port controllers, N output port controllers, and an N x N non-buffered fabric. The function of the input and output port controllers is to implement the bypass queueing function. The input port controller will allow a packet to be transmitted through the switch when the packet ahead of it is blocked. Moreover, outside the switch, the transmission line operates in time slots where one time slot is the time for the switch to transmit one ATM cell (i.e. 424 bits). Inside the switch, time is divided into packet slots, each of which contains several offer slots and one transmission slot, as

illustrated in Fig. 2 [lo]. One transmission slot is the time taken to transmit one ATM cell. In an offer slot, the packet routing tag is used to attempt to establish a path through the switch to the desired output port. Note that, because of the asynchronization between the packet and time slots, a 2-packet-deep buffer should be provided at each output port controller to synchronize packets with the time slot.

We further define the window size as the number of offer slots in a packet slot. At the beginning of the transmission slot, all packets which have successfully set up a path to their output ports during the offer slots are transmitted. If the switch is designed carefully, a log, N bit period is required for an offer slot. Hence, the overhead of offer slots will reduce the throughput of an ATM switch by a fraction of 424/(424 + f log2 N), where 424 is the ATM cell length and

f

is the window size of a packet slot.

(2)

116 J.-H. Huang, Y.-S. SdComputer Communications 20 (1997) 115-124

input input port output port buffers controllers controllers

-ru+O O- -W -O- nonbuffered i XII ATM switch -lxm--o 0-c

Fig. 1. Architecture of the ATM switch with bypass queue.

Clearly, a larger f provides more opportunities for

packets in the input buffer to be transmitted, and hence

can increase the switch throughput. On the other hand, a

larger f also incurs more overhead for each packet slot,

and hence will decrease the switch throughput. Therefore,

the value off should be carefully chosen to maximize the

switch throughput.

Before discussing policies to overcome the packet out

of sequence problem, we introduce definitions of nonuni-

form traffic patterns used in this paper. One of the non-

uniform traffic patterns considered is hot-spot traffic, with

h defined as the hot-spot ratio. This means that hp packets

are directed to the hot-spot port and (1 - h)p packets are

directed uniformly to other output ports, where p is the load

of an input port. Another nonuniform traffic pattern consid-

ered is the mixed single-source-single-destination (SSSD)/

uniform traffic pattern [ 141. This means that a specific input

port sends all its packets to a single output port, but packets

on other input ports are directed uniformly to the other out-

put ports. Such traffic might exist if the network supports

leased line services.

In the following we describe two policies to resolve

the out of sequence problem and to enhance the throughput.

With the

look-back

policy, when the window size is

f,

the

input port controller will examine f packets at most in

the front of the input buffer. Whenever a packet in the

bypass window is allowed to be transmitted through

the switch, the input port controller must check (or

look back)

whether any of the leading packets is directed to the

same output port. If it does, the foremost packet with the

same output address is transmitted in the transmission slot

to avoid the problem of packets arriving out of sequence.

With the

look-through

policy, it is reasonable to assume

that if a packet is blocked, then packets after it with the

same output address will very likely be blocked in another

offer slot, too. Based on this assumption, if the input port

controller can skip (or

Eook through) the

packets which are

directed to the same output ports as the leading packets, we

q

offer

slot

- transmission slot A I 1 1 I

Fig. 2. Structure of a packet slot.

‘x x x x

(a)

(b)

Fig. 3. Behavior of look-through and look-back policies when f = 4. (a) Look-back policy; (b) look-through policy. (‘x’ means attempt failure, ‘v’ means attempt success, ‘-’ means ignore).

expect a better throughput. With look-through policy, when

the window size isf, the input port controller will examine

f

packets at most, but thesef packets are not necessarily the

first

f

packets in the input buffer. It will be shown later

that the look-through policy obtains a much higher

throughput than the look-back policy under the hot-spot

traffic pattern.

In Fig. 3 we give an example for each policy with the

window size f equal to 4. A look-back policy is shown in

Fig. 3(a), where all attempts of the first four packets are

failed. Note that there are two packets destined to ‘6’ in

the attempt window. Since the first ‘6’ is failed, there is a

good chance that the second ‘6’ will also fail. In Fig. 3(b)

a look-through policy is shown, where a successful attempt

is found in the sixth buffer position. Note that in this figure

there are two ‘8’s in the first and fifth positions and two ‘6’s

in the second and fourth positions. Since both the first ‘8’

and ‘6’ are failed in their attempts, both the second ‘8’ and

‘6’ are skipped in the following attempts. Instead, the ‘5’ in

the sixth position is attempted and succeeds. This example

shows that the look-through policy can achieve a higher

throughput than the look-back policy.

To implement the look-back policy, cf - 1) registers are

required, While the output address of the kth packet in the

buffer is fed to the switch fabric, it is saved into the kth

register and compared with the contents of the first

(k -

1)

registers in parallel. If the input port controller finds an

unoccupied port for this packet, then the foremost packet

with the same output address is transmitted in the transmis-

sion slot.

To implement the look-through policy, the input port

controller should know the positions of these f packets,

all with different output addresses, in the input buffer at

the beginning of a packet slot. Since the length of the trans-

mission slot is much larger than that of an offer slot and

the input port controller has nothing to do during the trans-

mission slot, the input port controller will find the positions

of thesef packets for the kth packet slot during the trans-

mission slot of the

(k -

1)th packet slot and memorize

their positions by using registers. Hence, at the beginning

of the

kth

packet slot, these

f

packets are available for

the input port controller to feed the switch fabric.

(3)

For a large switch, the maximum throughputs of these two disciplines are almost the same under uniform traffic. It is because the output addresses of different packets are usually different that the leading

f

packets are normally heading for different output ports. However, under hot- spot traffic at heavy loads, hot-spot packets will be queued in front of input buffers because the switch cannot handle so many hot-spot packets as they come in. With the look-back policy, it is likely that the firstf packets may all be hot-spot packets with the same destination, depending on the hot- spot ratio. However, with the look-through discipline, the input port controller will skip these hot-spot packets to find non-hot-spot packets. Hence, the look-through policy will achieve a much better throughput than the look-back policy.

Some previous works are limited to the look-back policy under a uniform traffic pattern. Hluchyj and Karol [ 151 pre- sented simulation results of the maximum throughput for a nonblocking switch, such as crossbar switches, with bypass queues under various switch sizes and window sizes. The overhead incurred by offer slots was not considered, and no analytical model was presented. Sarkies [lo] presented a structure of the input port controller for Banyan switches to overcome the problem of a packet arriving out of sequence with a few simulation results. We analyzed the maximum throughputs of single and parallel-plane Banyan switches with ‘look-back’ bypass queues under uniform traffic [ 161.

In this paper, we study the performance of Banyan and crossbar switches with bypass queues under uniform and nonuniform traffic. Both the look-back and look- through policies are considered and compared. An analyti- cal model is proposed for the look-through policy under a uniform traffic pattern. The rest of the paper is organized as follows. Section 2 analyzes the maximum throughputs of Banyan and crossbar switches with bypass queues under uniform traffic. Simulation results accompany the analysis to show the preciseness of the analytical model. The perfor- mance under nonuniform traffic is presented in Section 3. Section 4 discusses and analyzes the parallel-plane ATM switches with bypass queues. Conclusions are given in Section 5.

2. Performance analysis of look-through policy under uniform traffic

In this section, we will find the maximum throughputs for switches with the look-through policy under a uniform traffic pattern. A Banyan switch will be examined first, and then the crossbar switch.

2.1. Bnnyun switch

With the look-through policy, it is reasonable to assume independence between packet slots, i.e. between packet slots the destinations of packets in the bypass window are

independent. Similarly, we also assume independence between offer slots. Based on these assumptions, the analy- sis will only consider the behavior of the switch in a packet slot. Furthermore, it is assumed that the two input links of a switching element are statistically independent; hence, the state of a stage can be reduced to that of a single input link of the stage [17]. We introduce the following notation for analysis:

N = switch size

n = number of stages in the switch (n = logI N) f = window size

j = number of paths which are already set up

pi = Probrthere is a packet departing from state i] = Problthere is a packet coming to stage i + I] fl =overhead incurred by one offer slot. (Hence, /3 = log2 N/424.)

olj = prob[a connection setup is successful for a packet in the input buffer \ j paths are already set up]

S(k) = total number of input links permitted to transmit a packet after the kth offer slot]

B(k) = total number of blocked input buffers after the kth offer slot

TH(k) = throughput of the switch after the kth offer slot (after considering the overhead).

The throughput analysis under such a policy and uniform traffic pattern is the same as in our previous work [16]. However. we will briefly outline the key ideas in the follow- ing for completeness. The following is the analysis to obtain a,:

PO = P

(1)

p,z=p._,[*-$.;

-(I-$+_

.;.;I

With OLj found, we can derive the throughput as follows. To find the maximum throughput, we set p. = 1 for all computations.

Initially, S(0) = 0, B(0) = N, TH(0) q = 0. Compute (Y() using Eq. (1):

S(1) = NYQ, B(1) = N-S(l), 7X(I) = S(I)I]N(l + P)]. Compute os( Ij using Eq. (1):

S(2) = S(1) + B( I)as~,~, B(2) = N - S(2).

(4)

118 J.-H. Huang, Y.-S. SdComputer Communications 20 (1997) 115-124 0.0 I . I * I . I , I 1 3 5 1 9 11

window size

3 5 7 9

window size

Fig. 4. Maximum throughput of Banyan switches under uniform traffic, Fig. 6. Maximum throughput of crossbar switches under uniform traffic,

Computer CQ(

f _ ,) using Eq. (1):

S(f) = S(f - 1) + B(f - l)*os(f- ,)9

B(f) = N - S(f),

TH(f) = s(fMN(l

+fP)l.

(2)

The maximum throughputs considering the window over-

head are illustrated in Fig. 3. Note that with FIFO input

queueing (window size = l), the maximum throughput of

a lo-stage Banyan switch is only 0.25. However, bypass

input queueing can improve the throughput up to 0.44,

with a window size equal to 12. Fig. 4 shows the simulation

results of the maximum throughput for a 16 x 16 Banyan

switch under various window sizes and policies, compared

with the analytical results. From this figure, we observe that

the analytical results fit well with the simulation results

using the look-through policy. Besides, the assumptions of

independence do not affect the estimation of the optimal

window size.

access each output port. If there are many packets directed

to the same output port, one of them is randomly chosen to

access this output port and the others are blocked in the

input buffers [18]. In the following analysis, input buffers

are checked one-by-one to find the maximum throughput.

Some assumptions and notation are used as stated earlier:

Setj = 0 and S(0) = 0

Initial w = I

for-each w 5 f do

begin

Initial p = 1

for-each

p 5 N

do

begin

setj=j+

[l-E!?+L!].~;

end

Set S(w) = j

2.2.

Crossbar switch

For crossbar switches, at most one packet is permitted to

.

N-1024 - N=64 N=256 E 0.4 i- -I

Set TH(w) =

S(w)

N(1 +

$9

0.8 5 2 0.7 $J 0.6 e 5 0.5 E 0.4 g 0.3 ‘5 0.2 E 0.1 5 7 9 11

window size

Fig. 5. Simulation results of the maximum throughput for a 16 x 16 Banyan switch under uniform traffic. q : analysis; 0: simulation (look-through); + : simulation (look back).

Variable w is the index to calculate THcf), and

p

is the

index variable used to check input buffers one-by-one. The

term 1 - S(w - 1)/N in the inner loop is the probability that

this input port controller is still looking for an unoccupied

output port. That is, none of the leading packets in this input

buffer has found an unoccupied output port. Note that this

term is always 1 in the first loop to calculate TH( l), because

the packets checked are the first packets in the input buffers.

The term (N -

j)/N

in the inner loop is the probability that

the destination for the packet being checked is an unoccu-

pied output port. After checking every input buffer, we find

the value of variable j and use it to calculate

TH(

1). Then, w

is increased by one to calculate

TH(2).

Using this method,

f loops later, we will obtain

THcf).

The

maximum throughputs considering the window over-

head are illustrated in Fig. 5, which shows that, with FIFO

input queueing (window size = l), the maximum throughput

of a 1024 x 1024 crossbar switch is 0.62. With the aid of

(5)

1 3 5 7 9 window size

Fig. 7. Simulation results of the maximum throughput for a 64 x 64 cross-

bar switch under uniform traffic. 0: analysis; +: simulation (look-

through); 0: simulation (look-back).

bypass input queueing, the throughput can be improved to 0.77 with an optimal window size equal to 5. Fig. 6 is the simulation results of the maximum throughput for a 64 x 64 crossbar switch under various window sizes and policies, compared with the analytical results. It is shown in this figure that the analytical results match well with the simula- tion results. We also observe that the look-through and look- back policies achieve almost the same throughput, since the switch size is large and the traffic pattern is uniform.

3. Simulation results under nonuniform traffic

For nonuniform traffic, it is very difficult to find the maxi- mum throughput analytically. Hence, we run numerous simulations to learn the impact of traffic pattern on the throughput under different switch architectures and service policies. 1.0 I 5 2 0.8 g s 0.6 E z 0.4 ._ s 0.2 0.0 1 3 5 7 9 11 window size

Fig. 8. Simulation results of the maximum throughput for a 16 x 16 Banyan switch when the hot-spot ratio equals 0. I, 0.2 and 0.3, respectively. A: look- through (uniform): 0: look-through (h = 0.1); n : look-through (h = 0.2);

0: look-through (h = 0.3): 0: look-back (h = 0.1); 4: look-back (/I = 0.2): 0: look-back (h = 0.3).

I . I . I . I , 1

1 3 5 7 9 11

window size

Fig. 9. Simulation results ofthe maximum throughput for a 16 x 16 Banyan

switch under mixed SSSD/uniform traffic. El: look-back: + : look-through.

3.1. Banyan switch

The switch simulated is a 16 x 16 Banyan switch. Fig. 7 shows the simulation results of the maximum throughput under various window sizes and policies, providing that the hot-spot ratios are 0.1, 0.2 and 0.3, respectively. From this figure, we observe that the maximum throughput obtained by the look-through policy is much higher than that obtained by the look-back policy under higher hot- spot ratios. Moreover, notice that a higher hot-spot ratio only slightly decreases the throughput for the look-through policy as long as the window size is properly chosen and the hot-spot ratio is not too high. For the look-back policy, a higher hot-spot ratio will significantly decrease the through- put. The reason why the maximum throughput of the switch with look-back under a higher hot-spot ratio is almost inde- pendent of window sizes is because there are many hot-spot packets queued in the front of the input buffers, so the input port controllers cannot find more non-hot-spot packets in the bypass windows to send, even for a large window size. For mixed SSSDAmiform traffic, Fig. 8 shows the simu- lation results of the maximum throughput under various window sizes and policies. We observe that the look- through policy achieves a slightly better throughput than the look-back policy.

3.2. Crossbar switch

The switch simulated here is a 64 x 64 crossbar switch. Fig. 9 shows the simulation results of the maximum throughput under various window sizes and policies, pro- viding that the hot-spot ratios are 0.1, 0.2 and 0.3, respec- tively. Again, we observe that the throughput obtained by the look-through policy is much higher than that obtained by the look-back policy. Similar to the result of Banyan switches, the throughput is only slightly decreased for a higher hot-spot ratio for the look-through policy.

Under mixed SSSD/uniform traffic, a 64 x 64 cross- bar switch is composed of a leased line and a 63 x 63

(6)

120 J.-H. Huang, Y.-S. SufComputer Communications 20 (1997) 115-124

0.01

1 3 5 7 9

window size

Fig. 10. Simulation results of the maximum throughput for a 64 x 64 cross- bar switch when the hot-spot ratio equals 0.1, 0.2 and 0.3, respectively.

q : look-through (uniform); 4 : look-through (h = 0.1); 0: look-through (h = 0.2); 0: look-through (h = 0.3); n : look-back (h = 0.1); Cl: look- back (h = 0.2); A: look-back (h z 0.3).

crossbar switch. Fig. 10 shows the simulation results of the maximum throughput under various window sizes and policies. We observe that the look-through and look-back policies achieve almost the same throughput. This is because the switch size is large and the traffic pattern, except the leased line, is uniform.

4.

Parallel-plane ATM switches

Bypass queueing is efficient to overcome the problem of head of line blocking within an input queue. However, when many packets in different input buffers contend for the same internal circuit of the switch, only one of them can be trans- mitted in the transmission slot. To relieve this situation, we propose a parallel-plane architecture [7-lo] to reduce the contention across input queues. In this section, we will find the maximum throughput analytically under uniform traffic.

3 5 7

window size

11. Simulation results of the maximum throughput for a 64 x 64 crossbar switch under mixed SSSD/uniform traffic. +: look-through; El: look-back.

input input port o”tput w shared buffers

ATM switch

Fig. 12. Architecture of the parallel-plane ATM switch with bypass queues.

4.1.

Banyan switch

Although the bypass queue has improved the switch throughput significantly, the throughput is still not high enough. For instance, the throughput for a 1024 x 1024 Banyan switch with an optimal bypass window is only 0.44, as stated earlier. To increase the throughput, the parallel-plane ATM switch is proposed. Fig. 11 shows the architecture of the parallel-plane ATM switch. Its opera- tion is similar to the single-plane switch except that the parallel-plane switch provides multiple paths. A cell will be sent to all switch planes for connecti,on setup. If more than one connection is established within one offer slot, only one connection will be chosen and the others will immediately be released at the end of that offer slot. Since the parallel-plane switch will cause the problem of output contention, a form of output queueing, the shared

buffer, is used to overcome this problem [I]. To keep the

packet loss probability lower than a specified value, the size of the shared buffer must be calculated, and can be found in Ref. [19].

With the same assumptions as stated in Section 2, and assuming j paths have already been set up before an offer slot, it is reasonable to assume that these j paths are uni- formly distributed among all planes. Defining m as the num- ber of switch planes, the following is the analysis to obtain aj> as can be found in Ref. [16]:

PO = P

(3)

Pn =Pn-l

[-A.;- (1

--&$).pn_,.;.;]

OZj = 1 - (1 - p,/p())”

Based on the assumption that j paths are uniformly dis- tributed among these 172 planes, there are j/m paths for each plane. We use this term to calculate p,, as we did in Section

2. Then, oj is obtained by Eq. (3). The second term in Eq. (3) is the probability that all connection attempts in m planes are failed for a packet in the input buffer.

(7)

1 3 5 7 9 11 window size

Fig. 13. Maximum throughput of 2-plane Banyan switches under uniform traffic.

system throughput. The maximum throughputs for m = 2 and 3 are illustrated in Figs. 12 and 13, respectively. Note that, as the number of switch planes increases, the optimal window size decreases such that the switch complexity is reduced. Fig. 14 shows the simulation and analytical results of the maximum throughput for a 16 x 16 Banyan switch with the look-through policy under various window sizes and m = 2. From this figure, we observe that the analytical results conform to the simulation results perfectly. As the number of planes grows, the analytical results conform even better to the simulation results.

In Fig. 15, we show the maximum throughputs under various numbers of switch planes and switch sizes, provid- ing the optimal window size. Note that the throughput incre- ment becomes insignificant when more than four switch planes are used; therefore, only three or four planes are needed to obtain a very high throughput. For N = 1024 the throughput is as high as 0.85 when m Z 4. For N = 256, the throughput is higher than 0.9 when m 2 4. In general, a switch with hundreds of input ports is

0.1

0.0 1 - 1 * 1 - ’ * 1 ”

1 3 5 7 9 11

window size

Fig. 14. Maximum throughput of 3-plane Banyan switches under uniform traffic.

v.v--- ~ ~

1 3 5 7 9 11

window size

Fig. 15. Simulation results of the maximum throughput for a Z-plane I6 x 16 Banyan switch under uniform traffic. Cl: analysis; + : simulation (look-through).

considered a large switch; hence, this architecture obtains a high throughput even for large switches.

For parallel-plane switches under non-uniform traffic pattern, Fig. 16 gives the results for a three-plane 16 x 16 switch under hot-spot traffic with a hot-spot ratio equal to 0.3. This figure shows that, for the look-through policy, the maximum throughput reaches 0.88 when the window size equals only three. If only one plane is used, the window size would be as high as ten to achieve a maxi- mum throughput at 0.75. Note that a smaller window size will lead to a simpler architecture for the input port controller, and also reduce the delay time incurred. Also note that for a window size equal to three, the three-plane switch obtains a throughput at 0.88 while the single-plane switch only obtains 0.52. These results show that parallel- plane with bypass queueing is very effective for non- uniform traffic. For the look-back policy, Fig. 17 shows that the throughput of the three-plane architecture is about

0.8 0.6 I I 1 I I 3 5 7 9 window size

Fig. 16. Maximum throughput of a 3.plane 16 x I6 Banyan switch under

hot-spot traffic with hot-spot ratio=0.3. m: look-through t3-plane); 0:

(8)

122 J.-H. Huang, Y.-S. WComputer Communications 20 (1997) 115-124

0.0

1 3 5 7 9

window size

Fig.

17. Maximum throughput of a 3-plane 64 x 64 crossbar switch under

hot-spot traffic with hot-spot ratio=0.2. n : look-through (3-plane); 0: look-back (3-plane); 0: look-through (I -plane); 0: look-back (1 -plane).

three times that of the single-plane architecture. Hence,

the parallel-plane architecture is indeed very effective for

non-uniform traffic.

4.2.

Crossbar switch

With the same assumptions as stated in Section 2, and

assuming that the paths which are already set up are uni-

formly distributed among all planes, the throughput analysis

is similar to that of Section 2:

Setj = 0 and S(0) = 0

Initial w = 1

for-each w (-f

do

begin

Initial

p =

1

for-each p 5

N do

begin

setj=j+

[1-v].[l-

(--&--I

1.0

4

E

0.4 - .i 0.3 - , 1 2 3 4 5 6 I

number of switch plane

Fig. 18. Maximum throughput of parallel-plane Banyan switch using optimal window size for various switch planes under uniform traffic.

0.0 t I I 1 I

1

1 3 5 7 9

window size

Fig. 19. Maximum throughput of 2-plane crossbar switches under uniform traffic. q : 16 x 16; + : 64 x 64; 13: 256 x 256; 0: 1024 x 1024. end

Set S(w)

= j

Set W(w) =

S(w)

N(1 +

MI)

end

The term (j/Nm)“’ in the inner loop means the probability

that m requests for transmission in M planes are all failed.

Hence, [ 1 - (j/Nnr)“] is the probability that there is at least

one available output port for the packet examined. The

maximum throughputs for

m =

2 are illustrated in Fig. 18.

Fig. 16 contains the simulation results of the maximum

throughput for a 64 x 64 crossbar switch with the look-

through policy under various window sizes and

m =

2.

From this figure, we can observe the preciseness of the

analysis.

In Fig. 19, we show the maximum throughputs under

various numbers of switch planes and switch sizes, provid-

ing the optimal window size. Note that the throughput

increment becomes insignificant when more than two

switch planes are used, therefore, only two or three planes

0.8 - 0.1 - 0.6 - 0.5 - 0.4 - 0.3 - 0.2 - 0.1 - 0.0 1 I I 5 7

window size

Fig. 20. Simulation results of the maximum throughput for a 2-plane 64 x 64 crossbar switch under uniform traffic. El: analysis; + : simulation (look-through).

(9)

0.77

1 2 3 4 5

number of switch plane

Fig. 21. Maximum throughput of parallel-plane crossbar switch using op- timal window size for various switch planes under uniform traffic. 0: N=l6:+:N=64;ID:N=256:+:N=1024.

are needed to obtain a very high throughput. Also, note that the maximum throughput for N = 1024 can be as high as 0.94 when m 2 2. Fig. 20 shows the simulation results of the maximum throughput for a two-plane 64 x 64 crossbar switch under uniform traffic.

For a non-uniform traffic pattern, Fig. 1’7 gives the results for a three-plane 64 x 64 switch under hot-spot traffic with a hot-spot ratio equal to 0.2. This figure shows that the max- imum throughput reaches 0.85 when the window size equals only two for the look-through policy. If only one plane is used, the window size would be as high as ten to achieve a maximum throughput at 0.79. This figure also shows that, for the look-back policy, the throughput of the three-plane architecture is about three times that of the single-plane architecture. These results again show that the parallel- plane architecture is effective under non-uniform traffic.

Since a smaller window size will lead to a simpler archi- tecture for the input port controller, it also reduces the delay time incurred. This result shows that parallel-plane with bypass queueing is very effective for non-uniform traffic. For the look-back policy, Fig. 17 also shows that the three- plane architecture is about three times that of the single- plane architecture; hence, the parallel-plane architecture performs very well indeed. Fig. 21 shows the maximum throughput of the parallel-plane crossbar switch using an optimal window size for various switch planes under uni- form traffic.

5. Conclusions

With the assumption of a uniform traffic pattern, many elegant switch architectures have been proposed, such as the Knockout [ 1, 21 and Christmas tree [4] switches. However, with a non-uniform traffic pattern, especially hot-spot traf- fic, the elegance of these switches will soon disappear. For example, the size of the concentrator of Knockout switches will grow rapidly as the hot-spot ratio rises. Unfortunately, a

hot-spot traffic pattern is common in real networks because there are many databases and file servers distributed in the network. Banyan and crossbar switches have simpler archi- tectures but lower throughputs compared to a Knockout switch. However, with the aid of a bypass input queue and parallel-plane architecture, the throughput can be improved significantly without adding too much complexity, even with a hot-spot traffic pattern. Hence, such a switch archi- tecture is of much practical use. All in all, we can summar- ize the paper as follows:

l A bypass queue can improve throughput significantly. l The look-through policy obtains a much higher through-

put than the look-back policy under a hit-spot traffic pattern.

As long as the hot-spot ratio is not high, the throughput of the look-through policy will decrease only slightly compared to that under uniform traffic.

Based on the above observation, the throughput analysis under a non-uniform traffic pattern can be approximated by the analytical results under uniform traffic as long as the hot-spot ratio is moderate.

The parallel-plane architecture can further improve the throughput.

Both the optimal window size and the optimal number of planes are small (smaller than 4), hence the additional switch complexity is low.

References

[I]

Y.S. Yeh, M.G. Hluchyj and A.S. Acampora. The Knockout switch: a

simple, modular architecture for high-performance packet switching. IEEE J. Select. Areas Commun., (October 1987) 1274- 1283. [2] K.Y. Eng, M.G. Hluchyj and Y.S. Yeh, A Knockout switch for

variable-length packets. IEEE J. Select. Areas Commun., (December 1987) 142661435.

[3] K.Y. Eng, M.J. Karol and Y.S. Yeh, A growable packet (ATM) switch architecture: design principles and applications. IEEE Trans. Commun., 40(2) (February 1992) 423-430.

[4] Weijia Wang and F.A. Tobagi, The Christmas-tree switch: an output queueing space-dtvision fast packet switch based on interleaving distribution and concentration functions. IEEE INFOCOM ‘91. 1991.0163-0170.

[_5] E.W. Zegura. Architectures for ATM switching systems. IEEE Commun. Magazine (February 1993) 28-37.

[6] F.A. Tobagi, Fast packet switch architectures for broadband integrated services digital networks. Proc. IEEE, 78( 1) (January 1990) 133-167. (71 H. Ahmadi and W.E. Denzel, A survey of modem high performance switching techniques. IEEE J. Select. Areas Commun., 7 (September

1989) 1091-I 103.

[8] Y. Oie et al., Survey of switching techniques in high-speed networks and their performance. IEEE INFOCOM ‘90, 1990, 1242- 125 1. [9] P. Newman. A fast packet switch for the integrated services backbone

network. IEEE J. Select. Areas Commun., 6(9) (December 1988) 1468-1479.

[lo] K.W. Sarkies, The bypass queue in fast packet switching. IEEE Trans. Commun., 39(5) (May 1991) 766-774.

[1 I] J.Y. Hui and E. Arthur% A broadband packet switch for integrated transport. IEEE J. Select. Areas Commun., 5(8) (October 1987) 12@-

(10)

124 J.-H. Huang, Y.-S. St/Computer Communications 20 (1997) 115-124 [12] R.G. Bubenik and J.S. Turner, Performance of a broadcast packet

switch. IEEE Trans. Commun., 37( 1) (January 1989) 60-69. [ 131 K. Shiomoto et al., Performance evaluation of cell bypass queueing

discipline for buffered Banyan type ATM switches. IEEE INFOCOM ‘90, 1990.677-685.

[14] H.S. Kim and A. Leon-Garcia, Performance of buffered Banyan networks under nonuniform traffic patterns. IEEE Trans. Commun., 38(5) (May 1990) 648-658.

[ 151 M.G. Hluchyj and M. Karol, Queueing in high-performance packet switching. IEEE J. Select. Areas Commun., 6(9) (December 1988)

1587- 1597.

[16] Y.-S. Su and J.-H. Huang, Throughput analysis and optimal design of Banyan switches with bypass queues. IEEE Trans. Commun., 42( 10) (October 1994) 2781-2784.

[ 171 Y.C. Jeng, Performance analysis of a packet switch based on a single- buffered Banyan network. IEEE J. Select. Areas Commun., l(6) (December 1983) 1014-1021.

[18] S. Nojima et al., Integrated services packet network using bus matrix switch. IEEE J. Select. Areas Commun., 5 (October 1987) 1284-1292. [I93 Y.-S. Su, Performance and optimal design of ATM switches with

bypass queues, MS thesis, Department of Computer Science and Information Engineering, National Taiwan University, June, 1993.

Yo-Song Su received his BS in computer science and information engineering from Tatung Insti- tute of Technology, Taipei, Taiwan in 1988, From 1988 to 1990 he served in the Army as an Electronics Officer. In 1991 he attended the graduate school and received an MS in com- puter science and information engineering from National Taiwan University, Taipei,

Taiwan in 1993. Since then he has worked for Syscom Computer Engineering Co. as an engi- neer in Taiwan. His interest is focused on the design of the ATM switch and its performance evaluation.

Jau-Hsiung Hunng received the BS in electrical engineering from National Taiwan University, Taipei, Taiwan in 1981 and the MS and PhD in computer science from the University of California, Los Angeles in 1985 and 1988, respectively. Since I988 he has been a member of the faculty in the Department of Computer Science and Information Engineering, National Taiwan University, where he is currently a professor. He has published over 40 technical papers in the areas of multimedia networking, high speed networking, parallel and distributed systems and the per- formance evaluation of computing systems.

1

1

數據

Fig.  2.  Structure  of  a  packet  slot.
Fig.  4.  Maximum  throughput  of  Banyan  switches  under  uniform  traffic,  Fig.  6
Fig.  7.  Simulation  results  of  the  maximum  throughput  for  a  64  x  64  cross-  bar  switch  under  uniform  traffic
Fig.  12.  Architecture  of  the  parallel-plane  ATM  switch  with  bypass  queues.
+4

參考文獻

相關文件

• It works as if the call writer delivered a futures contract to the option holder and paid the holder the prevailing futures price minus the strike price.. • It works as if the

< Notes for Schools: schools are advised to fill in the estimated minimum quantity and maximum quantity (i.e. the range of the quantity) of the items under Estimated Quantity

220V 50 Hz single phase A.C., variable stroke control, electrical components and cabling conformed to the latest B.S.S., earthing through 3 core supply cable.. and 2,300 r.p.m.,

mathematical statistics, statistical methods, regression, survival data analysis, categorical data analysis, multivariate statistical methods, experimental design.

dimensional nanomaterials for photodetectors with ultrahigh gain and wide spectral response. II.  Photon down conversion and light trapping in hybrid ZnS nanopartcles/Si

This essay wish to design an outline for the course "Taiwan and the Maritime Silkroad" through three planes of discussion: (1) The Amalgamation of History and Geography;

• What is delivered is now a forward contract with a delivery price equal to the option’s strike price.. – Exercising a call forward option results in a long position in a

• Similar to futures options except that what is delivered is a forward contract with a delivery price equal to the option’s strike price.. – Exercising a call forward option results