• 沒有找到結果。

三級式克勞斯網路與多重對數網路的廣義不阻塞

N/A
N/A
Protected

Academic year: 2021

Share "三級式克勞斯網路與多重對數網路的廣義不阻塞"

Copied!
47
0
0

加載中.... (立即查看全文)

全文

(1)國立交通大學 應用數學系 博士論文 三級式克勞斯網路與多重對數網路的廣義不阻塞 Wide-Sense Nonblocking for 3-stage Clos Network and Multi-logd N Network 研 究 生 : 郭君逸 指導教授: 黃光明 教授. 中華民國九十四年六月.

(2) 三級式克勞斯網路與多重對數網路的廣義不阻塞 Wide-Sense Nonblocking for 3-stage Clos Network and Multi-logd N Network 研 究 生 : 郭君逸 Student: Junyi Guo 指導教授: 黃光明 教授 Advisor: Frank K. Hwang 國立交通大學 應用數學系 博士論文. A Dissertation Submitted to Department of Applied Mathematics College of Science National Chiao Tung University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy in Applied Mathematics June 2005 Hsinchu, Taiwan, Republic of China 中華民國九十四年六月.

(3) Abstract The 3-stage network was first proposed by Clos and is one of the most basic multistage interconnecting network. Clos (1953) showed that the number of middle crossbar required for strictly nonblocking is 2n − 1, where n is the number of inlets of an input crossbar. Beneˇs (1965) constructed an example to show that using packing routing strategy can make the number of middle crossbar required lower. This has remained the only example of wide-sense non-blocking 3-stage Clos network which is not strictly nonblocking. In this thesis, we showed that the number of middle crossbar required for wide-sense nonblocking under several routing strategies: save the unused, packing, minimum index, cyclic static, and cyclic dynamic, which has been studied in the literature is the same as required for strictly nonblocking and extended them to asymmetric 3-stage Clos network. In particular, we prove the same conclusion for the multi-logd N network and extend to a general class of network.. iii.

(4) 摘要 克勞斯 (Clos) 在1953年首先提出了 「三級式網路」, 這也是最基本的多 級式交換網路之一; 假設 n 是一個輸入交換器的進線個數, 克勞斯證明 了這樣的網路只需要用到 2n − 1 個中繼交換器, 即能使得此網路達到 絕對不阻塞。 在1965年, 班尼斯 (Beneˇs) 舉了一個例子說明, 使用 「優 先選最忙的中繼交換器」 的策略, 真的可以降低中繼交換器的使用數目, 而依然還是能讓此網路不阻塞。 不過, 這個例子也是至今唯一個三級式 克勞斯網路是廣義不阻塞, 而不是絕對不阻塞的例子。 在這篇論文裡, 我們證明了文獻中所提到的一些傳遞策略: 不用的最 後(STU)、 最忙錄的優先 (P)、 編號小的優先 (MI)、 從上次的編號開始 (CS) 及從下一個編號開始 (CD), 在這些策略之下, 要達到不阻塞所需 要的中繼交換器個數與要達到絕對不阻塞所需要的一樣。 而我們也將這 個結果推廣到不對稱的情況, 甚至我們還在多重對數網路上也得到了同 樣的結果。 最後, 我們也將此結果推廣到了同類型的網路上。. iv.

(5) 誌謝 我最先要感謝的當然就黃光明老師。 在還沒進到交大以前的我, 從沒有 聽過 「群試」 這個名詞, 直到我聽了黃老師的第一門課之後, 對我產生極 大的影響, 再加上看見了黃老師對研究的熱忱, 讓我對做研究產生了極 大的興趣。 另一方面, 也因為黃老師涉及很多領域, 也讓我學到了很多不 同領域的東西。 此外, 每次有什麼想法跟老師說, 或是有什麼問題去問老 師, 他都非常用心的審視我提的問題, 儘速給我答案或是建議, 因此這幾 年來有了黃老師的幫忙, 我才能如此順利畢業。 我也蠻感激劉昱琪學姊, 因為她極力推薦我黃老師很不錯, 才讓我開 始對黃老師有了認識。 陳秋媛老師也在這幾年來, 給了我很大的幫助, 她 就像是我的好朋友一樣, 不僅是課業上, 生活上有什麼問題都可以找她 聊聊; 她的對學生的關懷、 奉獻, 真的很讓我感動。 此外, 我還要感謝曾 經指導過我的老師: 傅恆霖老師的圖論、 組合設計、 編碼, 他用生動的講 解來取代課本上枯燥的證明, 第一次讓我覺得這些課程怎麼變的這麼有 趣。 黃大原老師啟發性的教學, 也很讓我印像深刻。 翁志文老師對LATEX 熟悉, 常是我討論LATEX的對像。 在交大這幾年, 我也認識了不少朋友: 賓賓、 飛黃、Robin、 琲琪 . . . 等, 感謝他們陪我做研究、 討論, 陪我一 起打球, 甚至是打牌, 有了他們, 我才能這麼順利, 過得這麼快樂。 在這 裡, 我也認識了我的另一半, 惠蘭, 感謝她不厭其煩的聽我講述我做的結 果, 給我建議, 也感謝她在我心情不好的時候, 能夠與我一起分擔。 最重 要的, 我還要感謝我的父母, 提供我這麼好的學習環境, 讓我能夠把心放 在課業上、 研究上, 他們的鼓勵、 讚許, 一直給了我很大的信心及動力。 要感謝的人很多, 感謝的心情, 實在難以形容, 我想, 我還要感謝上 天, 讓我一路走來, 能夠平平穩穩、 順順利利。 感謝各位老師、 同學的支 持與鼓勵, 讓君逸能夠有如今的成果。 謝謝你們, 謝謝。 v.

(6) Contents Abstract. iii. 中文摘要. iv. 誌謝. v. Contents. vi. List of Figures. viii. 1 Introduction 1.1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2 Literature review and thesis overview . . . . . . . . . . . . . .. 1 1 6. 2 3-stage Clos networks 8 2.1 Strictly nonblocking . . . . . . . . . . . . . . . . . . . . . . . . 8 2.2 Wide-sense nonblocking in symmetric case . . . . . . . . . . . 9 2.3 Wide-sense nonblocking in asymmetric case . . . . . . . . . . 15 3 Multi-logd N networks 3.1 Strictly nonblocking . . . . . . . . . . . . . . . . . . . . . . . . 3.2 Wide-sense nonblocking . . . . . . . . . . . . . . . . . . . . . 3.3 Generalizations . . . . . . . . . . . . . . . . . . . . . . . . . .. 23 23 25 30. 4 Conclusions and future works. 35. vi.

(7) Appendix. 37. Reference. 38. vii.

(8) List of Figures 1.1 1.2 1.3 1.4 1.5 1.6. Xnm . . . . . . . . . . . . . . . . . . . A 4-stage interconnection network . . . (a) asymmetric (b) symmetric . . . . . Linking pattern of the baseline network The graph model of BL2 (4) . . . . . . Multi-logd N Network . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . and an example. . . . . . . . . . . . . . . . . . . . .. . . . . . .. . . . . . .. 3.1. The left figure is an induced graph of the graph model of a multi-. . . . . . .. 2 2 3 4 4 5. logd N network, for n odd. And the right figure is its correspondence to a 3-stage Clos network. . . . . . . . . . . . . . . . . . . 27. 3.2. This is an induced graph of the graph model of a multi-logd N network, for n is even. . . . . . . . . . . . . . . . . . . . . . . . 28. 3.3. γ and M (V, γ) = {a, b, c, d} in C(3, 4, 2) . . . . . . . . . . . . . 33. viii.

(9) Chapter 1 Introduction The need of a switching network first came from the need to interconnect pairs of telephones. Later, it was reinvented for parallel computer to interconnect a set of processors with a set of memories. Currently, it is intended for many other applications, data transmission, conference calls, satellite communication . . . . One frequently discussed topic in switching networks is its nonblocking property. There are different levels of nonblockingness: strictly nonblocking, wide-sense nonblocking, and rearrangeable nonblocking. We will discuss more detail in Section 1.1. Because wide-sense nonblocking networks use an algorithm to route requests, the cost of it is expected to be less than strictly nonblocking networks. In these thesis, we study several routing strategies which have been studied in the literature [9] and gave an amazing result that the cost of these networks are the same with strictly nonblocking networks in 3-stage Clos network and multi-log N network.. 1.1. Preliminaries. The basic components of switching network are crossbar switches, or just crossbars, and links which connect crossbars. A crossbar with n inlets and m outlets, denoted by Xnm , is said of size n × m(See Figure 1.1(a)). (For convenience, we draw a two side crossbar without exposing its internal wiring (See Figure 1.1(b))). Inlets(outlets) on the same crossbar are called co1.

(10) 1 2. m. 1 2. 1 2. 1 2. n. n. m. (a). (b) Figure 1.1: Xnm. inlets(co-outlets). In a crossbar, each inlet and each outlet are connected by a crosspoint. Therefore, there are m × n crosspoints in Xnm . Each crosspoint determine some inlet a and some outlet b is connected or not. In general, we assume one inlet(outlet) can only connect to only one outlet(inlet) and any matching between the inlets and the outlets is routable. In an s-stage interconnection network, the crossbars are lined up into s columns, each called a stage. Crossbars in the same stage have the same size and links exist only between crossbars in adjacent stages(See Figure 1.2). The inlets(outlets) of the first(last) stage are called inputs(outputs) of the. Figure 1.2: A 4-stage interconnection network network. Let X × Y to denote a network structure in which each crossbar of the 2.

(11) last stage of X has a link to each crossbar of the first stage of Y . The 3stage network Xn1 m × Xr1 r2 × Xmn2 was first proposed by Clos(1953) and is now known as the 3-stage Clos network. For convenience, we use the notation C(n1 , r1 , m, n2 , r2 ) to denote Xn1 m × Xr1 r2 × Xmn2 and label the crossbars of the first(second, third) stage from I1 (M1 , O1 ) to Ir1 (Mm , Or2 ). See Figure 1.3(a). Then the number of inputs N1 = n1 r1 and the number of outputs N2 = n2 r2 . If n1 = n2 and r1 = r2 , then we call this 3-stage Clos network symmetric and denoted by C(n, m, r). See Figure 1.3(b).. (a) C(2, 4, 3, 3, 2). (b) C(2, 4, 3). Figure 1.3: (a) asymmetric (b) symmetric A d-nary baseline network of order n denoted by BLd (n) has dn inputs, dn outputs and n stages, each stage contains dn−1 d × d crossbars. The linking pattern of BLd (n) can be obtained by n − 1 recurrent constructions(See Figure 1.4(a)). Figure 1.4(b) gives the example of BL2 (4). Two networks are called equivalent if the crossbars can be labeled such that the linking functions of the two networks become identical. There are many networks equivalent to baseline network, see [3], such as banyan, Omega, . . . . We call this class of network logd N network, where N = nd . In this thesis, only the baseline architecture will be considered. For convenience of analysis, we transform a logd N network to a digraph by converting each link, including the inputs and the outputs, to a node, while a crosspoint connecting two links in the network becomes an arc in the digraph(See Figure 1.5). Nodes are arranged in n+1 stages labeled 0, 1, . . . , n 3.

(12) Xdd 1 2. BLd (n-1). 3 4 5. BLd (n-1). BLd (n-1) dn-1. (a) Linking pattern. (b) BL2 (4). Figure 1.4: Linking pattern of the baseline network and an example.. 0. 0. 1. 1. 15. 15. Figure 1.5: The graph model of BL2 (4). 4.

(13) log216. log216. log216. Figure 1.6: Multi-logd N Network from left to right. The nodes in stage 0 correspond to inputs and the nodes in stage n correspond to outputs. Lea(1990) introduced the multi-logd N network composed of p copies of logd N network connected in parallel. In each copy, there is exactly one link between an arbitrary input and output. See Figure 1.6. A request is an (input, output) pair seeking connection. A set of requests can be routed if there exists link-disjoint paths connecting them. A switching network is said to be strictly nonblocking (SNB) if a request can always be routed regardless how the previous requests are routed. It is said to be widesense nonblocking (WSNB) with respect to a routing strategy A if every request is routable under A. The restraint that no two paths in the original network competes for the same link is translated to that no two paths in the graph model competes for the same node. In this thesis we make the common assumption that a crossbar is SNB. For the 3-stage Clos network, a routing strategy deals with the choice of a middle switch to route the request when many are available. We review the five routing strategies proposed in the literature [9]: (i) Save the unused (STU). Do not route through an empty middle crossbar unless there is no choice. 5.

(14) (ii) Packing (P). Choose a busiest, yet available, middle crossbar. (iii) Minimum index (MI). For each request, route in the order M1 , M2 ,. . . , until the first available one emerges. (iv) Cyclic dynamic (CD). If Mk was used last, try Mk+1 , Mk+2 , . . ., until the first available one emerges. (v) Cyclic static (CS). If Mk was used last, try copy Mk , Mk+1 , . . ., until the first available one emerges. For multi-logd N network, we translate these routing strategies by replacing “choosing a middle crossbar” to “choosing a copy (of logd N )”. Note that P ⇒ STU. So WSNB under STU ⇒ WSNB under P since P is a choice of STU. On the other hand, not WSNB under P implies not WSNB under STU.. 1.2. Literature review and thesis overview. The notion of wide-sense nonblocking in switching networks is a fascinating idea to computer scientists. It suggests that the hardware can be reduced through intelligent software (routing) without affecting the nonblocking property of the network. The existence of a WSNB network was first demonstrated by Beneˇs(1965) for the symmetric 3-stage Clos network. He proved that C(n, m, 2) is WSNB under packing if and only if m ≥ b3n/2c which is the only positive result. Lots of efforts have been spent to expand this result, but without success. Smith[14] proved that C(n, m, r) is not WSNB under P or MI if m < b2n − nr c. Du et al. [7] improved to b2n − 2rn−1 c which was extended to cover CS in Hwang [9]. For P, Yang and Wang [18] gave a linear programming formulation of the problem and ingeniously found the closed-from solution n c where F2r−1 is the 2r−1st Fibonacci number, as a necessary m ≥ b2n− F2r−1 condition for C(n, m, r) to be WSNB. Actually, there was an earlier stronger result of Du et al. reported in the 1998 look of Hwang [9] that m ≥ 2n − 1 6.

(15) is necessary and sufficient for C(n, m, r), n ≥ 3, to be WSNB under P. This result for r ≥ 3 together with Beneˇs result for r = 2 gave a definitive answer to the WSNB property of C(n, m, r) under P. Finally, Tsai, Wang and Hwang [15] proved that for all n, there exists r large enough such that C(n, m, r) is not WSNB under any algorithm. The proof of the m ≥ 2n − 1 result by Du et al. is quite difficult to check and the proof of Yang and Wang is also complicated. In Chapter 2 we give a much simpler proof which not only works for P (hence STU), but also for CD, CS and MI. We also extend all these results to the asymmetric 3-stage Clos network C(n1 , n2 , m, r1 , r2 ). In Chapter 3, we prove a similar conclusion these strategies require the same number of copies as SNB does. In Chapter 4, we extend our results to a more general class.. 7.

(16) Chapter 2 3-stage Clos networks In this chapter, we study the five strategies, CD, CS, STU, P, and MI, in the 3-stage clos network. And we give a sequence of requests to force each of them using the maximum number of middle crossbars, i.e., the same number of crossbars as required by the SNB network.. 2.1. Strictly nonblocking. First, we give classical SNB result on symmetric and asymmetric 3-stage Clos network. Then we can compare it to the result in Section 2.2 and Section 2.3. Theorem 2.1.1 (Clos [6]). Assuming min{r1 , r2 } ≥ 2, C(n1 , r1 , m, n2 , r2 ) is SNB if and only if m ≥ min{n1 + n2 − 1, n1 r1 , n2 r2 }. Proof. Without loss of generality, assume the new request γ is from I1 to O1 . Clearly, m = min{n1 r1 , n2 r2 } is sufficient since only min{n1 r1 , n2 r2 } requests can be generated, and we can route them each through a distinct middle crossbar in worst case. Furthermore, if the busy co-inlets and cooutlets are each routed through a distinct middle crossbar, then at most (n1 − 1) + (n2 − 1) middle crossbars are taken; so n1 + n2 − 1 middle crossbars also suffice. Next we prove necessity. Suppose m = n1 + n2 − 2 and neglect the boundary condition. If we connect n1 − 1 co-inlets to O2 and n2 − 1 co8.

(17) outlets from I2 , and let these n1 + n2 − 2 requests route through distinct middle crossbar, then γ is blocked. Hence m must be greater than n1 +n2 −2. Then we prove the theorem. Corollary 2.1.2. If r ≥ 2, then C(n, m, r) is SNB if and only if m ≥ 2n−1. Proof. Because r ≥ 2, N = nr ≥ 2n > 2n − 1. Hence min{2n − 1, N } = 2n − 1.. 2.2. Wide-sense nonblocking in symmetric case. The only positive result in about WSNB is Beneˇs result [1]. He demonstrated the following theorem on C(n, m, 2): Theorem 2.2.1. C(n, m, 2) is WSNB under STU if m ≥ b3n/2c. Proof. Let s be a state and let n(s) denote the set of busy middle crossbars (carrying at least one connection) in s. Let nij (s) denote the set of middle crossbars carrying a connection from Ii to Oj , i, j = 1, 2. We will prove the theorem by induction on the number of steps to reach s (from the initial empty state): (i) |n(s)| ≤ b3n/2c, (ii) |n11 (s) ∪ n22 (s)| ≤ n, (iii) |n12 (s) ∪ n21 (s)| ≤ n. All three claims are trivially true at empty state. Consider a general step from state s0 to s. If s is obtained from s0 by deleting a connection, the three claims obviously remain true. So assume s is obtained from s0 by adding a connection. Without loss of generality, assume it is from I1 to O1 . If n22 (s0 ) \ n11 (s0 ) 6= ∅, then a crossbar belong to n22 (s0 ) \ n11 (s0 ) will carry the new request under STU. Thus |n11 (s) ∪ n22 (s)| = |n11 (s0 ) ∪ n22 (s0 )| ≤ n and the claims remain true. Therefore, we assume n22 (s0 ) \ n11 (s0 ) = ∅. 9.

(18) (i) Since I1 and O1 can each be engaged in at most n − 1 connections, |n11 (s0 ) ∪ n12 (s0 )| ≤ n − 1, |n11 (s0 ) ∪ n21 (s0 )| ≤ n − 1. Using the induction hypothesis (iii) |n12 (s0 ) ∪ n21 (s0 )| ≤ n. Adding them up, we obtain 2|n(s0 )| ≤ 3n − 2, or |n(s0 )| ≤ b3n/2c − 1. Route the new request through an unused middle crossbar. Then |n(s)| ≤ b3n/2c. (ii) Because n22 (s0 ) ⊆ n11 (s0 ) and |n11 (s0 )| ≤ n − 1, we obtain |n11 (s) ∪ n22 (s)| ≤ n. (iii) |n12 (s) ∪ n21 (s)| = |n12 (s0 ) ∪ n21 (s0 )| ≤ n.. Note that Theorem 2.2.1 is for the almost trivial network r = 2. Lots of efforts have been spent to expand this result, but without success. In the following section, we will give the unexpected results that for r ≥ 3, not only packing and STU, but also CS, CD, and MI do not save any middle crossbars. A state of C(n, m, r) can be represented by an r×r matrix where cell (i, j) consists of the set of the labels of middle crossbars carrying a connection from Ii to Oj . Then each row or column can have at most n entries and the entries must be all distinct. The n-uniform state is the matrix where each diagonal n c result cell contains {1, . . . , n} and all other cells are empty. The b2n − 2r−1 was actually proved [9] for all algorithms which can reach the n-uniform 10.

(19) state, which, as shown in [9], includes P, STU, MI and CS. Hung (private communication) observed that CD can also reach the n-uniform state. Chang et al. [4] give a stronger result based on his method. Lemma 2.2.2. CD can reach any state s from any state s0 . Proof. Since we can disconnect all paths in s0 to reach the empty state, it suffices to prove for s0 the empty state. We prove this by adding each Mk in s to it’s proper cell one by one. Suppose Mk is in cell (i, j). Consider a request γ. Suppose CD assigns Mh to connect γ. If h 6= k, disconnect γ and reconnect it immediately. Then CS would assign Mh+1 to connect the pair. Repeat this until Mk is assigned. Since Mk is arbitrary, s can be reached. Corollary 2.2.3. CD can reach the the n-uniform state. For CS we prove a weaker property. Let [i, j] denote the set {i, i+1, . . . , j} if i ≤ j, and the empty set if i > j. Lemma 2.2.4. Let state s be obtained from s0 by adding [i, j], i < j, to a cell C. Then s can be reached from s0 under CS. Proof. Suppose the last assignment is Mk in s0 . Since i < j, we can add at least two connections in C. Then Mk and Mk+1 will be assigned. If k 6= i, disconnect the connection through Mk and regenerate a connection in C, for which Mk+2 will be assigned. Continue this until Mi and Mi+1 are assigned. Then add j − i − 1 connections to C for which Mi+2 , . . . , Mj will be assigned. Theorem 2.2.5. C(n, m, r) for r ≥ 2 is WSNB under CD and CS if and only if m ≥ 2n − 1. Proof. The ”if” part is trivial since C(n, 2n − 1, r) is SNB, hence WSNB. To prove the ”only if” part, we claim that if m = 2n − 2, then there exists a blocking state. It is well known [9] that it suffices to prove for the minimum r which is 2 here. By Lemma 2.2.2 and 2.2.4, the state in which cell (1, 1) contains 11.

(20) [1, n−1] and cell (2, 2) contains [n, 2n−2] can be reached. But a new request in cell (1, 2) is blocked. Hence m must be grater than 2n − 2. Theorem 2.2.6. For P, hence STU, C(n, m, r) , r ≥ 3, is WSNB if and only if m ≥ 2n − 1. Proof. The ”if” part is trivial. We prove the ”only if” part by showing that for r = 3 there exists a sequence of calls and disconnections forcing the use of 2n − 1 middle switches: [1, n] [1, n]. n → n + 1 [1, n − 1]. n n+1 → n + 1 [1, n − 1] →. n. n+1 [1, n − 1]. n+1 n n+1 n [n + 1, n + 2] → n + 2 [1, n − 1] → n + 2 [1, n − 1] n+1 n+1 n. [n + 1, n + 2] [1, n − 1]. →. → ···. [n + 1, n + 2] n. [n + 1, 2n − 2] [1, n − 1]. → [n + 1, 2n − 2] →. n [n + 1, 2n − 2] 2n − 1 [1, n − 1] [n + 1, 2n − 2]. Note that this proof is much more elementary than the proof in [7]. For MI, we first prove a lemma. Lemma 2.2.7. Consider a state s in C(n, m, 2) consisting of x requests from I1 to O1 carried by the set X of middle switches, and y requests from 12.

(21) I2 to O2 carried by the set Y of middle switches such that X ∩ Y = ∅, X ∪ Y = {1, . . . , x + y}, i.e. cell (1, 1) is X and cell (2, 2) is Y . Then a state s0 can be obtained from s, where s0 is same as s except that x becomes x0 , and y becomes y 0 = x + y − x0 . Proof. Without loss of generality, assume x0 > x(otherwise we work with y). Disconnect x0 − x requests whose indices are smallest in Y from s. Add x0 − x new requests in cell (1, 1). By the MI rule, these new requests must be carried by S. Thus s0 is obtained. Theorem 2.2.8. C(n, m, r) for r ≥ 2 is WSNB under MI if and only if m ≥ 2n − 1. Proof. It suffices to prove that m = 2n − 1 is necessary for WSNB for r = 2. By induction on n, suppose m = 2n − 3 is necessary for C(n − 1, m, 2) to be WSNB. Therefore there exists a state X 2n − 3 Y in C(n, 2n − 1, 2), such that x = y = n − 2, X ∪ Y = {1, · · · , 2n − 4}. Therefore we can obtain a state s0 from s by adding 2n − 2 to the (1, 2) cell. Delete the four calls carried by [2n − 7, 2n − 4] in the (1, 1) and (2, 2) cells, and use Lemma 2.2.7 to rebalance the members of calls carried by them, i.e., each carrying n − 4 calls. Assign [2n − 7, 2n − 4] to cell (2, 1). Next we delete [2n−11, 2n−8] from cells (1, 1) and (2, 2), do the balancing and assign [2n − 11, 2n − 8] to cell (1, 2). Repeatedly doing so, eventually (the last step may delete only two calls) we reach a state consisting of 2n − 2 distinct indices in cells (1, 2) and (2, 1). Thus a new (1, 1) request must be carried by M2n−1 . Example 1. The following example shows that C(n, m, 2) can be routed through 2n − 1-th middle crossbar under MI for n ≤ 6. “⇒” means to do. 13.

(22) the balancing. n=1:. 1. n=2:→. 1, 2. →. 2 1. →. 2 3 1. n=3:→. 2 3, 4 3, 4 5 3, 4 → → 1 1, 2 1, 2. n=4:→. 5, 6 3, 4 5, 6 1, 5, 6 → ⇒ 1, 2 [1, 4] 2, 3, 4. → n=5:→ → n=6:→ ⇒. 1, 5, 6. 1, 5, 6. 7 2, 3, 4. 1 7, 8 7, 8 7, 8 → → 2, 3, 4 [3, 6] 2 [3, 6]. 9 1, 2, 7, 8 [3, 6] 9, 10 1, 2, 7, 8 9, 10 1, 2 [1, 4], 9, 10 → → [3, 6] 3, 4 [5, 8] [5, 8] [2, 4], 9, 10 1, [5, 8]. →. [2, 4], 9, 10. 11 1, [5, 8]. Corollary 2.2.9. For 3-stage Clos network C(n, m, r), let s be the state where X, Y , and Z are in cells (i1 , j2 ), (i2 , j1 ), and (i1 , j1 ), respectively, X. Z Y. where, min{Z} > k, X ∩ Y = ∅, X ∪ Y = [1, k], k ≤ 2(n − |Z|). For each α ≤ k and max{α, k − α} ≤ (n − |Z|), let fα (s) be the state which has fα (X) in cell (i1 , j2 ), |fα (X)| = α, and fα (Y ) in cell (i2 , j1 ), such that fα (X) ∩ fα (Y ) = ∅, fα (X) ∪ fα (Y ) = [1, k]. Then fα (s) can be reached from s under MI. 14.

(23) 2.3. Wide-sense nonblocking in asymmetric case. Without loss of generality, we assume b nn12 c = k ≥ 1 throughout this section. If n1 ≥ r2 n2 , then m = r2 n2 is necessary and sufficient for C(n1 , r1 , m, n2 , r2 ) to be either SNB or WSNB. Therefore we assume r2 > nn21 , or r2 ≥ d n1n+1 e. 2 Theorem 2.3.1. C(n1 , r1 , m, n2 , r2 ) for r2 ≥ 2 is WSNB under CS and CD if and only if m ≥ n1 + n2 − 1. Proof. The ”if” part is trivial since C(n1 , r1 , n1 + n2 − 1, n2 , r2 ) is SNB. To prove the ”only if” part, we show that if m = n1 + n2 − 2, then there exists a blocking state. Clearly, we can reach the state [1, n2 ] [n2 + 1, 2n2 ] . . .. [kn2 + 1, n1 − 1] [n1 , n1 + n2 − 2]. (if [kn2 + 1, n1 − 1] is an empty set, then the corresponding column does not exist). Since row 1 has only n1 −1 entries and the last column has only n2 −1 entries, one new connection can be requested in the cell (1, d nn12 e + 1), but no middle switch is available. The MI case is as following. We first prove a lemma. Lemma 2.3.2. C(n1 , r1 , m, n2 , r2 ) with n1 = n2 + 1, min{r1 , r2 } ≥ 2, is not WSNB under MI if m < 2n2 . Proof. We prove, by induction on n2 , the existence of a state which must use 2n2 middle switches. (i) n2 = 2, [1, 2]. 3. →. [1, 2] 3. →. [1, 2]. 4 3. (ii) suppose that for n2 = n the statement is true. (iii) n2 = n + 1, since for n2 = n the statement is true, we can reach a state s,. 15.

(24) X. 2n , |X| = n, |Y | = n − 1, X ∩ Y = ∅, and X ∪ Y = [1, 2n − 1]. Y. Add 2n + 1 to cell (1, 2), since n2 = n + 1, and delete the four numbers [2n − 4, 2n − 1] from cell (1, 1) and (2, 2). By noting that Corollary 2.2.9 also applies to the asymmetric 3-stage clos network,we can get a state s1 , X1. [2n, 2n + 1] , |X1 | = n − 3, |Y1 | = n − 2, X1 ∩ Y1 = ∅, and Y1 X1 ∪ Y1 = [1, 2n − 5].. Then, we add [2n − 4, 2n − 1] to cell (2, 1), and delete the four numbers [2n − 8, 2n − 5] from cell (1, 1) and (2, 2). By Corollary 2.2.9 again, we can reach a state s2 , X2 [2n, 2n + 1] , |X2 | = n − 4, |Y2 | = n − 5, [2n − 4, 2n − 1] Y2 X2 ∩ Y2 = ∅, and X2 ∪ Y2 = [1, 2n − 9]. Repeat the above steps, without loss of generality, we reach a state s0 , X0 Y. 0. , |X 0 | = n, |Y 0 | = n + 1, X 0 ∩ Y 0 = ∅, and X 0 ∪ Y 0 = [1, 2n + 1].. Finally, we add 2n + 2 to cell (2, 2).. Corollary 2.3.3. C(n1 , r1 , m, n2 , r2 ) with n2 < n1 < 2n2 , r1 ≥ 2, r2 = 2, is not WSNB under MI if m < 2n2 . Theorem 2.3.4. C(n1 , r1 , m, n2 , r2 ) with n1 > n2 , min{r1 , r2 } ≥ 2, is WSNB under MI if and only if m ≥ min{n1 + n2 − 1, r2 n2 }.. 16.

(25) Proof. The ”if” part is trivial. To prove the ”only if” part, it suffices to show for r1 = 2. Case 1. n1 ≤ (r2 − 1)n2 , assume n1 = pn2 + q, 0 ≤ q < n2 . Clearly we can reach the state [1, n2 ] [n2 + 1, 2n2 ] . . .. [x, n1 − n2 ]. ,where x = (p − 2)n2 + 1, if q = 0; x = (p − 1)n2 + 1,if q 6= 0. We can also move [1, n1 − n2 ] from first row to second row by moving cell by cell in the order from left to right. Our focus is actually on the last two columns, i.e., the 2 × 2 submatrix M . The Set [1, n1 − n2 ] in the first p − 1 or p columns serves the sole purpose that all entries in M are larger than n1 − n2 . This is achieved by moving the set[1, n1 − n2 ] to the row where entries are to be added in M . The entries are added according to the proof of Lemma 2.2.7. Hence, eventually, we reach the state [1, n2 ] [n2 + 1, 2n2 ]. .... [x, n1 − n2 ] . . .. X Y. , |X| = |Y | = n2 − 1, X ∩ Y = ∅, and X ∪ Y = [n1 − n2 + 1, n1 + n2 − 2]. Finally, add n1 + n2 − 1 to cell (1, r2 ). Case 2. (r2 − 1)n2 < n1 < r2 n2 , which implies (n1 + n2 − 1) ≥ r2 n2 . Clearly, we can reach the state [1, n2 ] . . .. [(r2 − 3)n2 + 1, (r2 − 2)n2 ]. Similar to Case 1, we can reach the state [1, n2 ] . . .. [(r2 − 3)n2 + 1, (r2 − 2)n2 ]. X Y. 17. ,.

(26) |X| = n2 , |Y | = n2 −1,X ∩Y = ∅,and X ∪Y = [(r2 −2)n2 +1, r2 n2 −1]. Finally,add r2 n2 to cell (1, r2 ). Case 3. r2 n2 ≤ n1 . This is a trivial case with m = r2 n2 .. Finally, we study the packing and STU strategies. Let Xij denote the set of connections from Ii to Oj . We first prove Lemma 2.3.5. Suppose n1 ≥ n2 . Then |X11 ∪ X22 | ≤ n2 , |X12 ∪ X21 | ≤ n2 . Proof. Suppose not, say, |X11 ∪ X22 | = n2 + 1. Let y denote the (n2 + 1)st middle switch added to cell (1, 1) or cell (2, 2). Without loss of generality, assume y is added to cell (2, 2). Then X11 /X22 = ∅ since otherwise, the (I2 , O2 ) connection should be routed through a middle crossbar in X11 /X22 by the packing strategy. Therefore X11 ∪ X22 = X22 , and |X11 ∪ X22 | ≤ n2 − 1 since cell (2, 2) can have at most n2 connections, including y, contradicting the assumption that y is the (n2 + 1)st middle switch in X11 ∪ X22 . Similarly, we can prove |X12 ∪ X21 | ≤ n2 . Theorem 2.3.6. Suppose n1 ≥ n2 . Then C(n1 , 2, m, n2 , 2) is wide-sense nonblocking under the packing or the STU strategy if and only if m ≥ min{2n2 , n2 + bn1 /2c}. Proof. Suppose n1 ≥ 2n2 . Consider 2n2 connections for an input switch. They must be routed through 2n2 distinct middle switches. On the other hand, there are at most 2n2 connections, hence 2n2 middle switchs suffice. Next suppose n1 ≤ 2n2 .. 18.

(27) Necessity. [1, n2 ] [1, n2 ]. → →. [1, n2 − bn1 /2c] [n2 − bn1 /2c] + 1, n2 ] [1, n2 − bn1 /2c] [n2 + 1, n2 + bn1 /2c] [n2 − bn1 /2c] + 1, n2 ]. The last state has n2 − bn1 /2c + bn1 /2c + bn1 /2c = n2 + bn1 /2c elements. Sufficiency. Suppose to the contrary that there exists a state such that a new request under the packing strategy will force the use of an idle middle crossbar y which will be the (n2 + bn1 /2c + 1)st middle crossbar in use. Without loss of generality, assume y is in cell (2, 2). Then by an argument analogous to the one used in proving Lemma 2.3.5, X11 ⊆ X22 in that state. Therefore X11 ∪ X12 ∪ X21 ∪ X22 = X12 ∪ X21 ∪ X22 . Further |X12 ∪ X21 | ≤ n2. (by Lemma 2.3.5). |X12 ∪ X22 | ≤ n2 , |X21 ∪ X22 | ≤ n1 . Hence |X12 ∪ X21 ∪ X22 | ≤ (n2 + n2 + n1 )/2, or |X12 ∪ X21 ∪ X22 | ≤ n2 + bn1 /2c.. Note that the proof of sufficiency is simpler than Beneˇs original proof for the symmetric network. Theorem 2.3.7. Suppose n1 ≥ n2 and max{r1 , r2 } ≥ 3. Then C(n1 , r1 , m, n2 , r2 ) is wide-sense nonblocking if and only if m ≥ min{r2 n2 , n1 + n2 − 1}. 19.

(28) Proof. The ”if” part is trivial since the condition already guarantees strict nonblockingness by an extension of Clos result [6] to the asymmetric case. We now prove the ”only if” part. If n1 ≥ r2 n2 , then trivially, m ≥ r2 n2 is necessary. Therefore we assume n1 < r2 n2 . Case (i) r2 = 2, r1 ≥ 3. [1, n2 ] n2 →. [1, n2 − 1] [1, n2 − 1] n2 + 1 n2 → n2 + 1. [1, n2 − 1], n2 + 1. [1, n2 − 1], n2 + 1 n2 → n2 + 1. →. →. n2 n2 + 1 [n2 , n2 + 1]. [1, n2 − 1] n2 + 2 [n2 , n2 + 1]. Repeat such an operation, eventually we obtain [1, n2 − 1] [n2 , n2 + bn1 /2c − 1] [1, n2 − 1] → n2 + bn1 /2c [n2 , n2 + bn1 /2c − 1] Case (ii) r2 ≥ 3, r1 = 2. Subcase (1): n1 > (r2 − 1)n2 . First, [1, n2 ]. [n2 + 1, 2n2 ]. ···. [(r2 − 3)n2 + 1, (r2 − 2)n2 − 1]. [(r2 − 2)n2 , (r2 − 1)n2 − 2]. Now, consider the last three columns. Define A = [(r2 − 3)n2 + 1, (r2 −. 20.

(29) 2)n2 − 1] and B = [(r2 − 2)n2 , (r2 − 1)n2 − 1]. Then A B, (r2 − 1)n2 − 1 → →. A B, (r2 − 1)n2 − 1 (r2 − 1)n2 − 1 A B. (r2 − 1)n2 (r2 − 1)n2 − 1. →. A B, (r2 − 1)n2 (r2 − 1)n2 (r2 − 1)n2 − 1. →. A B (r2 − 1)n2 (r2 − 1)n2 − 1. → → →. A B, (r2 − 1)n2 (r2 − 1)n2 − 1 A B (r2 − 1)n2 − 1, (r2 − 1)n2. → ···. A B [(r2 − 1)n2 − 1, r2 n2 − 3]. define C = [(r2 − 1)n2 − 1, r2 n2 − 3] → → → → → →. A B r2 n2 − 2 A B r2 n2 − 2 → C r2 n2 − 2 C A, r2 n2 − 2. B r2 n2 − 2 C. A, r2 n2 − 2 B r2 n2 − 1 A, r2 n2 − 2 B → C r2 n2 − 1 C A. B A, r2 n2 − 1 B → r2 n2 − 1 C, r2 n2 − 2 C, r2 n2 − 2. A, r2 n2 − 1 B, r2 n2 − 2 C [1, n2 ] [n2 + 1, 2n2 ] · · ·. A, r2 n2 − 1 B, r2 n2 − 2 r2 n2 C. Subcase (2): n1 ≤ (r2 − 1)n2 . First, 21.

(30) [1, n2 ]. [n2 + 1, 2n2 ]. ···. [n1 − 2n2 + 1, n1 − n2 − 1]. [n1 − n2 , n1 − 2]. Also, consider the last three columns, define A = [n1 −2n2 +1, n1 −n2 −1] and B = [n1 − n2 , n1 − 2]. Similar, we can get the following state A B [(n1 − 1, n1 + n2 − 3] define C = [(n1 − 1, n1 + n2 − 3] → → →. A B n1 + n2 − 2 A B n1 + n2 − 2 → C n1 + n2 − 2 C A B, n1 + n2 − 2 C [1, n2 ] [n2 + 1, 2n2 ] · · ·. A B, n1 + n2 − 2 n1 + n2 − 1 C. Case (iii) r1 ≥ 3, r2 ≥ 3. Let p = bn1 /n2 c. Then n2 ≤ n1 < r2 n2 implies 1 ≤ p < r2 . It suffices to prove the state [1, n2 ] [n2 + 1, 2n2 ] · · ·. [(p − 2)n2 + 1, (p − 1)n2 ]. s= can be reached since we can use Case (i) for the remaining r1 × (r2 − p + 1) array with n02 ≤ n1 < 2n02 . Hence the total number of middle crossbars used is (p − 1)n2 + (n01 + n2 − 1) = n1 + n2 − 1. [1, n2 ] → →. [1, n2 ] [n2 + 1, 2n2 ] [1, n2 ] [n2 + 1, 2n2 ] [2n2 + 1, 3n2 ]. →··· → s. 22.

(31) Chapter 3 Multi-logd N networks The multi-logd N network, first proposed by Lea [12], can be obtained by substituting each middle crossbar of a 3-stage Clos stage with a logd N network. In this chapter, we study the five WSNB strategies in multi-logd N networks and show that the cost is still the same as SNB.. 3.1. Strictly nonblocking. For strictly nonblocking, Shyy and Lea [13] proved the following theorem for d = 2 and Hwang [8] extended it to the d-nary version. Theorem 3.1.1. Multi-logd N network with p copies is strictly nonblocking if p ≥ p(n), where ( n (d + 1) × d 2 −1 − 1 for n even, p(n) = n−1 2×d 2 −1 for n odd. Proof. We consider the graph model of a copy (baseline network). For the note in the j-th link stage is the path of a request (i, o), j = 1, . . . , n, there are at most k(j) requests can intersect it and doesn’t intersect the other node on the path, where ( dj − dj−1 for j < n/2, k(j) = n−j n−j+1 d −d otherwise. 23.

(32) Therefore, there are at most  n/2−1  X    2 (dj − dj−1 ) + (dn/2 − dn/2−1 ) for n even,  n  X j=1 (k(j) − 1) = (n−1)/2  X  j=0  2 (dj − dj−1 ) for n odd.   j=1 ( n (d + 1) × d 2 −1 − 2 for n even, = n−1 2×d 2 −2 for n odd. If these requests are route in distinct copies, then (i, o) must route in another copy. Hence, the theorem holds. Any request has a unique path in a logd N network. Hence two intersecting paths must be routed through different copies of logd N network. Theorem 3.1.1 was stated in [8] only as a sufficient condition. Chang, Guo, and Hwang [5] proved that it is also necessary. Theorem 3.1.2. Multi-logd N network with p copies is strictly nonblocking only if p ≥ p(n). Proof. For any request γ = (x, y), assume that the path of γ consists of links L0 , L1 , . . . , Ln . For n odd, let I1 (O2 ) be the set of inputs(outputs), except n−1 n+1 x(y), which can reach L n−1 , then |I1 | = d 2 − 1 and |O2 | = d 2 − 1. Let 2 O1 (I2 ) be the set of outputs(inputs), except y(x), which can reach L n+1 . n−1. n+1. 2. Then |O1 | = d 2 − 1 and |I2 | = d 2 − 1. Note that γ cannot be routed through the same copy with any request from I1 to O2 or I2 to O1 . Suppose p = p(n) − 1 while |I1 | requests from I1 to O2 \ O1 and |O1 | requests from O1 to I2 \ I1 have already been connected in different copies. In this case, they can occupy |I1 | + |O1 | = p(n) − 1 = p copies, with no copy left for γ. For n even, let I1 (O2 ) be the set of inputs(outputs), except x(y), which can reach n n L n2 −1 , then |I1 | = d 2 −1 − 1 and |O2 | = d 2 +1 − 1. Let O1 (I2 ) be the set of n outputs(inputs), except y(x), which can reach L n2 +1 . Then |O1 | = d 2 −1 − 1 n and |I2 | = d 2 +1 − 1. Let I3 (O3 ) be the set of inputs(outputs), except x(y), 24.

(33) n. which can reach L n2 . Then |I3 | = |O3 | = d 2 . Note that γ cannot be routed through the same copy with any request from I1 to O2 , I2 to O1 , or I3 to O3 . Suppose p = p(n) − 1 while |I1 | requests from I1 to O2 \ O3 , |O1 | requests from O1 to I2 \ I3 , and |I3 \ I1 | requests from I3 \ I1 to O3 \ O1 have already been connected in different copies. In this case, they can occupy |I1 | + |O1 | + |I3 \ I1 | = |I1 | + |O1 | + |O3 \ O1 | = p(n) − 1 = p copies, with no copy left for γ. Hence p must be greater than or equal to p(n). We call such a set of p(n) − 1 requests blocking γ the maximal blocking configuration (MBC), denote by M (n, γ). Note that if a network is SNB, then it is also WSNB. i.e. multi-logd N is WSNB if p ≥ p(n). Therefore, we only need to prove necessity in the following proofs. In all these proofs, we assume that the network carries no traffic at the beginning.. 3.2. Wide-sense nonblocking. We consider strategy CD first. Theorem 3.2.1. Multi-logd N network with p copies is WSNB under CD if and only if p ≥ p(n). Proof. Suppose p < p(n). Consider a sequence of p + 1 requests with p requests from M (n, γ) followed by the request γ. By the property of strategy CD, these p requests will be routed in p copies. Then we cannot route γ any more. Hence p must be greater than or equal to p(n). For strategy CS, Theorem 3.2.2. Multi-logd N network with p copies is WSNB under CS if and only if p ≥ p(n). Proof. Suppose p < p(n). For a request γ and any p requests of M (n, γ), say γ1 , γ2 , . . . , γp , route γ1 in copy 1, then route γ in copy 2(because γ1 blocks γ 25.

(34) in copy 1). Then disconnect γ and route γ2 in copy 2. Then route γ in copy 3. Again disconnect it and route γ3 in copy 3. Doing this iteratively until γp is routed in copy p. Then γ cannot be routed any more. Hence p must be greater than or equal to p(n). For strategies P or STU, we introduce a lemma. Lemma 3.2.3. For any request γ and M (n, γ), there exists a request γ 0 which does not block γ or any request in M (n, γ) in the logd N network. Proof. Use the graph model of the baseline network as an example. Without loss of generality, let γ = (0, 0). For all requests (i, j) in M (n, γ), we obtain i < Nd and j < Nd . Hence γ 0 = (N − 1, N − 1) will satisfy our claim. Theorem 3.2.4. Multi-logd N network with p copies is WSNB under P or STU if and only if p ≥ p(n). Proof. Suppose to the contrary, p < p(n). For any request γ and any p requests of M (n, γ), say γ1 , γ2 , . . . , γp , we route γ1 in copy 1 first. Then route γ in copy 2 and route γ 0 in copy 2 (because copy 1 are as busy as copy 2, we can choose copy 2). Now, we disconnect γ and route γ2 in copy 2. Then disconnect γ 0 . Similarly, we route γ in copy 3 and γ 0 in copy 3, then disconnect γ and route γ3 in copy 2. Finally, we route γp in copy p. Then γ cannot be routed any more. Hence p must be greater than or equal to p(n). MI is more complicated. It’s amazing that we construct a relation between 3-stage Clos network and multi-logd N network, and then we can apply the same process in Theorem 2.2.8 to multi-logd N network. Theorem 3.2.5. Multi-logd N network with p copies is WSNB under MI if and only if p ≥ p(n). Proof. We discuss two cases:. 26.

(35) stage n2 2 0. I2. n1 2 0. n2 2 0. 1. I1. d. n1 2 0. n 1 2. O1. d. n 1 2. d. I1. O1. I2. O2. n 1 2. O2. Figure 3.1: The left figure is an induced graph of the graph model of a multilogd N network, for n odd. And the right figure is its correspondence to a 3-stage Clos network.. (i) n is odd. Select two subset I1 and I2 of inputs and two subset O1 n−1 and O2 of outputs. Set I1 = O1 = {0, 1, 2, . . . , d 2 − 1}, I2 = O2 = n−1 n−1 {d 2 , . . . , 2 × d 2 − 1}. See Figure 3.1. By the configuration of baseline network, every request from I1 to O1 ∪ O2 must intersect node 0 in stage n−1 and every request from I2 to O1 ∪ O2 must intersect 2 node 1 in stage n−1 . Therefore, for i = 1 or 2, all requests from Ii 2 to O1 ∪ O2 must use different copies. Similarly, every request from I1 ∪ I2 to O1 must intersect node 0 in stage n+1 and every request from 2 n−1 I1 ∪ I2 to O2 must intersect node d 2 in stage n+1 . Therefore, for 2 i = 1 or 2, all requests from I1 ∪ I2 to Oi must use different copies. n−1 Now, we match this to a 3-stage Clos network C(d 2 , 1, 2), where Ii is the i-th input switch, Oi is the i-th output switch, for i = 1 or 2, and the complete bipartite graph induced by nodes 0 and 1 of stage n−1 n−1 and nodes 0 and d 2 of stage n+1 is the middle switch. Then 2 2 n−1 a request (i, j) in C(d 2 , p, 2) routed through the k-th middle switch under MI corresponds to a request (i, j) in the multi-logd N using copy. 27.

(36) n 1 2 0. 0. 0. 1. I1. n 1 2. n 2. 0. 0. 1. O1. d 1. d. n 2. 1. d. I1c. n 2. 1. n 2. d. 1. O1c. d. d 2 1 n. n. d2. d 1. d 2 1 n. 2d 1. n. d 1 n 2. n. n. d2. d2. d2. I2. O2. d 2 1 n. n 1. (d  1)d 2. n 1. (d  1)d 2. I 2c. 2d 2  1. O2c. n. 2d  1 n 2. 2d  1 n 2. Figure 3.2: This is an induced graph of the graph model of a multi-logd N network, for n is even. k. Therefore, by Theorem 2.2.8, the network is not WSNB if p < 2 · (d. n−1 2. )−1=2×d. n−1 2. − 1 = p(n).. (ii) n is even. Select four subset I1 , I10 , I2 and I20 of inputs and four subset n O1 , O10 , O2 , and O20 of outputs. Set I1 = O1 = {0, 1, 2, . . . , d 2 −1 − 1}, n n n n I10 = O10 = {d 2 −1 , . . . , d 2 − 1}, I2 = O2 = {d 2 , . . . , (d + 1)d 2 −1 − 1}, n n and I20 = O20 = {(d + 1)d 2 −1 , . . . , 2 × d 2 − 1}. See Figure 3.2. Then every request from I1 to O1 ∪ O2 must intersect node 0 in stage n2 − 1, every request from I2 to O1 ∪ O2 must intersect node d in stage n2 − 1, every request from I1 ∪ I2 to O1 must intersect node 0 in stage n2 + 1, n and every request from I1 ∪ I2 to O2 must intersect node d 2 in stage n + 1. Similar to case (i), we can treat I1 , I2 , O1 , O2 as the inputs and 2 n outputs of C(d 2 −1 , 1, 2), and the subgraph sketch in bold line in Figure 28.

(37) 3.2 is the middle switch. Therefore, by Theorem 2.2.8, the network is not WSNB if n p < 2 · (d 2 −1 ) − 1. (3.2.1) Besides, we observe that, for i, j = 1, 2, every request from Ii to Oj must block every request from Ii0 to Oj0 in the same node in the stage n n . Therefore, if we connect all (d − 1)d 2 −1 requests in Ii0 to Oj0 in copy 2 n 0 to copy (d − 1)d 2 −1 − 1 before every time we connect a request γ from Ii to Oj and disconnect them after γ connected, then we can force n the copy chosen to route γ begin at least (d − 1)d 2 −1 -th copy. Hence (3.2.1) can be enlarged to n. n. p < 2 × (d 2 −1 ) − 1 + (d − 1)d 2 −1 = p(n).. Note that, in Theorem 3.2.5, it doesn’t need to consider all inputs and outputs, because I1 ∪ I2 and O1 ∪ O2 are enough to force p ≥ p(n) which is the bound of SNB. Example 2. Here is an example that multi-log2 32 network can be routed through 7-th copy under MI. A triple (i, o, k) means the request (i, o) routes through k-th copy and (i, o, k)− means the request (i, o) routed through k-th copy disconnected. (0, 0, 1) → (1, 1, 2) → (0, 0, 1)− → (4, 4, 1) → (0, 5, 3) → (2, 6, 4) →(1, 1, 2)− → (4, 4, 1)− → (4, 0, 1) → (5, 1, 2) → (1, 2, 5) → (3, 3, 6) →(4, 0, 1)− → (5, 1, 2)− → (0, 5, 3)− → (2, 6, 4)− → (4, 4, 1) → (5, 5, 2) →(6, 6, 3) → (7, 7, 4) → (4, 4, 1)− → (0, 0, 1) → (2, 4, 7). Example 3. Here is an example that multi-log2 64 network can be routed through 11-th copy under MI. Note that every step following start from 5-th. 29.

(38) copy because the 4 requests from Ii0 to Oj0 through 1-st to 4-th copy. (0, 0, 5) → (1, 1, 6) → (0, 0, 5)− → (8, 8, 5) → (0, 9, 7) → (2, 10, 8) →(1, 1, 6)− → (8, 8, 5)− → (8, 0, 5) → (9, 1, 6) → (1, 2, 9) → (3, 3, 10 →(8, 0, 5)− → (9, 1, 6)− → (0, 9, 7)− → (2, 10, 8)− → (8, 8, 5) → (9, 9, 6) →(10, 10, 7) → (11, 11, 8) → (8, 8, 5)− → (0, 0, 5) → (2, 8, 11).. 3.3. Generalizations. In this chapter, we extend our results to a class of networks including the 3-stage Clos networks, the multi-logd N and the logd (N, k, m) networks as special cases. A vertical-copy network V consists of an input stage of r1 n1 ×m crossbars, an output stage of r2 m × n2 crossbars and a middle stage of m copies of a network ν with r1 inputs and r2 outputs. There exists exactly one link between each input(output) crossbar and each copy of ν. When ν is the r1 × r2 crossbar, V is a 3-stage Clos network. When n1 = n2 = 1 and ν is the logd N network, V is a multi-logd N network. When n1 = n2 = 1 and ν is the k-extra-stage logd N network, then V is the logd (N, k, m) network. In particular, if k = n − 1, then V is the Cantor network. Suppose that the necessary and sufficient condition for ν to be SNB is known. Consider p = p(n) − 1. For any request γ, there must be a state s such that γ is blocked in each of the p(n) − 1 copies ν1 , ν2 , . . . , νp(n)−1 . Let Ri be the set of all requests routing through νi in s and M (ν, γ) = {Ri | i = 1, 2, . . . , p(n) − 1}. i.e., V is SNB if and only if the number of copies is larger than |M (ν, γ)|. Let “Route Ri in νj ” mean “Route all requests in Ri in νj consecutively”. Theorem 3.3.1. A vertical-copy network V is WSNB under the CS routing if and only if V is SNB. Proof. Suppose there are p < p(n) copies ν1 , ν2 , . . . , νp in V . For a request γ, we route R1 in ν1 , then route γ in ν2 (γ is blocked in ν1 ). Then disconnect γ 30.

(39) and route R2 in ν2 . Then route γ in ν3 . Again disconnect it and route R3 in ν3 . Doing this iteratively until Rp is routed in νp . Then γ cannot be routed in any copy. Hence p must be greater than or equal to p(n). For CD, we use another argument. Theorem 3.3.2. A vertical-copy network V is WSNB under the CD routing if and only if V is SNB. Proof. First, we claim every request γ can be routed in νk for a given k. Route γ in νi . If i 6= k, then disconnect γ and route it again in νi+1 . Similarly, if i + 1 6= k, then disconnect γ and route it again in νi+2 until γ is routed in νk . Note that if i = p, then we let i + 1 be 1. Therefore, if p < p(n), then we can route Ri in νi for i = 1 to p as we want. Then γ cannot be routed in any copy. Hence p must be greater than or equal to p(n). For STU, if there exists a request γi0 which does not block {γ} ∪ Ri for all i, theorem 3.2.4 remains true if M (n, γ) is replaced by M (ν, γ) and γi is replaced by Ri . But we use a different argument for P. Theorem 3.3.3. Suppose there exists a request γi0 which does not block {γ}∪ Ri for all i. A vertical-copy network V is WSNB under the P routing if and only if V is SNB. Proof. It suffices to prove the “only if” part. Suppose there are only p = p(n) − 1 copies ν1 , ν2 , . . . , νp in V . For the request γ = (0, 0), without loss of generality, suppose Ri = {γi,j | j = 1, . . . , λi } and λ1 ≤ λ2 ≤ · · · ≤ λp . Let |νi | denote the number of connections in νi . For a given k, let s(k, B) be a state satisfying the following conditions: (i) |νk | < λk , (ii) Connections in νi are those from Ri , (iii) |νi | = |νk | + 1 or |νi | = λi if i ∈ B, where B denotes the set of i such that |νi | > |νk |. 31.

(40) Let S(k) denote the state that νi contains Ri for all 1 ≤ i ≤ k. We make two claims: Claim A. We can add another connection δ of Rk in νk in state s(k, B). Claim B. S(k) can be realized. We prove both claims by induction on k. For k = 1, then B = ∅. Clearly, we can add δ to ν1 , and keep on adding other connections until νi contains Ri . So consider general k > 1. From s(k, B) we can obtain the state s∗ (k, B), which differs from s(k, B) by having νi containing Ri for all 1 ≤ i ≤ k − 1, by applying induction to claim B(with k = k − 1). In state s∗ (k, B), γ must be routed in νk . Now delete all connections in s∗ (k, B) \ s(k, B) so that |νk | ≥ |νi | for all i. Then γk0 can be routed in νk . Delete γ and route δ in νk . Delete γk0 and Claim A is proved. Also, we can keep on adding all remaining connections of Rk to νk to prove Claim B. Setting k = p in Claim B, then γ cannot be routed in any of the p copies. Hence at least p(n) copies are needed.. Example 4. For simplicity, we will represent a state by its |ν|-sequence. To help clarify the state, let |νi |∗ denote the fact that γ is in the νi , |νi |0 the fact that γ 0 is and |νi |00 the fact that both are. Suppose p = 3 and we want to reach the state S(3) = (λ1 , λ2 , λ3 ) = (2, 3, 4). The the |ν|-sequence of our construction in Theorem 3.3.3 would be: (0, 0, 0). ⇒(1, 0, 0). ⇒(2, 0, 0). ⇒(2, 1∗ , 0) ⇒(1, 1∗ , 0) ⇒(1, 200 , 0). ⇒(1, 10 , 0). ⇒(1, 20 , 0). ⇒(1, 1, 0). ⇒(2, 1, 0). ⇒(2, 2∗ , 0) ⇒(2, 300 , 0). ⇒(2, 20 , 0). ⇒(2, 30 , 0). ⇒(2, 2, 0). ⇒(2, 3, 0). ⇒(2, 3, 1∗ ) ⇒(1, 1, 1∗ ). ⇒(1, 1, 200 ) ⇒(1, 1, 10 ). ⇒(1, 1, 20 ) ⇒(1, 1, 1). ⇒(2, 1, 1). ⇒(2, 2∗ , 1). ⇒(2, 300 , 1) ⇒(2, 20 , 1). ⇒(2, 30 , 1) ⇒(2, 2, 1). ⇒(2, 3, 1). ⇒(2, 3, 2∗ ). ⇒(2, 2, 2∗ ) ⇒(2, 2, 300 ) ⇒(2, 2, 20 ) ⇒(2, 2, 30 ) ⇒(2, 2, 2). ⇒(2, 3, 2). ⇒(2, 3, 3∗ ) ⇒(2, 3, 400 ) ⇒(2, 3, 30 ) ⇒(2, 3, 40 ) ⇒(2, 3, 3). ⇒(2, 3, 4).. Therefore, we obtain the state S(3). 32.

(41) ӫ a b. ӫ c d. a b. c d. Figure 3.3: γ and M (V, γ) = {a, b, c, d} in C(3, 4, 2) Corollary 3.3.4. logd (N, k, m) is WSNB under any of CS, CD, STU, and P if and only if it is SNB, i.e., [8], ( n−k k + 3 · 2 2 −1 − 2, for n − k even, m> n−k+1 k + 2 2 − 2, for n − k odd. Proof. Note that logd (N, k, m) is a vertical copy network. Then the results for CS and CD follow from Theorem 3.3.1 and 3.3.2. For P and STU, it is easily verified that γi0 = (N − 1, N − 1) doesn’t block any request in {γ} ∪ Ri for all i Then the results follow from Theorems 3.3.3. That packing is a good routing strategy has been a folklore for a long time and documented in literature [1]. One motivation for that folklore is that C(n, m, 2) is WSNB under P if and only if m ≥ b 3n c [1], while it is SNB 2 if and only if m ≥ 2n − 1. The seemingly discrepancy between the m ≥ b 3n c 2 result and Theorem 3.3.3 is explained by the fact that γ 0 does not exist in C(n, m, 2) since M (V, γ) occupies both input switches(see Figure 3.3). For r ≥ 3, C(n, m, r) is WSNB under P if and only if it is SNB. Thus the saving of C(n, m, 2) under P seems to be a fluke rather than a testimony of its goodness. In this chapter, again we showed that in the worst-case scenario, 33.

(42) P does not help. Instead, MI is the only routing strategy which is still not ruled out to be useful.. 34.

(43) Chapter 4 Conclusions and future works From Chapter 2 and 3, we see that the costs of WSNB under the five routing strategies on the 3-stage Clos network, the multi-logd N , and some more general vertical-copy networks are same as SNB, except C(n, m, 2) with packing strategy. It seems like WSNB is no better than SNB. But for multicast traffic, Yang-Masson [17] suggested a WSNB algorithm such that the required number of middle crossbars of the 3-stage Clos network is strictly less than SNB. For log2 N networks, Tscha and Lee [16] used the window algorithm in the log2 (N, 0, m) network (also see Kabacinski and Danilewicz [11]) to give another example that the number of copies in WSNB is less than in SNB. Recently, Hwang and Lin [10] further extended their result to log2 (N, k, m). Note that if the window algorithm is used on one-to-one (1-cast) traffic, then the result is still same as SNB. For larger r, Tsai, Wang, and Hwang [15] proved that C(n, m, r) is not WSNB under any algorithm. We extended it to the asymmetric version (See Appendix for the proof). ¢ ¡ 2 −2 + 1, C(n1 , r1 , m, Theorem 4.1. For n1 r1 ≤ n2 r2 and r1 ≥ (n2 − 1) n1n+n 1 −1 n2 , r2 ) is WSNB if and only if m ≥ n1 + n2 − 1. For n1 r1 > n2 r2 and ¡ ¢ 2 −2 r2 ≥ (n1 − 1) n1n+n + 1, C(n1 , r1 , m, n2 , r2 ) is WSNB if and only if m ≥ −1 2 n1 + n2 − 1.. 35.

(44) ¡ ¢ Corollary 4.2. For r ≥ (n − 1) 2n−2 + 1, C(n, m, r) is WSNB if and only n−1 if m ≥ 2n − 1. Note that further increasing r does not lead to a stronger result since O1 can be connected to at most n inputs. Moreover, if m ≥ n1 + n2 − 1, then it is SNB. From these theorems, it seems that it is hopeless to find a good algorithm for WSNB in one-to-one traffic for large r. However, finding a better algorithm for smaller r is still possible. Note that the lower bound of r in Corollary 4.2 is exponential in n. Obtaining a polynomial bound will greatly reduce the range of uncertainty whether WSNB can improve over SNB.. 36.

(45) Appendix Proof of Theorem 4.1. For n1 r1 ≤ n2 r2 , it suffices to prove the “only if” ¡ ¢ 2 −2 part. Suppose r1 = (n2 − 1) n1n+n + 1 and m = n1 + n2 − 2. Consider 1 −1 the state s that every input crossbar has n1 − 1 inputs connect to arbitrary output crossbar except O1 . Then every input crossbar connects to'n1 − 1 & r1 middle crossbars. By the pigeonhole principle, there are ¡ m ¢ = n2 n1 −1. input crossbars, say X, connected to the same n2 middle crossbars, say M . Under s, we add n2 new requests from idle input links in X to O1 . Since X is already routed through M , the new requests cannot use M any more. And since the new requests involve the same output crossbar, they must be routed through distinct middle crossbars. Therefore m ≥ |X| + |M | = n1 + n2 − 1. For n1 r1 > n2 r2 , the argument is similar by exchanging “input” to “output”.. 37.

(46) References [1] V. E. Beneˇs, Mathematical Theory of Connecting Networks and Telephone Traffic. Academic Press, 1965, New York. [2] V. E. Beneˇs, Blocking in the NAIU network, AT&T Bell Labs Tech. Memo. 1985. [3] J. C. Bermond, J. M. Fourneau, and A. Jean-Marie, Equivalence of multistage interconnection networks, Information Processing Letter 26 (1987) 45–50. [4] F. H. Chang, J. Y. Guo, F. K. Hwang, C. K. Lin, Wide-sense nonblocking for symmetric or asymmetric 3-stage Clos networks under various routing strategies, Theoretical Computer Science 314 (2004) 375–386. [5] F. H. Chang, J. Y. Guo, F. K. Hwang, Wide-sense nonblocking for multilogd N networks under various routing strategies, Theoretical Computer Science, to appear. [6] C. Clos, A Study of nonblocking switching networks, Bell System Tech. J., 32 (1953) 406-424. [7] D. Z. Du, P. C. Fishburn, B. Gao and F. K. Hwang, Wide-sense nonblocking for 3-stage Clos networks, in Switching Networks: Recent Advances, Eds:D.Z.Du and H.Q.Ngo, pp. 89-100, Kluwer 2001, Boston.. 38.

(47) [8] F. K. Hwang, Choosing the best logk (N, m, P ) strictly nonblocking networks. IEEE Transactions on Communications, Vol. 46, No. 4 (1998) 454–455. [9] F. K. Hwang, The Mathematical Theory of Nonblocking Switching Networks, World Scientific, Singapore, 1st Ed.(1998) and 2nd Ed.(2004). [10] F. K. Hwang and B. C. Lin, Wide-sense nonblocking multicast log2 (N, m, p) network. IEEE Trans. Commun. 51 (2003) 1730–1735. [11] W. Kabacinski and G. Danilewicz, Wide-sense and strict sense nonblocking operation of multicast multi-log2 N switching networks. IEEE Trans. Commun. 50 (2002) 1025–1036. [12] C.-T. Lea, Multi-log2 N networks and their applications in high-speed electronic and photonic switching systems, IEEE Transactions on Communications, Vol. 38, No. 10 (1990) 1740–1749. [13] D.-J. Shyy and C.-T. Lea, log2 (N, m, p) strictly nonblocking networks. IEEE Transactions on Communications, Vol. 39, No. 10 (1991) 1502– 1510. [14] D.G. Smith, Lower bound in the size of a 3-stage wide-sense nonblocking network, Elec. Lett. 13 (1977) 215-216. [15] K.H. Tsai, D.W. Wang and F.K. Hwang, Lower bounds of wide-sense nonblocking Clos networks, Theor. Comput. Sci. 261 (2001) 323-328. [16] Yeonghwan Tscha and Kyoon-Ha Lee, Yet another result on multilog2 N network, IEEE Trans. Commun., 47 (1999) 1425–1431. [17] Y. Yang and G. M. Masson, Nonblocking broadcast switching networks, IEEE Trans. Comput. 9 (1991) 1005–1015. [18] Y. Yang and W.Wang, Wide-sense nonblocking Clos networks under packing strategy, IEEE Trans. Comput. 45 (1999) 265-284. 39.

(48)

數據

Figure 1.2: A 4-stage interconnection network network.
Figure 1.3: (a) asymmetric (b) symmetric
Figure 1.4: Linking pattern of the baseline network and an example.
Figure 1.6: Multi-log d N Network
+4

參考文獻

相關文件

由於較大型網路的 規劃必須考慮到資料傳 輸效率的問題,所以在 規劃時必須將網路切割 成多個子網路,稱為網 際網路。橋接器是最早

(現場取消未講授);三、 「中東地區的女性與發展—女性領導力(Women and Development in the Middle East—Feminizing Leadership) 」 、 「新網路世紀的創新與發展(Innovation and Growth

In addition, we successfully used unit resistors to construct the ratio of consecutive items of Fibonacci sequence, Pell sequence, and Catalan number.4. Minimum number

Isakov [Isa15] showed that the stability of this inverse problem increases as the frequency increases in the sense that the stability estimate changes from a logarithmic type to

The function f (m, n) is introduced as the minimum number of lolis required in a loli field problem. We also obtained a detailed specific result of some numbers and the upper bound of

Enrich the poem with a line that appeals to this missing sense.. __________________imagery that appeals to the sense of____________has not been used in the description

(c) If the minimum energy required to ionize a hydrogen atom in the ground state is E, express the minimum momentum p of a photon for ionizing such a hydrogen atom in terms of E

In this work, for a locally optimal solution to the NLSDP (2), we prove that under Robinson’s constraint qualification, the nonsingularity of Clarke’s Jacobian of the FB system