
National Chiao Tung University
Department of Communication Engineering
Master's Thesis

Power/subcarrier allocation and decoding schedule for LDPC coded OFDM systems
(低密度校正檢測碼用於正交頻域多工系統之能量-載波分配與解碼排程)

Student: Johann Yan (顏佐翰)
Advisor: Prof. Yu T. Su (蘇育德)

Power/subcarrier allocation and decoding schedule for LDPC coded OFDM systems
(低密度校正檢測碼用於正交頻域多工系統之能量-載波分配與解碼排程)

Student: Johann Yan (顏佐翰)
Advisor: Prof. Yu T. Su (蘇育德)

A Thesis
Submitted to the Department of Communication Engineering,
College of Electrical and Computer Engineering,
National Chiao Tung University,
in Partial Fulfillment of the Requirements
for the Degree of Master
in
Communication Engineering

September 2008
Hsinchu, Taiwan, Republic of China

Power/subcarrier allocation and decoding schedule for LDPC coded OFDM systems

Student: Johann Yan (顏佐翰)
Advisor: Prof. Yu T. Su (蘇育德)

Institute of Communication Engineering, National Chiao Tung University

Abstract (translated from Chinese)

The error-correcting performance of today's low-density parity-check (LDPC) codes comes quite close to the Shannon limit. The belief propagation algorithm is very effective for decoding LDPC codes. For certain LDPC code structures, however, the conventional belief propagation algorithm does not provide an efficient solution. We therefore propose two grouping methods to improve performance: the first is based on the amount of independent information that can be effectively collected, and the second groups bits according to their differing error-resisting abilities. With these two methods we obtain gains for different usage scenarios.


Power/subcarrier allocation and decoding schedule

for LDPC coded OFDM systems

Student : Johann Yan Advisor : Yu T. Su

Institute of Communication Engineering National Chiao Tung University

Abstract

Low density parity check (LDPC) codes are powerful error-correcting codes that, thanks in part to the belief propagation (BP) decoding algorithms, offer near-Shannon-limit performance when the code length is sufficiently long (but finite). The BP algorithms refer to a class of iterative algorithms that pass probabilistic (reliability) messages on a graph that describes the probabilistic (Markovian) relations amongst the associated random variables. With proper message-updating rules and message-passing schedule, a BP algorithm can efficiently compute the a posteriori probability (APP) or likelihood function needed in maximum likelihood (ML) or APP decoding.

Two parallel-serial decoding algorithms, namely the horizontal shuffled BP (HSBP) and the vertical shuffled BP (VSBP) algorithms, have been proposed in the literature. They partition either the check or the variable nodes into several groups, where a group consists of (almost) the same number of consecutive nodes according to the natural order of the parity-check matrix, and carry out the BP process in a group-by-group manner. Three design issues for the resulting parallel-serial decoder arise: the degree of parallelism (the cardinality of a group), the partition rule (which nodes should be in the same group),


and the associated message-passing schedule. All three design concerns affect the decoder complexity, the convergence speed and the error rate performance.

The basic per-sub-iteration message-passing behavior of a shuffled BP algorithm is determined by the corresponding submatrices of the parity check matrix H. That of a shuffled BP using a partition which is not based on the natural order can be described by a permuted version of the original H. An all-zero column (row) in a sub-matrix implies that the corresponding nodes will undergo no information update. It is thus desired that there be as few all-zero columns (rows) in a sub-matrix as possible.

We present two partition criteria. The first criterion is based on the innovation rate (new uncorrelated information collected per sub-iteration), while the second is based on the bit nodes' normalized correlation spreads (NCS), which measure the degree of local flooding uniformity of a bit node in each iteration. As the NCS also reveals the unequal error protection (UEP) nature of an irregular LDPC code, the second partition actually divides the bit nodes into groups with different error rate performance. In a multicarrier transmission system, such a UEP property can be exploited to improve the overall performance by using proper power and subcarrier allocation in carrying out BP decoding. Numerical simulations indicate that both approaches yield improved error rate and convergence performance.

Acknowledgments (translated from Chinese)

During my two years of graduate study I received help and encouragement from many people, and this thesis was completed thanks to them. First I thank my advisor, Prof. Yu T. Su (蘇育德). In my two years of learning from him I gained not only academic insight but, more importantly, inspiration about how to treat people and handle affairs; I believe his teaching will benefit me for a lifetime. Next I thank my family for their constant support and encouragement throughout my studies, accompanying me through every low point. I also thank the companions who fought alongside me while this thesis was being written, 易霖, 郁文 and 千雅; everyone along the way gave me the drive to keep going. Finally I thank the good friends who have accompanied me all along: the junior members of Lab 811, 小石, Max, Leodo, Nuts, 鄭瑩 and Tofar; the seniors 小明, 人仰, 坤昌, 彥志 and 家偉; and my good friends from college. Thank you for your support and friendship, which carried me past one thesis bottleneck after another.

Thank you, everyone.


Contents

English Abstract i
Contents iii
List of Figures v
1 Introduction 1
2 Low-Density Parity-Check Code 2
2.1 Concept of factor graph . . . 2
2.2 Constraint node of factor graph . . . 3
2.3 Belief propagation . . . 4
2.3.1 Message from check node to variable node . . . 5
2.3.2 Message from variable node to check node . . . 7
3 Shuffled Iterative decoding 9
3.1 Vertical shuffled belief propagation . . . 9
3.2 Horizontal shuffled belief propagation . . . 12
4 Group scheduling methods for shuffled belief propagation algorithm 15
4.1 Message passing for several sub-iterations . . . 15
4.2 Group schedule method . . . 18
4.3 Difference between schedule of VSBP and HSBP . . . 20
5 Message flow distribution and noise resisting 27
5.1 Message flow for BP algorithm . . . 27
5.2 Comparing EXIT chart to message flow . . . 31
5.3 Power allocation based on message flow . . . 32
5.4 Channel mapping based on message flow . . . 33
6 Simulation result 36
6.1 Error rate performance comparison . . . 36
6.2 Message flow and distinct performance of different bits . . . 39
6.3 Power allocation and channel mapping . . . 45
7 Conclusion 51
A Structure for a (96,48) LDPC code 54

List of Figures

2.1 A factor graph for fA(x1, x4)fB(x4)fC(x2, x3)fD(x4, x5) . . . 3
2.2 (a) is equality constraint node and (b) is zero-sum constraint node . . . 3
2.3 Tanner graph of example 2 . . . 4
2.4 Idea of message passing in check node . . . 6
2.5 Idea of message passing in variable node . . . 7
3.1 parallel architecture . . . 10
3.2 serial architecture . . . 10
3.3 multi-stage factor graph representation of VSBP algorithm . . . 11
3.4 multi-stage factor graph representation of HSBP algorithm . . . 13
4.1 MSBP factor graph of (8,4) LDPC code . . . 17
4.2 Statistics for (480,240) QC-LDPC code generated by 802.11n specification with traditional BP with transmitted x=1 . . . 21
4.3 Statistics for (480,240) QC-LDPC code generated by 802.11n specification with VSBP, G=6 with transmitted x=1 . . . 22
5.1 message flow for example 7 . . . 30
5.2 model of a multipath channel . . . 34
5.3 a channel response example of frequency selective channel . . . 35
6.1 BER performance of the (480,240) QC-LDPC code with Imax=1, VSBP G=2,4,6 . . . 37
6.2 BER performance of the (480,240) QC-LDPC code with Imax=3, VSBP G=2,4,6 . . . 38
6.3 BER performance of the (480,240) QC-LDPC code with Imax=1, HSBP G=2,4 . . . 39
6.4 BER performance of the (480,240) QC-LDPC code with Imax=3, HSBP G=2,4 . . . 40
6.5 FER performance of the (480,240) QC-LDPC code with Imax=5,10, G=2 . . . 40
6.6 FER performance of the (480,240) QC-LDPC code with Imax=5,10, G=4 . . . 41
6.7 Message flow behavior of the (480,240) 802.11n specification QC-LDPC code . . . 41
6.8 BER performance of different bits for (480,240) 802.11n specification QC-LDPC code . . . 42
6.9 Message flow behavior of the (480,240) 802.16e specification QC-LDPC code . . . 43
6.10 BER performance of different bits for (480,240) 802.16e specification QC-LDPC code . . . 43
6.11 Message flow behavior of the (96,48) QC-LDPC code . . . 44
6.12 BER performance of different bits for (96,48) QC-LDPC code . . . 44
6.13 BER of 802.11n specification QC-LDPC code with power allocation at I=5 . . . 45
6.14 magnitude response for frequency selective channel with small difference of magnitude . . . 47
6.15 802.11n specification performance for frequency selective channel with small difference of magnitude . . . 47
6.16 magnitude response for frequency selective channel with large difference of magnitude . . . 48
6.17 performance for frequency selective channel with large difference of magnitude . . .
6.18 magnitude response for frequency selective channel with difference of magnitude between Fig 6.14 and Fig 6.16 . . . 49
6.19 performance for frequency selective channel with difference of magnitude between Fig 6.14 and Fig 6.16 . . . 50
B.1 Base matrix of 802.11n . . . 55
B.2 Base matrix of 802.16e . . . 56

Chapter 1

Introduction

LDPC codes are usually decoded with the BP algorithm, but the conventional BP algorithm does not provide an efficient method, based on the characteristics of a particular code, to achieve an additional performance gain. The focus of this thesis is to improve the efficiency of a code by grouping it into several small units for analysis. Chapter 2 introduces some definitions and theorems of LDPC codes, and Chapter 3 presents the idea of shuffled BP and how shuffled BP speeds up convergence.

We use two kinds of grouping methods to enhance performance. First, in Chapter 4, building on the concept of shuffled BP, we add a further processing step that focuses on the amount of independent information obtainable in one sub-iteration, hoping that after scheduling more independent information is collected. Second, we employ a simple method to find the different noise-resisting abilities of the individual bits and then group them; this is introduced in Chapter 5. We use the EXIT chart to verify our assumptions, Chapter 6 presents our simulation results, and the last chapter gives our conclusion and some future work.

Chapter 2

Low-Density Parity-Check Code

Low-density parity-check (LDPC) codes were invented by Gallager in 1962 [1], but because the computer processors of that time could not support the decoding complexity, they were ignored until MacKay [2] rediscovered them. An LDPC code is a linear block code specified by a very sparse parity check matrix containing a small number of 1's, with 0's elsewhere. The decoding algorithm of an LDPC code can be represented on a factor graph and is proved to achieve performance near the Shannon limit when the block length is large.

2.1 Concept of factor graph

Factor graphs [3][4] are a straightforward generalization of Tanner graphs and were introduced to describe families of LDPC codes. Graphical models such as factor graphs support a general trend toward iterative processing.

A factor graph is a diagram that represents the factorization of a function $g(x_1,\ldots,x_n)=\prod_j f_j(X_j)$, where $X_j$ is a subset of $\{x_1, x_2, \ldots, x_n\}$ and $f_j(X_j)$ is a local function having the elements of $X_j$ as arguments. A factor graph contains two kinds of nodes: a variable node for each variable $x_i$ and a factor node for each local function $f_j$. An edge connects variable node $x_i$ with factor node $f_j$ if and only if $x_i$ is an argument of $f_j$.

Figure 2.1: A factor graph for $f_A(x_1,x_4)\,f_B(x_4)\,f_C(x_2,x_3)\,f_D(x_4,x_5)$

Figure 2.2: (a) equality constraint node and (b) zero-sum constraint node

For example, let $g(x_1,\ldots,x_5)$ be a function of five variables, and assume that $g$ can be expressed as a product of four factors:
$$g(x_1,x_2,x_3,x_4,x_5)=f_A(x_1,x_4)\,f_B(x_4)\,f_C(x_2,x_3)\,f_D(x_4,x_5) \tag{2.1}$$
where $X_A=\{x_1,x_4\}$, $X_B=\{x_4\}$, $X_C=\{x_2,x_3\}$, $X_D=\{x_4,x_5\}$. Fig 2.1 shows the factor graph of equation (2.1).

2.2 Constraint node of factor graph

A factor graph contains two kinds of constraint nodes: the equality constraint node and the zero-sum constraint node, shown in Fig 2.2.

The local function of the equality constraint node is
$$f_{=}(x,x',x'') \triangleq \delta(x-x')\,\delta(x-x'') \tag{2.2}$$

Figure 2.3: Tanner graph of example 2

The local function of the zero-sum constraint node is
$$f_{+}(x,x',x'') \triangleq \delta(x+x'+x'') \tag{2.3}$$
where $\delta(\cdot)$ denotes the delta function, with $\delta(x)=1$ when $x=0$ and $\delta(x)=0$ otherwise.

Example 2 (An example of LDPC code): Suppose we have the parity check matrix
$$H=\begin{bmatrix}1&0&0&1&1&0&0\\ 1&0&1&0&0&1&1\\ 0&1&1&1&0&0&1\end{bmatrix}$$
If $X=(v_1,v_2,v_3,v_4,v_5,v_6,v_7)$ is a codeword and the syndrome is $S=(C_1,C_2,C_3)$, then from the characteristics of a linear code we have $HX^T=S^T=[C_1,C_2,C_3]^T=[0\;0\;0]^T$. The three check nodes are described by
$$C_1:\; x_1\oplus x_4\oplus x_5=0$$
$$C_2:\; x_1\oplus x_3\oplus x_6\oplus x_7=0$$
$$C_3:\; x_2\oplus x_3\oplus x_4\oplus x_7=0$$
The Tanner graph of this parity check matrix, with 7 variable nodes and 3 check nodes, is shown in Fig 2.3.
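As a quick illustration of these definitions, the following Python sketch (an editorial addition, not part of the thesis) builds the neighborhood sets N(m) and M(n) for the H of Example 2 and verifies the syndrome of a codeword; the test word was chosen by hand to satisfy all three checks.

```python
import numpy as np

# Parity-check matrix of Example 2.
H = np.array([[1, 0, 0, 1, 1, 0, 0],
              [1, 0, 1, 0, 0, 1, 1],
              [0, 1, 1, 1, 0, 0, 1]])

# Neighborhood sets used throughout the thesis:
# N(m) = bits participating in check m, M(n) = checks containing bit n.
N_m = {m: set(np.flatnonzero(H[m])) for m in range(H.shape[0])}
M_n = {n: set(np.flatnonzero(H[:, n])) for n in range(H.shape[1])}

# A word X is a codeword iff the syndrome S = H X^T is zero (mod 2).
X = np.array([1, 1, 1, 0, 1, 0, 0])  # satisfies C1, C2 and C3
S = H.dot(X) % 2
print(N_m[0], M_n[0], S)             # S prints as [0 0 0]
```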

2.3 Belief propagation

In this section we introduce the decoding algorithm of LDPC codes. The decoding method is called belief propagation, or the message passing algorithm, and is an iterative algorithm. Its main concept is that information is updated by passing the current messages from variable nodes to check nodes and from check nodes to variable nodes. The message from variable node n to check node m is computed by collecting all messages arriving at variable node n except the one coming from check node m; the computation of the message from check node m to variable node n is analogous.

For convenience, we define the likelihood ratio of a binary random variable x as $L(x)=\Pr[x=0]/\Pr[x=1]$, and the conditional likelihood ratio of x given y as $L(x|y)=\Pr[x=0|y]/\Pr[x=1|y]$.

2.3.1 Message from check node to variable node

Assume $p=2\Pr[x_1=0|y]-1$ and $q=2\Pr[x_2=0|y]-1$, where $x_1, x_2$ are binary random variables and $y=(y_1,y_2,\ldots,y_N)$ with $y_1,\ldots,y_N$ continuous random variables. Then
$$\begin{aligned}
pq &= \big(2\Pr[x_1=0|y]-1\big)\big(2\Pr[x_2=0|y]-1\big)\\
&= 4\Pr[x_1=0|y]\Pr[x_2=0|y]-2\Pr[x_1=0|y]-2\Pr[x_2=0|y]+1\\
&= 2\Pr[x_1=0|y]\Pr[x_2=0|y]+2\,(1-\Pr[x_1=1|y])(1-\Pr[x_2=1|y])\\
&\quad -2\,(1-\Pr[x_1=1|y])-2\,(1-\Pr[x_2=1|y])+1\\
&= 2\Pr[x_1=0|y]\Pr[x_2=0|y]+2\Pr[x_1=1|y]\Pr[x_2=1|y]-1\\
&= 2\Pr[x_1\oplus x_2=0\,|\,y]-1
\end{aligned}$$

From the above we can extend this to
$$2\Pr[x_1\oplus\cdots\oplus x_l=0\,|\,y]-1=\prod_{i=1}^{l}\big(2\Pr[x_i=0\,|\,y]-1\big)$$

Figure 2.4: Idea of message passing in check node

where $x_1,\ldots,x_l$ are all binary random variables. And because
$$\frac{L(x_i|y)}{1+L(x_i|y)}=\frac{\Pr[x_i=0|y]/\Pr[x_i=1|y]}{\big(\Pr[x_i=0|y]/\Pr[x_i=1|y]\big)+1}=\frac{1}{1+\Pr[x_i=1|y]/\Pr[x_i=0|y]}=\Pr[x_i=0|y],$$
the quantity $2\Pr[x_i=0|y]-1$ can be represented as $(L(x_i|y)-1)/(L(x_i|y)+1)$. If we define $l_i=\ln L(x_i|y)$, then $2\Pr[x_i=0|y]-1=\tanh(l_i/2)$. Therefore we get
$$\ln L(x_1\oplus\cdots\oplus x_l\,|\,y)=\ln\frac{1+\prod_{i=1}^{l}\tanh(l_i/2)}{1-\prod_{i=1}^{l}\tanh(l_i/2)} \tag{2.6}$$

From Fig 2.4 we know the information of $X_4$ is just the modulo-2 sum of $X_1, X_2, X_3$, i.e. $\Pr(X_4)=\Pr(X_1\oplus X_2\oplus X_3)$, so equation (2.6) implements the message from a check node to a variable node. For an $M\times N$ parity check matrix $H=[H_{mn}]$, we define the set of all bit nodes connecting to check node m by $N(m)=\{n: H_{mn}=1\}$ and the set of all check nodes connecting to bit node n by $M(n)=\{m: H_{mn}=1\}$. If we denote the message from variable node n to check node m at the $(i-1)$-th iteration as $z_{mn}^{i-1}$, the message from check node m to variable node n at the i-th iteration can

be described as
$$\varepsilon_{mn}^{i}=\ln\frac{1+\prod_{n'\in N(m)\setminus n}\tanh(z_{mn'}^{i-1}/2)}{1-\prod_{n'\in N(m)\setminus n}\tanh(z_{mn'}^{i-1}/2)}$$

Figure 2.5: Idea of message passing in variable node

2.3.2 Message from variable node to check node

If $\Pr[x_1=0|y]=p$ and $\Pr[x_2=0|y]=q$, where $x_1, x_2$ are binary random variables, then $\Pr[x_1=x_2=0|y]=pq/\eta$ and $\Pr[x_1=x_2=1|y]=(1-p)(1-q)/\eta$, where $\eta$ is a normalization constant making the two probabilities sum to one. Moreover $L(x_1=x_2|y)=L(x_1)\cdot L(x_2)$, so $\ln L(x_1=x_2|y)=\ln L(x_1)+\ln L(x_2)$. Extending this to the case of several variable nodes:
$$\ln L(x_1=x_2=\cdots=x_l)=\ln L(x_1)+\ln L(x_2)+\cdots+\ln L(x_l) \tag{2.7}$$

where $x_1,\ldots,x_l$ are all binary random variables. From Fig 2.5, $\Pr(X_4)=\Pr(X_1=X_2=X_3)$, so the message from variable nodes to check nodes can be implemented by equation (2.7). If we denote the variable-to-check message at the i-th iteration as $z_{mn}^{i}$, then
$$z_{mn}^{i}=F_n+\sum_{m'\in M(n)\setminus m}\varepsilon_{m'n}^{i}$$
where $F_n=4y_n/N_0$, and the initial value is $z_{mn}^{0}=F_n$.

We summarize the detailed BP algorithm as follows:

1. Initialization: set $i=1$ and let the maximum number of iterations be $I_{max}$. For each m, n, initialize $z_{mn}^{0}=F_n=4y_n/N_0$.

2. • At check nodes: $\forall m,\ 1\le m\le M$, and each $n\in N(m)$:
$$\tau_{mn}^{i}=\prod_{n'\in N(m)\setminus n}\tanh\Big(\frac{z_{mn'}^{i-1}}{2}\Big) \tag{2.8}$$
$$\varepsilon_{mn}^{i}=\ln\frac{1+\tau_{mn}^{i}}{1-\tau_{mn}^{i}} \tag{2.9}$$
• At variable nodes: $\forall n,\ 1\le n\le N$, and each $m\in M(n)$:
$$z_{mn}^{i}=F_n+\sum_{m'\in M(n)\setminus m}\varepsilon_{m'n}^{i} \tag{2.10}$$
$$z_{n}^{i}=F_n+\sum_{m\in M(n)}\varepsilon_{mn}^{i} \tag{2.11}$$

3. Make a hard decision on $z_n^{i}$ to obtain the outcome $R_n^{i}$: $R_n^{i}=0$ if $z_n^{i}>0$, else $R_n^{i}=1$. If the outcome satisfies $H\cdot(R^{i})^{T}=0$ or the maximum number of iterations $I_{max}$ is reached, stop and output $R^{i}$ as the decoded codeword. Otherwise set $i=i+1$ and go to Step 2.
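The summary above translates directly into code. Below is a minimal NumPy sketch (an editorial addition with our own variable names, not the thesis's implementation) of one flooding BP iteration implementing equations (2.8)-(2.11); the check-node product over N(m)\n is formed by dividing out the excluded factor, which assumes the tanh terms are nonzero.

```python
import numpy as np

def bp_iteration(H, F, z):
    """One flooding BP iteration, following eqs. (2.8)-(2.11).
    H: (M, N) binary parity-check matrix.
    F: length-N channel LLRs, F[n] = 4*y[n]/N0.
    z: (M, N) variable-to-check messages from the previous iteration
       (meaningful only where H[m, n] == 1)."""
    M, N = H.shape
    eps = np.zeros((M, N))
    # Check-node update, eqs. (2.8)-(2.9).
    t = np.where(H == 1, np.tanh(z / 2.0), 1.0)
    for m in range(M):
        for n in np.flatnonzero(H[m]):
            tau = np.prod(t[m, H[m] == 1]) / t[m, n]   # product over N(m)\n
            tau = np.clip(tau, -0.999999, 0.999999)    # keep the log finite
            eps[m, n] = np.log((1.0 + tau) / (1.0 - tau))
    # Variable-node update, eqs. (2.10)-(2.11).
    z_new = np.zeros_like(z)
    for n in range(N):
        checks = np.flatnonzero(H[:, n])
        total = eps[checks, n].sum()
        for m in checks:
            z_new[m, n] = F[n] + total - eps[m, n]     # eq. (2.10)
    z_post = F + (eps * H).sum(axis=0)                 # eq. (2.11)
    return z_new, z_post
```

A decoder loop would call this repeatedly, taking the hard decision R = (z_post <= 0) after each iteration and stopping once H·R^T = 0 (mod 2) or I_max iterations are reached, exactly as in step 3.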

Chapter 3

Shuffled Iterative decoding

The BP algorithm is a multi-iteration parallel procedure that updates information from variable nodes to check nodes and from check nodes to variable nodes until the maximum iteration number is reached or the syndrome becomes zero. However, the parallel implementation architectures of such decoders (shown in Fig 3.1) require many processing elements and complex interconnections between variable nodes and check nodes [5], owing to the sparse structure of the underlying Tanner graph. It is natural to adopt a serial architecture that serializes and distributes the parallel algorithm among a small number of processing elements (shown in Fig 3.2), with the messages between variable nodes and check nodes saved in memory. Unlike the parallel architecture, the computation in the serial architecture exploits data dependencies: newly updated messages can be used immediately within the current iteration, whereas the parallel architecture relies only on the messages of the previous iteration. The serial architecture can therefore achieve better error performance and faster convergence than the parallel one.

3.1 Vertical shuffled belief propagation

The vertical shuffled belief propagation algorithm [1][6] splits all the variable nodes into groups, and each full iteration of the decoding process consists of several sub-iterations.

Figure 3.1: parallel architecture

Figure 3.2: serial architecture

Figure 3.3: multi-stage factor graph representation of VSBP algorithm

Each sub-iteration uses the variable-to-check information of the previous iteration, $z_{mn}^{i-1}$, to update the check-to-variable information $\varepsilon_{mn}^{i}$ at the i-th iteration. However, the variable-to-check information $z_{mn}^{i}$ is based on the computation of some check-to-variable messages $\varepsilon_{mn}^{i}$, and from equation (2.9) $\varepsilon_{mn}^{i}$ is also related to $z_{mn}^{i-1}$; we can therefore use the already-updated messages to compute the remaining values $\varepsilon_{mn}^{i}$.

For example, assume that we have the parity check matrix
$$H=\begin{bmatrix}1&0&0&1&1&0\\ 1&0&1&0&0&1\\ 0&1&1&1&0&0\end{bmatrix}$$
and we split the columns into 2 groups of 3 columns each. The representation of the VSBP algorithm for this example is shown in Fig 3.3. The graph shows that the check-to-variable information obtained in the first sub-iteration can be used to update the variable-to-check information in the second sub-iteration.

So the concept of the VSBP algorithm is that every sub-iteration receives the information made available by the previous sub-iteration and sends newer information to the next sub-iteration, yielding better error rate performance. The VSBP algorithm proceeds as follows:

1. Initialization: set $i=1$ and let the maximum number of iterations be $I_{max}$. For each m, n, initialize $z_{mn}^{0}=F_n=4y_n/N_0$.

2. For $1\le g\le G$, where G is the number of groups and $N_G$ is the number of columns in each group:
• At check nodes: $\forall n,\ 1\le n\le N$, and each $m\in M(n)$:
$$\tau_{mn}^{i}=\prod_{\substack{n'\in N(m)\setminus n\\ n'\le g\cdot N_G}}\tanh\Big(\frac{z_{mn'}^{i}}{2}\Big)\cdot\prod_{\substack{n'\in N(m)\setminus n\\ n'> g\cdot N_G}}\tanh\Big(\frac{z_{mn'}^{i-1}}{2}\Big) \tag{3.1}$$
$$\varepsilon_{mn}^{i}=\ln\frac{1+\tau_{mn}^{i}}{1-\tau_{mn}^{i}} \tag{3.2}$$
• At variable nodes: $\forall n,\ (g-1)\cdot N_G+1\le n\le g\cdot N_G$, and each $m\in M(n)$:
$$z_{mn}^{i}=F_n+\sum_{m'\in M(n)\setminus m}\varepsilon_{m'n}^{i} \tag{3.3}$$

3. At variable nodes: $\forall n,\ 1\le n\le N$, and each $m\in M(n)$:
$$z_{n}^{i}=F_n+\sum_{m\in M(n)}\varepsilon_{mn}^{i} \tag{3.4}$$

4. Make a hard decision on $z_n^{i}$ to obtain the outcome $R_n^{i}$: $R_n^{i}=0$ if $z_n^{i}>0$, else $R_n^{i}=1$. If the outcome satisfies $H\cdot(R^{i})^{T}=0$ or the maximum number of iterations $I_{max}$ is reached, stop and output $R^{i}$ as the decoded codeword. Otherwise set $i=i+1$ and go to Step 2.
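For concreteness, here is a small sketch (an editorial addition, not from the thesis) of one VSBP iteration in the same style as the earlier BP sketch: z is updated in place, group by group, so later sub-iterations of the same iteration see the newer messages, which is the effect equations (3.1)-(3.3) describe. It follows the usual shuffled-BP convention that, at sub-iteration g, columns of earlier groups contribute the new z and the rest the old z, and it assumes N is divisible by G.

```python
import numpy as np

def vsbp_iteration(H, F, z, G):
    """One VSBP iteration with G equal column groups (eqs. (3.1)-(3.4))."""
    M, N = H.shape
    NG = N // G                         # assumes N divisible by G
    eps = np.zeros((M, N))
    for g in range(G):
        lo, hi = g * NG, (g + 1) * NG
        # Recomputing t from the current z realizes the mixed product of
        # (3.1): already-updated groups contribute tanh(z_new/2).
        t = np.where(H == 1, np.tanh(z / 2.0), 1.0)
        for m in range(M):
            for n in np.flatnonzero(H[m, lo:hi]) + lo:
                others = [k for k in np.flatnonzero(H[m]) if k != n]
                tau = np.prod([t[m, k] for k in others]) if others else 0.0
                tau = np.clip(tau, -0.999999, 0.999999)
                eps[m, n] = np.log((1 + tau) / (1 - tau))   # eq. (3.2)
        # Variable-node update (3.3) for the columns of group g only.
        for n in range(lo, hi):
            checks = np.flatnonzero(H[:, n])
            total = eps[checks, n].sum()
            for m in checks:
                z[m, n] = F[n] + total - eps[m, n]  # in place: later groups see it
    z_post = F + (eps * H).sum(axis=0)              # eq. (3.4)
    return z, z_post
```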

3.2 Horizontal shuffled belief propagation

Like the VSBP algorithm, the HSBP [7] algorithm splits the check nodes into several groups, and decoding is carried out in a group-by-group manner. HSBP is similar to VSBP: first we collect all the variable-to-check messages $z_{mn}^{i}$ (for checks m belonging to the k-th group) to update the check-to-variable messages $\varepsilon_{mn}^{i}$. Then, at the next sub-iteration, the variable-to-check messages $z_{mn}^{i}$

Figure 3.4: multi-stage factor graph representation of HSBP algorithm

(for checks m belonging to the (k+1)-th group) are updated using the $\varepsilon_{mn}^{i-1}$ and the $\varepsilon_{mn}^{i}$ obtained in earlier sub-iterations.

For example, we use the same parity check matrix as in the previous VSBP example and separate the rows into 2 groups: the first group contains the 1st and 2nd rows, and the remaining row is in group 2. Fig 3.4 shows the representation. The idea of HSBP is to use the newest information to accelerate convergence.

Summarizing HSBP as follows:

1. Initialization: set $i=1$ and let the maximum number of iterations be $I_{max}$. For each m, n, initialize $z_{mn}^{0}=F_n=4y_n/N_0$.

2. For $1\le g\le G$, where G is the number of groups and $M_G$ is the number of rows in each group:
• At check nodes: $\forall m,\ (g-1)\cdot M_G+1\le m\le g\cdot M_G$, and each $n\in N(m)$:
$$\tau_{mn}^{i}=\prod_{n'\in N(m)\setminus n}\tanh\Big(\frac{z_{mn'}^{i-1}}{2}\Big) \tag{3.5}$$
$$\varepsilon_{mn}^{i}=\ln\frac{1+\tau_{mn}^{i}}{1-\tau_{mn}^{i}} \tag{3.6}$$
• At variable nodes: $\forall n,\ 1\le n\le N$, and each $m\in M(n)$:
$$z_{mn}^{i}=F_n+\sum_{\substack{m'\in M(n)\setminus m\\ m'\le g\cdot M_G}}\varepsilon_{m'n}^{i}+\sum_{\substack{m'\in M(n)\setminus m\\ m'> g\cdot M_G}}\varepsilon_{m'n}^{i-1} \tag{3.7}$$

3. At variable nodes: $\forall n,\ 1\le n\le N$, and each $m\in M(n)$:
$$z_{n}^{i}=F_n+\sum_{m\in M(n)}\varepsilon_{mn}^{i} \tag{3.8}$$

4. Make a hard decision on $z_n^{i}$ to obtain the outcome $R_n^{i}$: $R_n^{i}=0$ if $z_n^{i}>0$, else $R_n^{i}=1$. If the outcome satisfies $H\cdot(R^{i})^{T}=0$ or the maximum number of iterations $I_{max}$ is reached, stop and output $R^{i}$ as the decoded codeword. Otherwise set $i=i+1$ and go to Step 2.

Chapter 4

Group scheduling methods for shuffled belief propagation algorithm

The shuffled belief propagation algorithms introduced in Chapter 3 show that grouping the parity check matrix into several small sub-matrices helps reduce the complexity and the number of processing elements of decoding and, most importantly, speeds up the convergence of the error rate. But how to group the parity check matrix so as to achieve a better convergence speed becomes a significant problem. The shuffled BP algorithms include vertical SBP and horizontal SBP, whose decoding procedures differ, so their scheduling methods also differ somewhat; we discuss them in what follows.

4.1 Message passing for several sub-iterations

The main idea of shuffled iterative decoding is that the more independent information can be updated, the more reliable the extrinsic information and the more correct the decoding result. Obtaining more independent information in every sub-iteration prevents the messages from dispersing non-uniformly; non-uniform messages may give some bits high reliability and others low reliability, and after several iterations this situation usually results in a weak convergence speed. The question, then, is to find a schedule that realizes this kind of independence.

The parity check matrix determines the connecting relation between check nodes and variable nodes in every sub-iteration. Take HSBP as an example, with the parity check matrix divided into two groups, so that one iteration contains two sub-iterations; every check group has paths connecting to variable nodes. If the sets of variables touched by the first and second groups have little intersection, then after the first sub-iteration of the i-th iteration the updated check-to-variable messages depend only on the variable-to-check information of the variable nodes connected with the first group, and at the second sub-iteration we lose that relation when updating the variable-to-check messages. Naturally, if we want to collect more independent information, the decoding process of each sub-iteration must gather information from more variable nodes.

Example 3 (Group shuffled belief propagation): Consider the parity check matrix
$$H=\begin{bmatrix}1&0&1&1&0&0&0&0\\ 0&1&0&0&0&1&0&1\\ 0&1&0&1&1&0&1&0\\ 1&0&1&0&1&1&0&0\end{bmatrix}$$

The MSBP factor graph of the HSBP algorithm is shown in Fig 4.1(a). From Fig 4.1(a) we know that the first check group connects with V1, V2, V3, V4, V6 and V8, so at the first sub-iteration only some check-to-variable messages can be updated, and the messages from V5, V7 cannot be obtained. But at the second sub-iteration we also need messages from V5, V7 to update the check-to-variable messages. Because the messages to V5, V7 cannot be updated at the first sub-iteration, we must reuse the messages updated in the previous iteration for the second sub-iteration of this iteration. Consequently we cannot use the newest information to update the check-to-variable messages at the second sub-iteration, and the convergence is not sped up.

With the same idea, Fig 4.1(b) shows the VSBP factor graph of this code. At the second sub-iteration no check-to-variable messages are obtained from check 1, and this non-uniformity slows down the convergence. If the variable nodes in the second group had connections with check 1, the processing could include the

Figure 4.1: MSBP factor graph of the (8,4) LDPC code: (a) HSBP algorithm; (b) VSBP algorithm

variable-to-check-1 messages that are not used at the second sub-iteration in this example, and we would obtain more information.

To avoid the situation above, the simplest idea is that every check group should connect with almost all variable nodes for HSBP, and every variable group should connect with almost all check nodes for VSBP. Then every sub-iteration not only collects more independent information but also receives the newer messages updated in the previous sub-iteration.

4.2 Group schedule method

Now consider how to prevent the situation of Example 3. Continuing that example: at the first sub-iteration of HSBP, the reason there are no messages from V5, V7 to group 1 is that V5, V7 do not participate in the check nodes of group 1. Write $H=[h_1^T, h_2^T]^T=[\bar{h}_1, \bar{h}_2]$, where
$$h_1=\begin{bmatrix}1&0&1&1&0&0&0&0\\ 0&1&0&0&0&1&0&1\end{bmatrix}\qquad h_2=\begin{bmatrix}0&1&0&1&1&0&1&0\\ 1&0&1&0&1&1&0&0\end{bmatrix}$$
$$\bar{h}_1=\begin{bmatrix}1&0&1&1\\ 0&1&0&0\\ 0&1&0&1\\ 1&0&1&0\end{bmatrix}\qquad \bar{h}_2=\begin{bmatrix}0&0&0&0\\ 0&1&0&1\\ 1&0&1&0\\ 1&1&0&0\end{bmatrix}$$

$h_1$ is a sub-matrix of H and drives the message passing of the first sub-iteration for HSBP, and similarly $h_2$ for the second; $\bar{h}_1$ drives the message passing of the first sub-iteration for VSBP, and similarly $\bar{h}_2$. Because V5, V7 have no connection with group 1, the 5th and 7th columns of $h_1$ (the check nodes of group 1 for HSBP) are all zero. Similarly, the first row of $\bar{h}_2$ is zero, which means that at the second sub-iteration of VSBP there are no messages from check node 1 to the variable nodes of group 2. The all-zero sub-matrix columns for HSBP and the all-zero sub-matrix rows for VSBP cause the non-uniform situation in which information cannot be used efficiently.

To prevent all-zero rows/columns we must find a group schedule method before splitting the parity check matrix. For example, permuting the 4th column of $\bar{h}_1$ with the first column of $\bar{h}_2$ leaves no all-zero rows in either matrix. For a general parity check matrix, however, implementing this exactly is not easy, and a full search is not an efficient method. So we look for a simple method that nearly achieves it; the following is for VSBP, and the same concept applies to HSBP:

1. Define the number of groups group, the number of columns k in a group, and the maximum iteration number Imax. Initialize iteration = 1.

2. Initialize the group index i = 1. If iteration > Imax, finish the processing.

3. If i = group, set iteration = iteration + 1 and go to step 2; else go to step 4.

4. Initialize the group index ii = 1. If the j-th row of group i is all zero, search for the column α with the smallest column weight in group i; else set i = i + 1 and go to step 3. Initialize counter = 1.

5. Find a column β whose j-th row entry is nonzero in group ii, with ii ≠ i. Check the value of ii: if ii = group, go to step 8.

6. If there is no such column β, set ii = ii + 1 and redo step 5.

7. Permute column α and column β. Check whether the number of all-zero rows in group i after the permutation is smaller than before, and likewise for group ii. If the numbers of all-zero rows in groups i and ii are both reduced, set i = i + 1 and go to step 3; else undo the permutation, set ii = ii + 1 and counter = counter + 1, and go to step 5.

8. If counter = group, take as α the column with the second (third, fourth, ..., k-th) smallest column weight in group i, set ii = 1, and go to step 5. If no such column remains, set i = i + 1 and go to step 3.

When this algorithm finishes we obtain H′ from H, and we expect H′, once separated into the sub-matrices that drive the message passing, to contain fewer all-zero columns (rows) than H.
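A compressed Python sketch of this heuristic follows (an editorial simplification, not the thesis's exact procedure; the counter and k-th-smallest bookkeeping of steps 4-8 is folded into repeated greedy sweeps). It swaps a light column of a group that still has an all-zero row with a column from another group covering that row, keeping a swap only when the total number of all-zero rows across the two groups decreases.

```python
import numpy as np

def zero_rows(Hsub):
    """Indices of all-zero rows in a column-group sub-matrix."""
    return np.flatnonzero(Hsub.sum(axis=1) == 0)

def schedule_columns(H, G, sweeps=10):
    """Greedy column-permutation heuristic in the spirit of Section 4.2
    (VSBP version), assuming N divisible by G."""
    H = H.copy()
    M, N = H.shape
    NG = N // G
    groups = [slice(g * NG, (g + 1) * NG) for g in range(G)]
    for _ in range(sweeps):
        improved = False
        for gi, sl in enumerate(groups):
            for j in zero_rows(H[:, sl]):
                cols = np.arange(sl.start, sl.stop)
                alpha = cols[np.argmin(H[:, cols].sum(axis=0))]  # lightest column
                # candidate columns in other groups that cover row j
                for beta in np.flatnonzero(H[j]):
                    if sl.start <= beta < sl.stop:
                        continue
                    gj = beta // NG
                    before = len(zero_rows(H[:, sl])) + len(zero_rows(H[:, groups[gj]]))
                    H[:, [alpha, beta]] = H[:, [beta, alpha]]    # try the swap
                    after = len(zero_rows(H[:, sl])) + len(zero_rows(H[:, groups[gj]]))
                    if after < before:
                        improved = True
                        break
                    H[:, [alpha, beta]] = H[:, [beta, alpha]]    # undo
        if not improved:
            break
    return H
```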

4.3 Difference between schedule of VSBP and HSBP

When implementing the algorithm we must pay attention to some details. The grouping of VSBP is based on variable nodes while that of HSBP is based on check nodes; this is the difference between them. For VSBP, the i-th sub-iteration obtains the check-to-variable messages passed to the variables belonging to group i. Different sub-iteration positions within a full iteration yield different degrees of belief: the later a message is updated, the more reliable it is.

Because a codeword contains information bits and parity check bits, and the information bits matter more than the parity check bits, the group with the information bits should be processed last to achieve better performance. So for VSBP we first split the codeword bits into two groups, the first being the parity check bits and the second the information bits, and then schedule them using the algorithm described in Section 4.2.

The grouping of HSBP, however, is check-node based, and in fact almost every check node connects with nearly the same number of information bits. Therefore no extra processing like that of VSBP is needed; we simply run the group schedule algorithm.

Example 4 (complete VSBP example of the group schedule method): Let a codeword be $c=(i_1, i_2, \ldots, i_k, p_1, \ldots, p_{n-k})$, where $i_1,\ldots,i_k$ are information bits and $p_1,\ldots,p_{n-k}$ are parity check bits. If the parity check matrix is $H=[h_1, h_2, \ldots, h_k, h_{k+1}, \ldots, h_n]$, with $h_1,\ldots,h_k$ corresponding to the information bits and the rest to the parity check bits, we split the columns into two groups $G_1=[h_{k+1},\ldots,h_n]$ and $G_2=[h_1,\ldots,h_k]$. Assuming we want to run VSBP with 4 groups, we further divide $G_1$ and $G_2$ into 2 small groups each (if k = n/2). Let $g_{1,a}, g_{1,b}$ and $g_{2,a}, g_{2,b}$ be the two small groups divided from $G_1$ and $G_2$, and use the group schedule method within $G_1$ and within $G_2$. The most important point is that we must not permute a column in a group $g\subset G_1$ with a column in a group $g'\subset G_2$, because the information bits would then fail to get the newest extrinsic information.

4.4 Performance gain prediction from EXIT chart

Now, before simulation, we use the concept of the EXIT chart [12] to estimate whether our algorithm can improve the speed of convergence.

Figure 4.2: Statistics for (480,240) QC-LDPC code generated by the 802.11n specification with traditional BP and transmitted x=1: (a) iteration=1, (b) iteration=4 (X-axis: extrinsic information value).

Usually we assume that the a priori input A can be modeled by an independent zero-mean Gaussian random variable $n_A$ with variance $\sigma_A^2$, in conjunction with the known transmitted x:

$$A=\mu_A\cdot x+n_A \tag{4.1}$$
where $\mu_A=\sigma_A^2/2$. With the equation above, the conditional pdf of A can be written as
$$p_A(\zeta\,|\,X=x)=\frac{1}{\sqrt{2\pi}\,\sigma_A}\,e^{-\big(\zeta-(\sigma_A^2/2)\,x\big)^2/2\sigma_A^2} \tag{4.2}$$

Figure 4.3: Statistics for (480,240) QC-LDPC code generated by the 802.11n specification with VSBP, G=6 and transmitted x=1: (a) iteration=1, (b) iteration=4 (X-axis: extrinsic information value).

The mutual information $I_A=I(X;A)$ is
$$I_A=\frac{1}{2}\sum_{x=-1,+1}\int_{-\infty}^{\infty} p_A(\zeta|X=x)\,\log_2\frac{2\,p_A(\zeta|X=x)}{p_A(\zeta|X=-1)+p_A(\zeta|X=+1)}\,d\zeta \tag{4.3}$$
Substituting equation (4.2) into equation (4.3), we get
$$I_A(\sigma_A)=1-\int_{-\infty}^{\infty}\frac{e^{-(\zeta-\sigma_A^2/2)^2/2\sigma_A^2}}{\sqrt{2\pi}\,\sigma_A}\,\log_2\big(1+e^{-\zeta}\big)\,d\zeta \tag{4.4}$$
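Equation (4.4) has no closed form, so in practice one tabulates or numerically inverts it. The sketch below (an editorial addition) evaluates $I_A(\sigma_A)$ with a trapezoidal rule and inverts it by bisection, then forms the a priori inputs of equation (4.1); the sample size 480 and the target $I_A=0.5$ are arbitrary choices for illustration.

```python
import numpy as np

def I_A(sigma_a, num=20001, lim=40.0):
    """Mutual information I_A(sigma_A) of eq. (4.4), by numerical integration."""
    z = np.linspace(-lim, lim, num)
    mu = sigma_a ** 2 / 2.0
    pdf = np.exp(-(z - mu) ** 2 / (2 * sigma_a ** 2)) / (np.sqrt(2 * np.pi) * sigma_a)
    return 1.0 - np.trapz(pdf * np.log2(1.0 + np.exp(-z)), z)

def sigma_for(IA_target, tol=1e-6):
    """Invert eq. (4.4) by bisection (I_A is increasing in sigma_A)."""
    lo, hi = 1e-3, 50.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if I_A(mid) < IA_target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# A priori input per eq. (4.1): A = (sigma_A^2/2)*x + n_A for a known x.
sigma = sigma_for(0.5)
x = np.sign(np.random.randn(480))                 # hypothetical +/-1 pattern
A = (sigma ** 2 / 2) * x + sigma * np.random.randn(x.size)
```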

To draw an EXIT chart, we first choose an $I_A$ value and obtain $\sigma_A$ from equation (4.4), then substitute this $\sigma_A$ into equation (4.1) to form a set of a priori inputs. Finally we feed the a priori inputs and the intrinsic information into the decoder to obtain $I_E$; examples are shown in Fig 4.2 and Fig 4.3. A problem, however, is that we have assumed the a priori input to be Gaussian. During decoding, the $I_E$ of the previous step is used as the $I_A$ of the current step, and $I_E$ may not have a Gaussian distribution, especially for scheduled BP. Because scheduled BP separates a full iteration into several sub-iterations and the messages updated at later sub-iterations are more reliable, the overall extrinsic-information pdf drifts away from Gaussian. Once $I_E$ is not Gaussian we cannot use the Gaussian approximation to generate the $I_A$ needed at the next step. In Fig 4.2 and Fig 4.3 the X-axis is the extrinsic information value of messages sent from bit nodes to check nodes for a fixed codeword pattern, and the Y-axis is the number of messages located at each value x. We can see that at iteration=4 the $I_E$ of VSBP is no longer Gaussian, so we plot the EXIT chart by running the decoding algorithm rather than by the Gaussian approximation.

The following is an EXIT chart table for G=4 HSBP.

1 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.603338 0.694625 0.740615 0.770734 0.792035 0.807281 0.818275 0.826249
IEVCg     0.594851 0.667506 0.707433 0.735937 0.758411 0.776163 0.790108 0.800967
IECVg     0.069809 0.126099 0.180043 0.237488 0.297182 0.352668 0.402199 0.443413
IEi-VC    0.599266 0.687808 0.732027 0.760926 0.781573 0.796804 0.80853  0.817505
IEVC      0.589721 0.660188 0.698271 0.72481  0.745497 0.762259 0.776169 0.787574
IECV      0.066822 0.117586 0.163079 0.209308 0.257665 0.304959 0.349137 0.388982

2 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.704862 0.844271 0.914014 0.950083 0.968754 0.978315 0.983514 0.986445
IEVCg     0.692075 0.808452 0.878958 0.923572 0.95081  0.966495 0.975527 0.980829
IECVg     0.151169 0.332077 0.523086 0.683455 0.79643  0.865705 0.90634  0.930486
IEi-VC    0.699321 0.83166  0.898798 0.936373 0.9581   0.970593 0.977968 0.982432
IEVC      0.684403 0.792933 0.858301 0.901901 0.931656 0.951406 0.964107 0.972338
IECV      0.141169 0.293694 0.451728 0.594971 0.710391 0.795219 0.852686 0.889952

Table: mutual information of bit-to-check/check-to-bit for G=4 HSBP at Eb/N0 = 1 and 2 dB

Here IEi-VCg denotes the mutual information sent from the information bit nodes to the check nodes for the group scheduling method, while IEi-VC is for the traditional method without scheduling; similarly, IEVCg/IECVg denote the mutual information of bit nodes to check nodes / check nodes to bit nodes for the group scheduling method, and i = n in the table means the n-th iteration. Since we ultimately care about BER or FER performance, the information bits matter more than the codeword bits as a whole: there may be cases where the IEi-VC of our algorithm is larger than that without scheduling while IEVC shows the opposite. The value that reflects BER or FER is IEi-VC, which is why we additionally analyze it. Comparing the EXIT chart tables, we find that the mutual information grows with the iteration number, and the tables also reveal that the mutual information sent from information bit nodes to check nodes with group scheduling is always larger than that without scheduling at the same iteration. The EXIT chart thus helps explain why our algorithm can speed up the convergence. The following shows the complete EXIT chart tables for the different cases.

1 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.585746 0.67344  0.718868 0.748854 0.771095 0.788092 0.801126 0.811241
IEVCg     0.581934 0.651335 0.689472 0.716273 0.737715 0.755601 0.770465 0.782752
IECVg     0.061804 0.108829 0.151101 0.194284 0.240457 0.288675 0.334817 0.3765
IEi-VC    0.586216 0.673041 0.71701  0.745941 0.767016 0.783044 0.795765 0.805811
IEVC      0.579862 0.648083 0.685314 0.711004 0.730961 0.747448 0.761492 0.773436
IECV      0.060901 0.106005 0.144723 0.183119 0.222733 0.263708 0.303933 0.341878

2 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.676562 0.806036 0.87894  0.923314 0.949935 0.965895 0.975254 0.981082
IEVCg     0.671018 0.777417 0.845639 0.894011 0.927116 0.94933  0.963518 0.972494
IECVg     0.127032 0.265894 0.420488 0.572855 0.698155 0.790517 0.852712 0.892073
IEi-VC    0.674178 0.804371 0.875159 0.917493 0.943574 0.960185 0.970632 0.977236
IEVC      0.66831  0.770944 0.83456  0.879629 0.91213  0.935526 0.951963 0.963243
IECV      0.124467 0.253178 0.387236 0.518155 0.633238 0.726419 0.797796 0.848137

Table: mutual information of bit-to-check/check-to-bit for G=2 HSBP at Eb/N0 = 1 and 2 dB

1 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.652909 0.70058  0.73251  0.756394 0.775104 0.790137 0.80223  0.811945
IEVCg     0.632109 0.671163 0.698723 0.720514 0.738787 0.754679 0.768461 0.780301
IECVg     0.054664 0.095990 0.133426 0.171447 0.212175 0.255747 0.300071 0.342928
IEi-VC    0.647367 0.696525 0.729314 0.753797 0.77334  0.78865  0.800594 0.810228
IEVC      0.633566 0.673305 0.701239 0.723463 0.742494 0.758496 0.771869 0.783154
IECV      0.063825 0.105162 0.142222 0.180133 0.22122  0.264898 0.308427 0.349028

2 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.771652 0.849569 0.901349 0.934633 0.955667 0.968392 0.976175 0.98106
IEVCg     0.74261  0.809612 0.860294 0.898823 0.927242 0.947108 0.960637 0.969674
IECVg     0.10753  0.223848 0.358886 0.503866 0.63483  0.73698  0.810739 0.861254
IEi-VC    0.762402 0.842303 0.896452 0.931802 0.953623 0.966889 0.975121 0.980343
IEVC      0.747058 0.817552 0.870524 0.909096 0.935548 0.95313  0.964716 0.972321
IECV      0.13099  0.251863 0.387202 0.527521 0.651693 0.748292 0.817742 0.864729

Table: mutual information of bit-to-check/check-to-bit for G=2 VSBP at Eb/N0 = 1 and 2 dB

1 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.664242 0.7159   0.749304 0.77362  0.792015 0.805942 0.816083 0.823683
IEVCg     0.640015 0.682989 0.712814 0.736291 0.75584  0.772292 0.785619 0.79634
IECVg     0.060833 0.110264 0.156447 0.205976 0.258813 0.313228 0.364553 0.409898
IEi-VC    0.658139 0.7115   0.745783 0.770686 0.78957  0.803393 0.813841 0.821728
IEVC      0.64203  0.6857   0.715791 0.739548 0.759042 0.774552 0.787033 0.797027
IECV      0.071736 0.12059  0.165758 0.213341 0.263942 0.313988 0.360252 0.40162

2 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.789965 0.875166 0.92636  0.955281 0.970881 0.979346 0.984003 0.986783
IEVCg     0.75548  0.830789 0.885675 0.924593 0.950092 0.965726 0.974922 0.980442
IECVg     0.12387  0.272566 0.447494 0.618162 0.751003 0.837415 0.889112 0.919233
IEi-VC    0.778951 0.865693 0.918861 0.949346 0.966573 0.976128 0.9818   0.984984
IEVC      0.760888 0.839262 0.893799 0.929402 0.951766 0.965413 0.973838 0.978982
IECV      0.151256 0.300959 0.463108 0.614346 0.732856 0.815202 0.86904  0.902527

Table: mutual information of bit-to-check/check-to-bit for G=4 VSBP at Eb/N0 = 1 and 2 dB

1 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.669661 0.723362 0.757573 0.7815   0.799659 0.812891 0.82252  0.829341
IEVCg     0.643778 0.688673 0.719704 0.743614 0.76358  0.779763 0.792745 0.802738
IECVg     0.064490 0.117763 0.169117 0.223607 0.281114 0.337498 0.388619 0.431895
IEi-VC    0.662277 0.717874 0.753155 0.778057 0.796315 0.809356 0.818959 0.826206
IEVC      0.645373 0.691148 0.722534 0.746908 0.766331 0.781382 0.793297 0.802659
IECV      0.075910 0.128478 0.177978 0.230345 0.285135 0.337183 0.383892 0.424279

2 dB      i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEi-VCg   0.798766 0.886061 0.934489 0.960367 0.973659 0.981154 0.98498  0.987468
IEVCg     0.761586 0.839725 0.894382 0.931616 0.954627 0.9687   0.976738 0.981726
IECVg     0.133524 0.296723 0.483439 0.654254 0.776509 0.853796 0.899671 0.926987
IEi-VC    0.786138 0.876658 0.928476 0.956363 0.971277 0.979557 0.983924 0.986857
IEVC      0.767115 0.849887 0.904365 0.938013 0.958131 0.970205 0.977125 0.981688
IECV      0.162687 0.326971 0.500232 0.652337 0.76571  0.840914 0.887562 0.916437

Table: mutual information of bit-to-check/check-to-bit for G=6 VSBP at Eb/N0 = 1 and 2 dB

Scanning over these EXIT chart tables, the common phenomenon is that in every case the value IEi-VCg is larger than IEi-VC. Although the improvement may seem too small to distinguish IEi-VCg from IEi-VC clearly, the mutual information with scheduling does consistently exceed that without scheduling. We cannot establish a quantitative relation between this small mutual-information improvement and the performance enhancement, but that is not our emphasis; we only want to confirm that the faster convergence brought by group scheduling is reflected in the EXIT chart. From this EXIT chart result we can expect the simulated performance with group scheduling in the following chapter to be better than without scheduling.

Chapter 5

Message flow distribution and noise resisting

Usually when we analyze the error rate performance of a code, the codeword is treated as the smallest processing unit (we normally do not process individual bits). Hence each bit within a codeword is regarded as having equal noise-resisting ability, and generally no extra per-bit processing is applied.

For an LDPC code, however, it is impossible for every bit to have the same noise-resisting ability, because each bit is composed of different numbers of information bits (a bit composed of many information bits usually has better noise-resisting ability). If the error-protection ability differs noticeably between bits, the traditional codec algorithm should be modified to exploit this special characteristic, and that is what this chapter introduces.

5.1 Message flow for BP algorithm

The most important task in this section is to find an efficient algorithm to quantify the noise-resisting ability of the different bits.

A simple example expresses the idea: assume a codeword contains 4 bits, and use a vector to represent the contribution of all codeword bits to one particular bit. Consider the vector [0.5, 0.3, 0.1, 0.1]: the first element (the contribution from the first bit of the codeword) dominates the {+1, −1} decision, so this bit cannot obtain an objective (balanced) decision. The method for finding such contribution vectors is introduced next. The contributions coming from all bits in a codeword also provide a rough and easy analysis of decoding: the more uniform the information collected from all bits, the more correct the result.

For a parity check matrix $H=[H_{mn}]$ as defined before, the set of bits participating in check m is $N(m)=\{n: H_{mn}=1\}$ and the set of checks in which bit n participates is $M(n)=\{m: H_{mn}=1\}$. Some further definitions are needed:

1. $c_{mn}^{i}$ = the contribution vector carried by the message passed from check node m to variable node n at the i-th iteration.

2. $b_{mn}^{i}$ = the contribution vector carried by the message passed from variable node n to check node m at the i-th iteration.

3. $p_{n}^{i}$ = the complete contribution vector of $V_n$ at the i-th iteration.

The value we want is $p_n^{i}$. Analogously to the message flow of the BP algorithm, we define the relations among $c_{mn}^{i}$, $b_{mn}^{i}$, $p_n^{i}$ and the way messages travel. As the channel values associated with the variable nodes are independent and identically distributed (i.i.d.), we assign equal contribution to all initial messages generated by the variable nodes. With the definitions above, the message flow can be computed in 4 steps:

1. Initialization: set $b_{mn}^{0}=[\,\delta(1-n),\,\delta(2-n),\,\ldots,\,\delta(N-n)\,]$ for $1\le n\le N$, let $i=1$, and define
$$\delta(n)=\begin{cases}1, & \text{if } n=0\\ 0, & \text{if } n\ne 0\end{cases}$$

2. At check nodes: $\forall m,\ 1\le m\le M$, and each $n\in N(m)$, compute
$$c_{mn}^{i}=\frac{\sum_{n'\in N(m)\setminus n} b_{mn'}^{i-1}}{\deg(m)-1} \tag{5.1}$$
Here $\deg(m)$ is the number of variable nodes connected to check m; the division in equation (5.1) normalizes $c_{mn}^{i}$ so that the sum of the elements of this vector is one.

3. At variable nodes: $\forall n,\ 1\le n\le N$, and each $m\in M(n)$, compute
$$b_{mn}^{i}=\frac{[\,\delta(1-n),\ldots,\delta(N-n)\,]+\sum_{m'\in M(n)\setminus m} c_{m'n}^{i}}{\deg(n)} \tag{5.2}$$
$$p_{n}^{i}=\frac{[\,\delta(1-n),\ldots,\delta(N-n)\,]+\sum_{m\in M(n)} c_{mn}^{i}}{\gamma} \tag{5.3}$$
The definition and purpose of $\deg(n)$ in equation (5.2) are analogous to those of $\deg(m)$: the denominator equals the number of summed vectors, so that $b_{mn}^{i}$ again sums to one, and $\gamma$ likewise normalizes $p_n^{i}$.

4. $\forall n,\ 1\le n\le N$, compute the mean and variance below, then set $i=i+1$; if $i<I_{max}$, go to step 2. Let $b_{j,n}^{i}$ be the j-th element of the vector $p_n^{i}$:
$$\mu_n^{i}=\frac{\sum_{j=1}^{N} b_{j,n}^{i}}{N}=\frac{1}{N} \tag{5.4}$$
$$(\sigma_n^{i})^{2}=\frac{\sum_{j=1}^{N}\big[b_{j,n}^{i}-\mu_n^{i}\big]^{2}}{N} \tag{5.5}$$

Finally, the contribution uniformity can be quantified by the value $\mu_n^{i}/\sigma_n^{i}$. Because $\mu_n^{i}$ is fixed, we focus on $\sigma_n^{i}$. If the elements of a vector differ little, the vector is uniform and $(\sigma_n^{i})^{2}$ is small; since $(\sigma_n^{i})^{2}$ is positive, a small $(\sigma_n^{i})^{2}$ implies a large $1/\sigma_n^{i}$. The more uniform the vector, the larger the value $1/\sigma_n^{i}$, which is why we use it to quantify the uniformity of a vector. Because noise is not considered in this analysis, we expect the analytic result to be matched at high SNR.

Example 5 (example of message flow): For the parity check matrix H of an LDPC code,
$$H=\begin{bmatrix}1&0&0&1&1&0\\ 1&0&1&0&0&1\\ 0&1&1&1&0&0\end{bmatrix}$$
Fig 5.1 shows the message flow of the first two iterations, and the following two tables are the contribution vector sets of the first and second iterations. The tuple $(V_a, V_b)$ ($V_a$ labels the first column of the tables, $V_b$ the first row) denotes the contribution from $V_b$ to $V_a$, so each row of a table represents the contribution vector of one variable node.

First step:
V1 → C1, C2 = [1, 0, 0, 0, 0, 0]
V3 → C2 = [0, 0, 1, 0, 0, 0]
V4 → C2 = [0, 0, 0, 1, 0, 0]
V5 → C1 = [0, 0, 0, 0, 1, 0]
V6 → C2 = [0, 0, 0, 0, 0, 1]
Second step:
C1 → V1 = [0, 0, 0, 0.5, 0.5, 0]
C2 → V1 = [0, 0, 0.5, 0, 0, 0.5]
Third step:
V1 → C1 = [0.5, 0, 0.25, 0, 0, 0.25]

Figure 5.1: message flow for example 7

iteration = 1
      V1      V2      V3      V4      V5      V6      σ²      1/σ
V1    0.3333  0       0.1666  0.1666  0.1666  0.1666  0.0556  4.24264
V2    0       0.5     0.25    0.25    0       0       0.2083  2.19089
V3    0.1666  0.1666  0.3333  0.1666  0       0.1666  0.0556  4.24264
V4    0.1666  0.1666  0.1666  0.3333  0.1666  0       0.0556  4.24264
V5    0.25    0       0       0.25    0.5     0       0.2083  2.19089
V6    0.25    0       0.25    0       0       0.5     0.2083  2.19089

iteration = 2
      V1      V2      V3      V4      V5      V6      σ²      1/σ
V1    0.3333  0.0833  0.125   0.125   0.0833  0.1667  0.1667  5.11682
V2    0.125   0.5     0.125   0.125   0.0625  0.0625  0.138   2.69171
V3    0.125   0.1667  0.3333  0.125   0.0833  0.1667  0.0382  5.11682
V4    0.125   0.1667  0.125   0.3333  0.1667  0.0833  0.0382  5.11682
V5    0.125   0.0625  0.125   0.125   0.5     0.0625  0.138   2.69171
V6    0.125   0.0625  0.125   0.125   0.0625  0.5     0.138   2.69171
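The four steps above are easy to reproduce in code. The sketch below (an editorial addition) runs the contribution-vector recursion on the H of Example 5; with one iteration it returns p(V1) = [1/3, 0, 1/6, 1/6, 1/6, 1/6], matching the first table row. Note that the 1/σ values listed in the tables appear to use the unnormalized sum of squared deviations rather than the /N form of equation (5.5); the final print follows the tables.

```python
import numpy as np

def message_flow(H, I_max):
    """Contribution-vector recursion of Section 5.1, eqs. (5.1)-(5.3).
    b is divided by deg(n) so its entries sum to one, as in Fig 5.1."""
    M, N = H.shape
    b = {(m, n): np.eye(N)[n] for m in range(M) for n in range(N) if H[m, n]}
    for _ in range(I_max):
        # eq. (5.1): average the incoming b vectors at each check node
        c = {}
        for m in range(M):
            nb = np.flatnonzero(H[m])
            for n in nb:
                c[(m, n)] = sum(b[(m, k)] for k in nb if k != n) / (len(nb) - 1)
        # eqs. (5.2)-(5.3): recombine at each variable node
        p = np.zeros((N, N))
        for n in range(N):
            mb = np.flatnonzero(H[:, n])
            for m in mb:
                b[(m, n)] = (np.eye(N)[n] + sum(c[(k, n)] for k in mb if k != m)) / len(mb)
            p[n] = np.eye(N)[n] + sum(c[(m, n)] for m in mb)
            p[n] /= p[n].sum()                 # gamma normalization
    return p

H = np.array([[1, 0, 0, 1, 1, 0],
              [1, 0, 1, 0, 0, 1],
              [0, 1, 1, 1, 0, 0]])
p = message_flow(H, 1)
# 1/sigma computed as the tables do (unnormalized sum): 4.2426 for V1.
print(p[0], 1.0 / np.sqrt(((p[0] - p[0].mean()) ** 2).sum()))
```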

5.2 Comparing EXIT chart to message flow

Before reading this section, one should first look at Fig 6.7 in Section 6.2, which shows the message flow result for the QC-LDPC code generated by the 802.11n specification. The message flow of the previous section gives a method for finding the different noise-resisting abilities of the different variable nodes. Here we observe the amount of mutual information of different bits to forecast this phenomenon roughly. The following shows the mutual information for the traditional BP algorithm, where IEb-VC denotes the IE from variable nodes to check nodes restricted to those bits whose value 1/sqrt(Var) in Fig 6.7 is larger than 150.

1 dB     i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEb-VC   0.563895 0.68092  0.735431 0.769741 0.793753 0.811279 0.824541 0.834421
IEVC     0.563327 0.628319 0.664278 0.68898  0.70823  0.724211 0.738177 0.750516

2 dB     i=1      i=2      i=3      i=4      i=5      i=6      i=7      i=8
IEb-VC   0.643299 0.80696  0.884653 0.930109 0.956167 0.970733 0.978948 0.983486

Table: mutual information of bit-to-check/check-to-bit for traditional BP at Eb/N0 = 1 and 2 dB

From the table above we see that in every iteration IEb-VC is larger than IEVC. A larger IEb-VC shows that, given a set of a priori inputs, the bits with better performance collect more mutual information than the other bits, implying that the variable nodes in Fig 6.7 with larger 1/sqrt(Var) values have better error-correcting ability. The EXIT chart is thus another way to expose unequal error protection, and its result coincides with that of the message flow.

5.3 Power allocation based on message flow

Having computed the message flow and anticipated the phenomenon of unequal noise resisting, a question arises: is there any processing that further improves the performance? Because different variables enjoy distinct error protection, it is natural to attach a weight to some variables.

We now consider how the weight should be attached. Many kinds of weighting methods are used in decoding algorithms, for example the iterative decoding of turbo product codes by Pyndiah [8]. An easy weighting process is to give a heavy weight to the variable-to-check messages coming from the variable nodes with better performance, and a slight weight to messages coming from the variables with worse performance. This looks like a good method, but it clashes with the principle of the BP algorithm in equation (2.10) of Section 2.3.

We therefore look for an efficient way to put the concept of weighting into decoding without violating the BP principle. From equations (2.8) to (2.10), the messages $\varepsilon_{mn}^{i}$ and $z_{mn}^{i}$ all depend on $\varepsilon_{mn}^{i-1}$ and $z_{mn}^{i-1}$, so any direct adjustment would distort BP at the next iteration of the iterative decoding. We therefore choose $F_n$ as the tuning argument.

$F_n$ is a suitable tuning argument for three reasons. First, $F_n$ does not change with iteration. Second, equation (2.10) shows that $z_{mn}^{i}$ contains $F_n$; in other words, adjusting $F_n$ can represent a weighting of the messages. Third, after successive iterations all messages are built from the $F_n$, which again embodies the idea of weighting.

Since $F_n=4y_n/N_0$ and $N_0$ is fixed, only the remaining part $y_n$ can be tuned. Since $y_n=x_n+\text{AWGN}$, $y_n$ is related to $x_n$, where $x_n$ is a codeword bit after modulation. Let $x=[x_1, x_2, \ldots, x_N]$ be the modulated codeword vector. The value of $x_n$ depends on the total transmitting power. Therefore, allocating power to each $x_n$ produces a variation of $y_n$ under a fixed power constraint. As long as an adequate power allocation method can be found, we can further improve the performance.

Since the value of $x_n$ is changed, the $F_n$ of the decoding algorithm must be modified accordingly. Let $A_n$ be the amplitude of $x_n$; $A_n=1$ is the condition without power allocation. Then
$$F_n=\ln\frac{\Pr(x_n=+A_n\,|\,y_n)}{\Pr(x_n=-A_n\,|\,y_n)}=\ln\frac{e^{-(y_n-A_n)^2/N_0}}{e^{-(y_n+A_n)^2/N_0}}=\frac{4\,A_n\,y_n}{N_0}$$
At the receiver, this modified $F_n$ must replace the $F_n$ used without power allocation.
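Below is a small sketch of the modified channel LLR (an editorial addition with a purely hypothetical allocation): the amplitudes of one set of bits are boosted and the rest attenuated under a fixed total power, and the receiver computes $F_n = 4 A_n y_n / N_0$ as derived above.

```python
import numpy as np

def channel_llr(y, A, N0):
    """Channel LLR with per-bit amplitude A_n (Section 5.3):
    F_n = 4*A_n*y_n/N0; A_n = 1 recovers the unallocated case."""
    return 4.0 * A * y / N0

# Hypothetical allocation: boost half of the bits (e.g. those flagged as
# weak by the message flow), attenuate the rest, keeping sum(A**2) = N.
N, N0 = 480, 1.0
weak = np.zeros(N, dtype=bool); weak[:240] = True
A = np.where(weak, 1.05, np.sqrt(2 - 1.05 ** 2))   # power-preserving split
x = 1 - 2 * np.random.randint(0, 2, N)             # BPSK codeword (+/-1)
y = A * x + np.sqrt(N0 / 2) * np.random.randn(N)   # AWGN channel
F = channel_llr(y, A, N0)
```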

5.4 Channel mapping based on message flow

Before starting this section, we briefly introduce the frequency-selective channel. In wireless telecommunications, multipath is the propagation phenomenon by which radio signals reach the receiving antenna via two or more paths (Fig 5.2). The effects of multipath include constructive and destructive interference and phase shifting of the signal. When the signal is viewed in the frequency domain, the parameter of concern is the channel's coherence bandwidth $B_c$, a measure of the transmission bandwidth over which signal distortion across the channel becomes noticeable. A multipath channel is said to be frequency-selective if the coherence bandwidth of the channel is smaller than the bandwidth of the signal. Fig 5.3 shows an example channel response of a frequency-selective channel.

Figure 5.2: model of a multipath channel

Next we come to the emphasis of this section. If performance is indeed improved by power allocation, we must discuss what power allocation brings to the system. The weighting method of the previous section is just an idea; at the system level, the concept of weighting actually creates a different Eb/N0 at different bit positions. Different variable nodes have different noise-resisting abilities and performance, so distinct Eb/N0 values can be used to improve the performance of the variable nodes with worse BER.

Generally there are several ways to achieve distinct Eb/N0. If all the channel gains are the same, power allocation seems to be the sole solution; but when the channel response is not flat, power allocation is not strictly necessary. Assume one channel has gain |H|; then we can write Y = |H|·X + W (assuming the phase response is known), where X is the transmitted signal, Y the received signal, and W the noise. |H| acts like an adjustable argument: compared with |H| = 1, |H| > 1 means X is amplified and the equivalent Eb/N0 is enhanced, while |H| < 1 implies the Eb/N0 is reduced.

Figure 5.3: a channel response example of frequency selective channel

Finally, just as in power allocation, we simply find all the channel gains and choose the channels with higher gains to carry the bits with better or worse performance, according to the requirement.
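The mapping itself is just a sort-and-assign step. The sketch below (an editorial addition with toy channel taps and stand-in robustness values) pairs subcarriers, sorted by gain, with bits sorted by a robustness measure such as the 1/σ of this chapter; whether the strongest subcarriers go to the weakest or the strongest bits is the designer's choice, as the text notes.

```python
import numpy as np

def map_bits_to_subcarriers(gains, strength):
    """Channel-mapping idea of Section 5.4: sort subcarriers by |H_k| and
    assign the bits needing the most help to the strongest subcarriers.
    strength[n] is the bit's robustness (e.g. 1/sigma from message flow)."""
    sub_order = np.argsort(gains)[::-1]      # strongest subcarrier first
    bit_order = np.argsort(strength)         # weakest bit first
    mapping = np.empty_like(bit_order)
    mapping[bit_order] = sub_order           # bit n -> subcarrier mapping[n]
    return mapping

gains = np.abs(np.fft.fft(np.array([0.8, 0.5, 0.3, 0.2]), 480))  # toy multipath
strength = np.random.rand(480)               # stand-in for 1/sigma values
mapping = map_bits_to_subcarriers(gains, strength)
```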

Chapter 6

Simulation result

In this chapter we report some results from computer simulation. We use codes derived by expanding the base model matrix defined in the IEEE 802.11n [9] specification and codes from the base model given in the IEEE 802.16e [10] specification for our analysis. First, we compare the shuffled method and the group schedule method for VSBP and HSBP (with group numbers G=2,4,6) against conventional BP decoders. For convenience we use C-BP, C-VSBP and C-HSBP to denote conventional BP, VSBP and HSBP, and G-VSBP, G-HSBP to denote the group schedule method for VSBP and HSBP. Second, we run the message flow of the BP algorithm to estimate the distinct noise-resisting abilities of different bits in a codeword, feed the result into the channel mapping algorithm, and compare the performance with channel mapping against that without mapping.

6.1 Error rate performance comparison

We use the rate-1/2 quasi-cyclic LDPC (QC-LDPC) code of codeword length 480, created from the base model matrix of the IEEE 802.11n specification, as the basis of our analysis. Fig 6.1 and Fig 6.2 present the bit error rate (BER) performance with Imax=1, 3 for G=2, 4 and 6. They show that our algorithm speeds up the convergence of VSBP compared with the conventional shuffled BP algorithm. Fig 6.3 and Fig 6.4 display similar results for HSBP.

Figure 6.1: BER performance of the (480,240) QC-LDPC code with Imax=1, VSBP G=2,4,6.

Comparing Fig 6.2 with Fig 6.4, for fixed SNR and G the gap between C-HSBP and G-HSBP seems larger than the gap between C-VSBP and G-VSBP. We explain this as follows. In the i-th sub-iteration of VSBP, only the check-to-variable messages connected with the variable nodes of the i-th group are updated, and a complete iteration includes several sub-iterations. The variable nodes in the later groups (mostly information bits) usually receive reliable information updated in the previous sub-iterations, so variable nodes in different groups obtain different information reliability. Because nodes in different groups acquire distinct reliabilities within one iteration, the information becomes non-uniform, and this non-uniformity propagates to the next iteration and grows over several iterations. For HSBP, in contrast, almost all check-to-variable messages sent from the check nodes of group i reach information bits, so in every sub-iteration the information bits receive messages from the check nodes; compared with VSBP, the information dispersion is more uniform.

