• 沒有找到結果。

X-Structured Precoder Designs for Spatial-Multiplexing MIMO and MIMO Relay Systems

N/A
N/A
Protected

Academic year: 2021

Share "X-Structured Precoder Designs for Spatial-Multiplexing MIMO and MIMO Relay Systems"

Copied!
14
0
0

加載中.... (立即查看全文)

全文

(1)

X-Structured Precoder Designs for

Spatial-Multiplexing MIMO and

MIMO Relay Systems

Chun-Tao Lin, Member, IEEE, and Wen-Rong Wu, Member, IEEE

Abstract—In multiple-input–multiple-output (MIMO)

trans-mission, precoding has been considered a promising technique to improve the system performance. In general, the criterion of precoder design depends on the detector used at the receiver. For the maximum-likelihood (ML) detector, the criterion is to maximize the minimum distance of the received signal constel-lation. Unfortunately, the derivation of the optimum solution is known to be difficult, and suboptimum solutions have then been developed. One promising approach confines the precoder having an X-structure. Several methods have been developed to solve the X-structured precoder. However, most of them use numerical searches to find their solutions and require lookup tables during run time. In this paper, we propose a systematic design method to solve the problems. The proposed precoder has a simple closed-form expression, and no numerical searches and lookup tables are required. Simulation results show that the proposed method can yield almost the same performance as the existing methods. We also consider the problem of joint source/relay precoder design in a two-hop amplify-and-forward MIMO relay system. Since the problem is much more involved and a closed-form solution is intractable to find, we then extend the use of the proposed X-structured precoder so that the problem can be reformulated as a simple scalar-valued optimization problem. Simulations show that the proposed method can significantly outperform existing joint design methods.

Index Terms—Free distance, geometric mean

decomposi-tion (GMD), maximum-likelihood (ML) detecdecomposi-tion, MIMO, multiple-input–multiple-output (MIMO) relay systems, spatial multiplexing, X-structured precoder.

I. INTRODUCTION

I

N RECENT years, the multiple-input–multiple-output (MIMO) technique has been widely adopted in wireless communication systems. In MIMO systems, it is well known that transmission with spatial multiplexing can provide higher spectral efficiency without bandwidth expansion [1], [2]. How-ever, its performance heavily depends on the condition number of the channel matrix [3]. Precoding is an effective method to

Manuscript received September 4, 2012; revised February 13, 2013; accepted April 28, 2013. Date of publication May 22, 2013; date of current version November 6, 2013. This work was supported in part by the National Science Council under Grant NSC-101-2221-E-009-095-MY3. This paper was presented in part at the 2012 IEEE Global Communications Conference. The review of this paper was coordinated by Dr. Y. Ma.

The authors are with the Institute of Communications Engineering, National Chiao Tung University, Hsinchu 300, Taiwan (e-mail: tow.cm91g@nctu.edu.tw; wrwu@faculty.nctu.edu.tw).

Digital Object Identifier 10.1109/TVT.2013.2264688

overcome this problem. With a low-rate feedback link, channel state information (CSI) is acquired at the transmitter, and a precoder can then be designed and applied.

The precoder design problem has been extensively studied in the literature. For the mutual information criterion, a method that is referred to as mercury/water-filling [4] was proposed for parallel Gaussian channels, and the obtained precoder, being diagonal, is shown to be the optimum power-allocation scheme. For general MIMO channels, an iterative algorithm was proposed in [5]; however, optimality is not guaranteed. The globally optimum and low-complexity suboptimum precoders have been also proposed in [6] and [7], respectively. Except for mutual information, there are also other criteria used for the pre-coder design. Prepre-coders that maximize the signal-to-noise ratio (SNR) or achieve the minimum mean square error (MMSE) were developed for linear receivers in [8]–[11]. Although the computational complexity of linear receivers is low, the per-formance is often not satisfactory. Two nonlinear receivers are well known, i.e., successive interference cancelation (SIC) and maximum-likelihood (ML) detection. The optimum precoder for the QR-SIC receiver [12], minimizing the block error rate (BLER), has been solved with geometric mean decomposition (GMD) [13], [14]. In addition, the optimum precoder for the MMSE-SIC receiver was also solved with uniform channel decomposition [15]. It is known that the performance of an ML receiver is dominated by the minimum distance of received signal constellations, which is referred to as free distance. It is a common criterion used for the precoder design of an ML detector [16]–[20]. Unfortunately, finding the precoder that maximizes the free distance is known to be difficult; the opti-mum precoder remains unsolved. Note that mutual information can have a lower bound expressed by a function of free distance, and the bound will become tight when the SNR is high [7]. However, the solutions in [5]–[7] do not have closed-form expressions; their implementations require high computational complexity.

Recently, a promising suboptimum approach for the ML receiver has been developed. It first adopts singular value decomposition (SVD) to transform the MIMO channel into parallel subchannels. Then, the subchannels are paired to obtain a set of 2× 2 MIMO subsystems, and 2 × 2 subprecoders are designed to maximize the free distance in the MIMO subsys-tems. We refer to this approach as precoding with X-structure, which has been considered in [17]–[19]. Precoding with X-structure not only facilitates the precoder design but yields

(2)

a low-complexity ML receiver as well since we only have to deal with 2 × 2 MIMO subsystems. For 4-quadrature amplitude modulation (QAM), the optimum complex-valued subprecoder was first found in [16]. The result was extended to higher QAM constellations in [20]; however, optimality no longer held. Recently, an orthogonal subprecoder derived from a rotation matrix has been proposed in [18]. The advantage of this approach is that the design complexity can be reduced. However, the closed-form expression of the optimum rotation angle is only available for 4-QAM. A numerical search is required for the angle in higher QAM constellations. More recently, an optimum real-valued subprecoder has been de-veloped [19]. It also requires the numerical search in design and lookup tables in run time. In this paper, we propose a design method for the X-structured precoder. The main idea is using the GMD method in the subprecoder design. With the proposed method, no numerical searches are required in the design phase, and no lookup tables are required during the application phase. Simulation results show that the performance of the proposed method is almost as good as the existing methods.

In recent years, cooperative communications have been con-sidered as a promising method to improve the performance of point-to-point MIMO systems [21], [22]. By employing relays between the source and the destination, the signal can be transmitted via the source-to-relay and then the relay-to-destination link. Multiple antennas can be equipped at each node to form a MIMO relay system. In the MIMO relay system, both the source and relay nodes can conduct precoding, which is referred to as joint source/relay precoding. Several joint precoder designs for amplify-and-forward (AF) MIMO relay systems have been proposed. In [23] and [24], the precoders maximizing the channel capacity were developed. In [25]–[31], the precoders using the MMSE criterion were designed for the linear receivers. To the best of our knowledge, the joint precoder design in the MIMO relay system with an ML receiver has yet to be considered. We then extend the proposed method to the MIMO relay system. However, the problem becomes much more involved in this case, and a closed-form solution is difficult to obtain. To solve the problem, we propose iterative methods to derive the source and relay precoders alternatingly. First, assume that the relay precoder is given; the source precoder design can then be easily solved by the proposed MIMO precoder. Then, with the solved source precoder, the relay precoder can be solved and updated. The derivation of the relay precoder for a given source precoder, however, is much more complicated due to the fact that the MIMO relay channel is a nonlinear and complicated function of the relay precoder. We then propose two methods overcoming the problem such that the relay precoder can be efficiently solved with the Karush–Kuhn–Tucker (KKT) conditions [32]. Simulation results show that the proposed methods significantly outperform existing methods.

The remainder of this paper is organized as follows: Section II briefs the system model we use. Sections III and IV describe the proposed design methods for MIMO and MIMO relay systems, respectively. Section V reports the simulation results, and finally, Section VI draws conclusions.

Fig. 1. System model for a precoded spatial-multiplexing MIMO system.

II. SYSTEM ANDSIGNALMODELS

Consider a precoded spatial-multiplexing MIMO system with Nt transmit antennas and Nr receive antennas, as de-scribed in Fig. 1. Let x = [x1, x2, . . . , xM]T denote the M× 1 symbol vector, where (.)T represents the transpose operation. In spatial multiplexing, each entry of x is independently chosen from a finite-set constellationX . For L-QAM, X is given by

X =√1 K  ±a ± bj|a, b ∈ (1, 3, . . . ,√L− 1)  (1) where K = (2/3)(L− 1). Let H denote the Nr× Ntchannel matrix, which is assumed to be perfectly known at both the transmitter and the receiver. In addition, we assume that M =

Nt≤ Nrand that H remains unchanged for a given precoding period. Using the CSI at the transmitter, we can design an

Nt× M precoding matrix M and conduct precoding by left-multiplying M on x. In general, the power of M has to be con-strained, i.e., tr{MHM} = P

T, where (.)H is the Hermitian transpose operation, and PT is a constant. The received symbol vector can then be expressed as

y = HMx + n (2)

where n is the Gaussian noise vector. Assume that each entry of n is identically and independently distributed (i.i.d.) with the covariance matrix of σ2

nINr, where σ

2

n is the noise variance, and INr denotes an Nr× Nr identity matrix. At the receiver, the ML detector searches all possible symbol vectors to obtain an estimate ˆx such that

ˆ

x = min

x∈XMy − HMx (3)

whereXM is a set consisting of all possible transmitted symbol vectors. Let di,j denote the distance between two distinct received symbol vectors HMxiand HMxj. The free distance of a MIMO system is then defined as

dfree Δ

= min

i=j di,j =xi,xj∈XminM,xi=xj

HM(xi− xj) . (4) The error probability of (3) can then be represented as

Pe= 1 LM LM  i=1 LM  j=1,j=i Pij (5)

where Pij is the probability that transmitted symbol vector xi is falsely detected to some other xj. If dfree is much smaller

(3)

by (4). Under this assumption, the design criterion for M can be chosen as maximizing the free distance of a MIMO system. This problem is known to be difficult since dfree depends on

discrete setXM. Note that error probability P

eis also referred to as the BLER in this paper.

A recently developed method to overcome the problem is the application of the X-structured precoder [17]–[19]. To do that, H is first transformed into parallel subchannels, and precoding is applied on the subchannels. Using SVD, we can have H = UΣVH, where U is an Nr× M matrix, Σ is an M× M real-valued diagonal matrix, and VH is an

M× Nt matrix. Note that UHU = VHV = IM and Σ = diag(λ1, λ2, . . . , λM), where λi’s are the singular values of

H. Let λi’s be arranged in a descending order, i.e., λ1≥ λ2

· · · ≥ λM > 0 and M = VF, where F is an M× M matrix. Left-multiplying UHon y, we can have the equivalent received signal model as

r = ΣFx + n (6)

where r = UHy and n= UHn is the noise vector with the same covariance matrix as n. In what follows, we will refer to F as the precoder since M can be derived if F has been obtained. Note that the power constraint of the precoder still holds, i.e., tr{MHM} = tr{FHF}. Therefore, we can have the problem reformulated as max F x,x∈XminM,x=xΣF(x − x ) s.t. tr{FHF} = PT. (7)

The diagonal structure of Σ in (7) greatly facilitates the deriva-tion of the soluderiva-tion. The key idea is to apply an X-structure on F, which means that subchannels ordered according to their singular values are paired to obtain a set of 2× 2 subsystems, and 2× 2 subprecoders are designed to maximize the free distance of the subsystems.

We first focus on the design of a 2× 2 precoder in (7). For

M = 2, Σ can be expressed as Σ =  λ1 0 0 λ2  = ρ  cos γ 0 0 sin γ  (8) where ρ =λ2

1+ λ22is the channel gain, and γ = tan−1(λ2/

λ1) is an angle related to the channel. Clearly, 0 < γ≤ (π/4)

so that λ1≥ λ2 is satisfied. With SVD, F can be written as F = UFΣFVHF, where both UF and VF are 2 × 2 unitary matrices, and ΣFis a 2× 2 diagonal matrix. It has been shown in [16] that UF can be an identity matrix without affecting the system performance for symmetric QAM modulations. Letting VH

F = VF,1VF,2[16], we can have F expressed as follows:

F =PTΣFVF,1VF,2 =PT  cos ψ 0 0 sin ψ   ΣF  cos θ sin θ − sin θ cos θ   VF,1  1 0 0 eiϕ   VF,2 (9) where 0≤ ψ ≤ (π/2), 0 ≤ θ ≤ (π/4), and 0 ≤ ϕ ≤ (π/2). Given a set of difference vectors, the subprecoder can be found with (8) and (9) by a numerical search over all possible values of ψ, θ, and ϕ. In [16], the optimum subprecoder for 4-QAM was found, and its application to higher QAM constellations was studied in [20]. Note that the subprecoder has complex values. To reduce the design complexity, VF,2in (9) can be set as an identity matrix, yielding a real-valued subprecoder [19]. For further reduction of design complexity, ΣFcan be set as an identity matrix as well, giving an orthogonal subprecoder [18]. As mentioned, most existing subprecoders require numerical searches in design or lookup tables in run time. In this paper, we propose a design method to avoid these problems.

III. PROPOSEDSUBPRECODERS FOR

MULTIPLE-INPUT–MULTIPLE-OUTPUTSYSTEMS

To avoid numerical search or lookup tables, we propose a systematic design giving simple closed-form expressions for any QAM sizes. To do that, we consider an alternative design criterion in (7). Instead of maximizing the free distance itself, we propose maximizing a lower bound.

A. Lower Bound for Free Distance

Considering the QR decomposition (QRD) of a full column-rank matrix H, we can have H = QR, where Q is an Nr× Nt columnwise orthonormal matrix, and R is an Nt× Nt upper-triangular matrix with positive real-valued diagonal entries. Let

dmin denote the minimum distance of a QAM constellation.

From (1), we have dmin= 2/

K. In [14], it was shown that

the free distance of a spatial-multiplexing MIMO system can be lower bounded by

dfree≥ min

i R(i, i)dmin= [R]min 2

K (10)

where R(i, i) denotes the ith diagonal entry of R, [R]min

represents the minimum value among R(i, i)’s, and the equality holds when all R(i, i)’s are equal [14]. It can be seen that the free distance is dominated by [R]min and K. Note that K

is independent of the channel matrix. Hence, maximizing the lower bound of the free distance is equivalent to maximizing [R]min. With a unitary precoder, the QRD of the equivalent

channel will have the property that Nt

i=1R(i, i) = C, where C is a constant. Clearly, the lower bound in (10) is maximized

when R has equal diagonal entries. This precoder can be obtained by GMD given by [13], [14]

H = ˜Q ˜RPH (11)

where ˜Q is an Nr× Nt columnwise orthonormal matrix, P is an Nt× Nt unitary matrix, and ˜R is an Nt× Nt upper triangular matrix having equal diagonal entries. From (11), we can have HP = ˜Q ˜R. In other words, P can be used as a precoder so that the lower bound in (10) can be maximized.

(4)

It can be seen that each diagonal element of ˜R is equal to the geometric mean of R(i, i)’s, i.e.,

˜ R(i, i) = N t  i=1 R(i, i) 1 Nt . (12)

Note that P is independent of the QAM constellation, which is a great advantage when designing the precoder for the ML receiver. Although the GMD method is analytically tractable, its corresponding free distance will be significantly degraded for ill-conditioned channels. To overcome the problem, we propose a method described in the following.

B. GMD-Based Subprecoder With Rank Deficiency

As discussed, we now focus on a 2× 2 subsystem. Note that for Σ with M = 2, FGMDhas a simple expression as

FGMD=  PT 2 ⎡ ⎣  λ2 λ12  λ1 λ12  λ1 λ12  λ2 λ12 ⎤ ⎦ . (13)

Furthermore, the resultant free distance can be also easily obtained by dfree,FGMD=  PTρ2 4 K cos γ sin γ 2 . (14)

The derivations of (13) and (14) can be found in [14]. From (14), we can see that dfree,FGMD≈ 0 as γ approaches zero.

Note that γ is associated with the condition number of the channel matrix. In other words, the performance of FGMD

can be significantly degraded for ill-conditioned channels. This is because FGMD is a full-rank matrix and always uses two

subchannels for signal transmission. This structure constraint becomes inefficient when the value of γ is small, and too much power is allocated in the worse subchannel. This problem can be avoided by pouring all the transmit power on the stronger subchannel λ1 and maps the two bit streams into a scalar

symbol for transmission. In such case, the resultant subprecoder will be rank deficient. From (9), it is simple to see that a complex-valued rank-deficient subprecoder, which is denoted by Fd,c, can be expressed as Fd,c=  PT  cos θd,c sin θd,cejϕd,c 0 0  (15) where 0≤ θd,c≤ (π/4), and 0 ≤ ϕd,c≤ (π/2). With (13) and (15), we then propose a subprecoder expressed as

Fp,1= ⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩  PT  cos θd,c sin θd,cejϕd,c 0 0   Fd,c , for γ≤ γ1 FGMD, for γ > γ1. (16) By letting N = 2(1/2) logL2 − 1, the optimum angles for Fd,c

can be expressed as ϕd,c= tan−1(1/(2N +

3)) and θd,c= tan−1(2 sin ϕd,c) [16]. Threshold γ1 in (16) can be derived

by the channel angle satisfying dfree,FGMD = dfree,Fd,c, where

dfree,Fd,c denotes the corresponding free distance of Fd,c. Therefore, we can have γ1= tan−1(2/(N2+

3N + 2)). We can also use a real-valued rank-deficient subprecoder, which is denoted by Fd,r, to derive another subprecoder. Note that Fd,r can be simply obtained by removing VF,2 in (9). Let l = (1/2) log2(L) and θd,r= tan−1(2−l), where l∈ {1, 2, . . . , 5}. We then propose an alternative subprecoder as

Fp,2= ⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩  PT  cos θd,r sin θd,r 0 0   Fd,r , for γ≤ γ2 FGMD, for γ > γ2. (17)

With the similar method used for Fp,1, we can have γ2= tan−1(2 sin2(tan−1(2−l))). From (16) and (17), we

can see that both Fp,1and Fp,2have closed-form expressions. Thus, no lookup tables are required.

Note that the lower bound in (10) instead of dfreeis used in

our design. Optimality then becomes a main concern. Since a theoretical analysis is very difficult, we only provide a simple qualitative analysis to this problem. It is easy to see that the tightness of the lower bound in (10) is proportional to channel angle γ. For well-conditioned channels, γ is large, and the error due to the use of the lower bound will be small. For ill-conditioned channels, γ is small, and the bound will be loose; however, a rank-deficient subprecoder is used instead. The subprecoder in [18], which is assumed to be orthogonal, also suffers from performance loss for ill-conditioned channels, whereas the subprecoder in [18], which is assumed to be real, does not suffer from performance loss. When γ is small, the scheme in [19] also applies a rank-deficient subprecoder. Simulation results in Section V show that our proposed methods can provide near-optimum results.

C. Extension to MIMO Systems forM > 2

For a MIMO system with M > 2, the X-structured precoder allows a simple solution for the ML receiver. Assuming that

M is even, then every two subchannels can be paired to

yield (M/2) subsystems. Therefore, the 2 × 2 subprecoders discussed in the previous sections can be directly applied. Let the subchannels be ordered with respect to their sin-gular values. Then, the X-structured precoder pairs the ith and the (M− i + 1)th subchannels, and the corresponding channel gain and channel angle can be expressed as ρi= 

λ2

i + λ2M−i+1 and γi= tan−1(λM−i+1/λi), respectively. Using γi, we can construct the resultant subprecoder Fp,(i) for the ith subsystem. Let each subprecoder have the unit-power constraint, i.e., tr{FH

p,(i)Fp,(i)} = 1. In addition, let dfree,i denote the free distance provided by the ith

subsys-tem, and dfree,min is the minimum value among all dfree,i’s.

Clearly, the overall system performance is dominated by the subsystem with dfree,min. Hence, we can adopt a

power-allocation scheme aiming to maximize dfree,min so that the

performance can be further enhanced. Let Υ = diag(Υ1,

Υ2, . . . , ΥM−1, ΥM, ΥM, ΥM−1, . . . , Υ2, Υ1), where Υiis the power allocated on the ith subsystem, and Mis the number

(5)

TABLE I

COMPLEXITYCOMPARISONS FORSUBPRECODERDESIGNS

of subsystems. Similar to the method used in [17], the solution of Υiis given by Υi=  PT d2free,i M  k=1 1 d2 free,k 1 2 . (18)

With (18), we can see that Υ is to equalize the free distance among all subsystems. That means the subsystem with a smaller free distance will be allocated higher transmit power. It can be easily verified that Υidfree,i≥ dfree,min, and hence, the

performance is improved. With Υ, the resultant precoder is still of X-structure. If M is odd, the ((M + 1)/2)th symbol is then independently precoded and detected without coupling another symbol. Note that computing Υi requires information on dfree,i. In this regard, the proposed subprecoders are more

efficient since the free distance can be easily calculated.

D. Complexity Comparisons

First, we use the number of floating operations (FLOPS) required in a precoding scheme as the measure for computa-tional complexity. The complexity for conducting the SVD of H isO(Nr2Nt+ Nt2Nr), that for calculating M is O(NtM ) FLOPS, and that for calculating r isO(Nr2) FLOPS. The given operations are involved in all the methods. Note that both the orthogonal and real-valued precoders need lookup tables in their applications, and this is an additional overhead that is not required for the proposed precoders.

Next, we compare the ML-detection complexity in each method. It has been shown in [19] that the ML-detection complexity in a subsystem with the complex-valued precoder is a function of O(L√L). The detection complexity can be

reduced toO(√L) when the precoder consists of real values.

Thus, the ML-detection complexity associated with Fp,1 is higher than that with Fp,2. Nevertheless, the probability for the event that γ≤ γ1decreases for large values of L. Therefore, the

detection complexity corresponding to Fp,1can be alsoO(

L)

for higher QAM constellations. The complexity comparison for all precoders is summarized in Table I.

IV. PRECODERDESIGN FORMULTIPLE-INPUT– MULTIPLE-OUTPUTRELAYSYSTEMS

Here, we will consider the precoder design in two-hop AF MIMO relay systems. Similar to the conventional MIMO system, we can conduct precoding at both the source and relay nodes. These two precoders can be jointly optimized for further

Fig. 2. System model for a precoded spatial-multiplexing MIMO relay system.

performance enhancement. Note that the ML receiver is used at the destination.

A. Problem Formulation and Source Precoder Design

The system model we consider is shown in Fig. 2. Let HSR denote the Nre× Nssource-to-relay channel matrix and HRD denote the Nd× Nre relay-to-destination channel matrix. In the AF MIMO relay scheme, the signal transmission is divided into two phases. In Phase I, x is transmitted from the source and then received at the relay. We assume that the destination cannot receive the signal from the source in Phase I due to long distance. In the scenario, the relay node can serve as a repeater for the extension of transmission coverage. Let FS denote the Ns× M source precoding matrix. The received signal at the relay can then be expressed as yR= HSRFS x + nR, where

nR is the Gaussian noise vector with a covariance matrix of σ2

n,rINre. In Phase II, yR is first left-multiplied by an Nre×

Nre relay precoding matrix FR and then retransmitted to the destination through HRD. The received signal at the destination can be expressed as

yD= HRDFRHSRFS x + HRDFRnR+ nD (19) where nD is a Gaussian noise vector with a covariance ma-trix of σ2

n,dINd. The received signal can then be rewritten as yD= HFSx + n, where H = HRDFRHSR, and n =

HRDFRnR+ nD. It is simple to see that n is not white, and the covariance matrix of n can be found as

(6)

whereE[.] denotes the expectation operation. To facilitate our derivation, we first conduct whitening processing on yD. Let

W denote the whitening matrix. From (20), we can have W = R 1 2 n =  σ2n,rHRDFRFRHHHRD+ σ2n,dINd 1 2. (21)

After whitening, the received signal can be written as ˜

yD= WyD= ˜HFSx + ˜n (22) where ˜n = Wn and ˜H = WH is the Nd× Ns equivalent channel matrix, which can be expanded as

˜ H =σn,r2 HRDFRFHRHHRD+ σ2n,dINd 1 2 HRDFRHSR. (23) The free distance corresponding to ˜H can then be defined as

dfree= min

xixj∈XM,xi=xj 

 ˜HFS(xi− xj) . (24) Therefore, the objective now is to find the precoders, i.e., FS and FR so that the free distance can be maximized. The optimization problem can be formulated as

max FS,FR dfree s.t. C1: tr  FSFHS  = PS,T C2: tr  FRyRyRHFHR  = trFR  σn,r2 INre+ HSRFSF H SHHSR  FHR= PR,T (25) where C1 and C2 represent the transmit power constraints at

the source and the relay, respectively. As we can see, solving the problem in (25) is complicated since dfreeis a nonlinear and

complicated function of FSand FR. In addition, both precoders are coupled in constraint C2. As a result, the optimum solution

is difficult to find. To solve the problem, we propose an iterative method solving FS and FR alternatingly. First, we let FRbe given and then have the optimization as

max FS min xixj∈XM,xi=xj   ˜HFS(xi− xj) s.t. trFSFHS  = PS,T. (26)

It is easy to see that (26) is similar to a MIMO precoder design problem. Consider the following SVDs:

HRD= URDΣRDVHRD and HSR= USRΣSRVSRH (27) where URD, VRDH , USR, and VHSR are columnwise ortho-normal matrices, both ΣSRand ΣRDare M×M diagonal mat-rices, and UH

RDURD= UHSRUSR= VHRDVRD= VHSRVSR=

IM. Assume that M = Ns= Nre≤ Nd. We can let FS be of the form

FS = VSRFS (28)

where FS is an M× M matrix. With (28), we can further rewrite received signal ˜yDas

˜

yD= ˜HVSRFSx + ˜n = ˜HFSx + ˜n (29) where ˜H= ˜HVSR, which can be expanded as

˜

H=σ2n,rHRDFRFHRHHRD+ σ2n,dINd 1

2

× HRDFRUSRΣSR. (30) Let ˜Σdenote the matrix with the singular values of ˜H. With

˜

Σ, we can solve FS by applying the methods proposed in Section III. For a given FS, however, finding the solution of

FRis much more involved. This problem is investigated in the following section.

B. Relay Precoder Design

From (30), we first observe that ˜H is still a nonlinear and complicated function of FR, and finding the optimum FR is still difficult. Even if the optimum solution can be found, we have to conduct a new SVD and matrix inversion when solving FS at each iteration. This will greatly increase the computational complexity of the problem. To overcome these problems, we propose imposing a special structure on FRsuch that the X-structure of FS can be maintained, and at the same time, FRcan be solved with a closed-form solution. The main idea is to let FR diagonalize the equivalent channel ˜H. This can be easily accomplished by choosing FRas

FR= VRDΣRUHSR (31) where ΣR is an M× M diagonal matrix, which needs to be designed. With (31), we can rewrite whitening matrix W as follows:

W =σ2n,rΣRDΣRΣHRΣHRD+ σn,d2 IM 1

2UH

RD. (32) With (31) and (32), the resultant ˜H is an M × M diagonal matrix expressed as ˜ H=σ2n,rΣRDΣRΣHRΣHRD+ σ2n,dIM 1 2 ΣRDΣRΣSR. (33) Thus, ˜H in (33) is diagonal and can be directly used for constructing FS without extra SVD and matrix inversion op-erations. In addition, the relay precoder to be determined is reduced to diagonal matrix ΣR. This can significantly reduce the complexity of the joint design.

Let Pe,idenote the BLER of the ith subsystem. We can have the overall BLER, i.e., Pe, expressed as

Pe= 1 M  i=1 (1− Pe,i) M  i=1 Pe,i (34)

where Mis the number of the subsystems. The approximation error in (34) is neglectable under high-SNR scenarios. As discussed, Pe,i is dominated by the free distance of the ith subsystem. Due to the whitening matrix in (32), the covariance

(7)

matrix of ˜n is an identity matrix. Using the Chernoff bound, we can further have

Pe≈ 1 2 M  i=1 exp d2 free,i 4  Pw = 1 2Pw. (35)

As seen, minimizing Pw is approximately equivalent to mini-mizing Pe. Since directly minimizing Pw is still difficult, we then seek a lower bound from which our optimization can be conducted.

Proposition 1: Given a full-rank FS, the relay precoder in (31), and the channel matrix in (33), maximizing det( ˜HHH˜) is equivalent to minimizing a lower bound of Pw. The lower bound is equal to Mexp 1 4  M  i=1d 2 free,i 1 M 

where Mis the number of the subsystems.

Proof: To prove the proposition, we need the following

lemmas. 

Lemma 1: Let w1and w2be two positive values satisfying

w1+ w2= 1 and consider two positive values x and y. If y≥

x≥ 1, then the following inequality holds:

exp−xw1yw2 ≤ w1exp−x+w2exp−y. (36)

The proof of the lemma is provided in Appendix A. With the lemma, we can then derive another lemma.

Lemma 2: Assume that FS, ΣSR, and ΣRDare given. Then, Pwis lower bounded as Pw≥ Mexp 1 4  M  i=1d 2 free,i 1 M  . (37)

The proof of the lemma is provided in Appendix B. To proceed, without loss of generality, we can first consider the system with a full-rank FS. Using (14), d2

free,i can be

expressed as d2free,i= ˜H(i, i) ˜H(M− i + 1, M − i + 1) for

i = 1, 2, . . . , M. Here, is a constant for a given QAM con-stellation and PS,T. Then, we have

M  i=1 d2free,i= M  det( ˜HHH˜). (38)

We then come to the conclusion that maximizing det( ˜HHH˜) is equivalent to minimizing the lower bound in (37).

Note that the equality in (37) holds when d2

free,i’s are equal

for all i = 1, 2, . . . , M. In other words, the tightness of (37) depends on the spread of d2

free,i’s. The smaller the spread is,

the tighter the lower bound will be. A large spread of d2 free,i’s

often means a poor channel condition, and a rank-deficient sub-procoder will be used instead. In X-structured FS, the ith and the (M− i + 1)th subchannels are paired together. Therefore, all d2free,i’s will be close. This property further tightens the lower bound in (37). From the given discussions, we see that the lower bound we use will be tight, and the errors due to the approximations applied will be small. Simulation results

in Section V conform to the conjecture, and our methods can significantly improve precoding performance.

Using the fact that the arithmetic mean is greater than or equal to the geometric mean, we can easily derive another lower bound for Pwas Pw≥ Mexp 1 4 !M  i=1d 2 free,i 1 M  . (39)

The following property compares the lower bound in (37) and that in (39).

Property 1: The lower bound in (37) is tighter than that

in (39).

Proof: See Appendix C. 

Note that the result in Proposition 1 is only valid for the proposed subprecoders with which the properties of the GMD solution can be applied. For other subprecoders, (37) may not hold, and finding a solution for the precoders will become much more complicated. For Lemma 1 to apply, both x and y have to be greater than one. This translates into the fact that the BLER of the subsystem (in Lemma) must be less than 10−1, which is usually satisfied in real-world applications. It is worth noting that ( Mi=1 d2

free,i)(1/M

)

in (37) can be seen as a geometric free distance of the MIMO relay system, and it can be used to evaluate Pw. It is interesting to note that this is similar to the geometric SNR in multicarrier systems [33]. However, the geometric SNR is derived from the maximization of the channel capacity. Instead, the geometric free distance we derived here is to minimize the BLER. To proceed, let r denote the rank of FS. We first solve ΣRby assuming r = M . From (33), we have

˜

HHH˜= ΣHSRΣHRΣHRDσ2n,rΣRDΣRΣHRΣHRD+σ2n,dIM −1 × ΣRDΣRΣSR. (40) Defining A = ΣRDΣRΣHRΣHRD, we can have det( ˜HHH˜) as

det( ˜HHH˜) = detΣSRΣHSR  det  Aσn,r2 A+σ2n,dIM −1 . (41) Since det(ΣSRΣHSR) in (41) is independent of the relay precoder, ΣR can then be solved through the following optimization: max ΣR det  Aσ2n,rA + σn,d2 IM −1 s.t. trσn,r2 ΣRΣHR + ΣRΣSRFSFHS ΣHSRΣHR  = PR,T. (42) Let B = FSFHS . In addition, let σR,i, σsr,i, and σrd,ibe the ith diagonal entry of ΣR, ΣSR, and ΣRD, respectively. Taking ln operation, we can reformulate (42) as

min σ2 R,i M  i=1 ln σ2 R,iσrd,i2 σ2 R,iσrd,i2 σn,r2 + σ2n,d  s.t. M  i=1

(8)

We can see that the relay precoder design problem has been transformed into a scalar-valued optimization problem. The reformulated problem in (43) can be solved by using the KKT conditions [32]. The solution is then expressed in closed-form as σR,i2 = " # # $ μσ2 n,d σ2 rd,iσn,r2  σ2

n,r+ B(i, i)σ2sr,i  + σ2 n,d 2 rd,iσ2n,r 2 σ2n,d 2 rd,iσn,r2 (44)

where μ is chosen to satisfy the relay power constraint. As we can see, the proposed subprecoder does lead to a simple expression of ΣR.

As seen from (44), we have to know B when solving the relay precoder. To simplify the problem, first, we do not consider source power allocation Υ. Thus, B can be expressed as

B(i, i) = PS,T

M (45)

for i = 1, 2, . . . , M . Using the resultant ΣR and ˜H, the next step is to construct FSwith the methods proposed in Section III. If the channel angles of all subsystems are larger than the angle threshold, r is still equal to M . From the structure of B in (45), we can see that its diagonal entries are unchanged. In other words, ΣRwill not be changed by the updated FS. Then, the iteration stops in just one iteration. On the other hand, if at least one subprecoder is of rank deficiency, (45) does not hold. Then, the iteration cannot stop in just one iteration. Note that the order of the diagonal entries of ˜H can be changed during the iteration. In such case, we have to reformulate B and resolve

σ2

R,i, which may be complicated. The following proposition states that these operations can be simplified.

Proposition 2: With the relay precoder given in (44), the

diagonal entries of resultant ˜H exhibit a descending order when the SNR of the relay-to-destination path is much higher than that of the source-to-relay path.

Proof: Without loss of generality, we can let σ2

n,r = σ2

n,d= 1. Therefore, the SNRs of the source-to-relay and relay-to-destination channels are defined via the variances of HSR and HRD. Using (44), we can have

σR,i2 σ2rd,i= % μσ2 rd,i  1 + B(i, i)σ2 sr,i  +1 4 1 2. (46)

Without Υ at the source node, we can have B = αIM, where α = PS,T/M . Assume that the SNR of the relay-to-destination path is much higher than that of the source-to-relay path so that

μσ2

rd,i 1 + ασ2sr,i. Then, σ2R,iσ2rd,i can be approximately expressed as σ2R,iσ2rd,i % μσ2 rd,i 1 + ασ2sr,i. (47)

Hence, ˜Hcan be expressed as ˜

H(i, i) = σR,i2 σrd,i2 + 112σ

R,iσrd,iσsr,i

= % μσ2 rd,i 1 + ασ2 sr,i + 1 1 2 % μσ2 rd,i 1 + ασ2 sr,i 1 2 σsr,i = 1 + 1 + ασ 2 sr,i μσ2 rd,i 1 2 σsr,i≈ σsr,i. (48)

Then, the diagonal entries of ˜Hhave a descending order since

σsr,idecreases with i. 

Proposition 2 shows that the ordering of ˜H(i, i) will not be changed, and thus, the new pairing operations are not necessary during the iteration. We now investigate the case of rank defi-ciency. When at least a subsystem is rank deficient, r becomes smaller than M . From Proposition 2, it is simple to see that the channel angles of the paired subsystems have an ascending order, i.e., tan−1 ˜ H(M, M ) ˜ H(1, 1)  ≤ tan−1 ˜ H(M − 1, M − 1) ˜ H(2, 2)  ≤ · · · ≤ tan−1 ˜ HM 2 + 1, M 2 + 1  ˜ HM2,M2  . (49)

This property indicates that when r < M , the corresponding B will be of the following structure:

B = diag(2α, 2α, . . . , 2α M−r , α, α, . . . , α 2r−M , 0, 0, . . . , 0 M−r ). (50)

In the following proposition, we show how to solve ΣR when

B is of the structure in (50).

Proposition 3: Consider a rank-deficient FS with a rank equal to r, where (M/2)≤ r < M. Let ˜H1:mdenote a square matrix consisting of the first m columns and rows of ˜H. The objective function for the design of ΣR can be chosen as det( ˜HH1:M−rH˜1:M−r) det( ˜HH1:rH˜1:r) such that the lower bound in (37) can be minimized.

Proof: From (50), it is clear that the rank-deficient

subpre-coder is used in the first M − r subsystems, whereas the full-rank subprecoder is adopted in the last r− (M/2) subsystems. For the ith subsystem, its free distance can be expressed as

d2free,i= &

1,iH˜(i, i) ˜H(i, i), for 1≤ i ≤ M − r

2,iH˜(i, i) ˜H(M− i + 1, M − i + 1), for M − r + 1 ≤ i ≤ M2

(9)

(51), shown at the bottom of the previous page, where both 1,i

and 2,iare constants. Hence, we can rewrite

M i=1d2free,ias M  i=1 d2free,i= M−r j=1 ˜ H(j, j) ˜H(j, j)1,j× M  l=M−r+1,l≤M ˜ H(l, l) × ˜H(M− l + 1, M − l + 1)2,l = M−r j=1 1,j M  k=M−r+1 2,k   M−r l=1 ˜ H(l, l) r  p=1 ˜ H(p, p) =   det  ˜ HH1:M−rH˜1:M−r det  ˜ HH1:rH˜1:r . (52)

Similar to (38), it can be verified that  is a constant. Hence, the objective function is equivalent to maximizing det( ˜HH1:M−rH˜1:M−r) det( ˜HH1:rH˜1:r) when (M/2)≤ r < M.

 When (M/2)≤ r < M, we can conduct the rank deficiency operations for M− r subsystems, equivalently switching off

M− r subchannels at one time. However, this approach may

not be efficient. This is because the relay precoder will re-allocate its power, and some of the rank-deficient subsys-tems may become nondeficient in the next iteration. For this reason, we only let one subchannel be switched off at one time. Starting from the first subsystem, if we find that tan−1( ˜H(M, M )/ ˜H(1, 1)) is smaller than the angle thresh-old, then let r = M− 1 and reformulate B as

B = diag{2α, α, α, . . . , α, 0}. (53) Let κibe defined as κi= & 2, for i = 1, 2, . . . , M− r 1, for i = M− r + 1, M − r + 2, . . . , r. (54) From the results in Proposition 3, the optimization of σ2

R,ifor (M/2)≤ r < M can then be reformulated as

min σ2 R,i r  i=1 ln κiσ2R,iσ2rd,i σ2 R,iσrd,i2 σ2n,r+ σn,d2  s.t. r  i=1

σR,i2 σ2n,r+ B(i, i)σ2sr,i= PR,T. (55)

Then, the solution of the problem in (55) can be expressed as

σR,i2 = " # # $ μ¯2n,d σ2 rd,iσn,r2  σ2

n,r+ B(i, i)σ2sr,i  + σ2 n,d 2 rd,iσ2n,r 2 σ2n,d 2 rd,iσn,r2 (56) where ¯μi= μκi. From (53) and (54), we see that ¯μiand B(i, i) are equally affected by κi. Hence, we can assume ¯μσ2rd,i 1 + B(i, i)σ2sr,i, and then rewrite (47) as

σR,i2 σrd,i2 % ¯ μiσ2rd,i 1 + B(i, i)σ2 sr,i .

Fig. 3. Algorithm 1: Joint source/relay precoder design without source power allocation Υ.

TABLE II

JOINTSOURCE/RELAYPRECODERDESIGNWITHSOURCE

POWERALLOCATION

As a result, the statement in Proposition 2 is still valid here. The given procedure is then repeatedly conducted until all nonzero channel angles are larger than the angle threshold or

r is equal to (M/2). When r = M/2, ¯μi becomes a constant for all i = 1, 2, . . . , r. In this case, the structure of σ2

R,iin (56) will be reduced to that in (44). Fig. 3 summarizes the complete operations of the proposed algorithm (Algorithm 1).

In previous discussions, we do not conduct source power allocation among all subsystems. This simplification may result in performance loss since the error rate performance can be affected by the subsystem with the minimum free distance. Exploiting Υ will further complicate the optimization of ΣR since the behavior of B is no longer easy to follow. To overcome this, we propose another iterative method to conduct power allocation. We assume that initial FSis of full-rank, and initial Υ, which is denoted by Υ(0), is a scaled-identity matrix. Hence, initial B, which is denoted by B(0), is also a scaled-identity matrix. Using Algorithm 1, we can solve Σ(1)R and then obtain the resultant F(1)S . With (18), we can have the updated Υ(1). Consequently, B(1) can be obtained and then used in the next iteration. This method is repeatedly conducted until no further improvement can be obtained. The details of the algorithm (Algorithm 2) are summarized in Table II.

(10)

Fig. 4. BLER performance comparisons for MIMO systems with Nt= 2,

Nr= 2, and M = 2.

V. SIMULATIONRESULTS

Here, we report simulation results demonstrating the effec-tiveness of the proposed algorithms. In the simulations, flat-fading MIMO channels are considered, and each entry of H is assumed to be an i.i.d. complex Gaussian random variable with zero mean and unit variance.

A. Performance Comparisons for MIMO Systems

First, we evaluate the performance of various X-structured precoders. Five subprecoders are compared: 1) complex-valued subprecoder Fc; 2) real-valued subprecoder Fr; 3) orthogonal subprecoder Fo; 4) the proposed subprecoder Fp,1; and 5) the proposed subprecoder Fp,2. The SNR is defined as SNR = PT/σ2n. Fig. 4 compares the precoding performance for a MIMO system with Nt= 2, Nr= 2, and M = 2, showing the results of 4- and 16-QAM schemes simultaneously. As we can see, the proposed subprecoders outperform Fo and give comparable performance with that of Fr. As mentioned, Fr requires lookup tables in run time. Furthermore, Fc slightly outperforms other subprecoders in 4-QAM but suffers from per-formance loss in 16-QAM. It is known that the perper-formance of Fofor 16-QAM is worse than that of Fr[19] and is not shown here. Although FGMD provides the suboptimum solution, the

performance loss of either Fp,1or Fp,2is limited. As discussed, the corresponding free distance for each subprecoder decreases when γ is decreased, and the error rate performance tends to be seriously affected. Instead of using FGMD, the proposed

method uses a rank-deficient subprecoder for a small value of

γ, and the performance loss can be mitigated.

Fig. 5 compares the precoding performance for a MIMO sys-tem with Nt= 4, Nr= 4, and M = 4. In this scenario, power allocation is conducted with (18). In the figure, we observe similar behaviors as those in Fig. 4. Note that although Fccan provide optimum performance for 4-QAM, it requires higher detection complexity compared with the other methods. Fig. 6 compares the precoding performance when spatial correlation is considered at the transmitter (and 4-QAM is used). The

cor-Fig. 5. BLER performance comparisons for MIMO systems with Nt= 4,

Nr= 4, and M = 4.

Fig. 6. BLER performance comparisons for correlated MIMO channels with

Nt= 4, Nr= 4, and M = 4.

related channel matrix is modeled by Hcorr= HR(1/2)t , where

Rtdenotes the transmit correlation matrix. In the simulations, two scenarios are considered, i.e., partially decorrelated and highly correlated; the corresponding Rt’s can be found in [34] and [35]. As shown in the figure, our proposed methods still provide comparable performance with Fr.

B. Performance Comparisons for MIMO Relay Systems

Next, we evaluate the performance of various precoding methods in a two-hop AF MIMO relay system. Seven sys-tems are considered; the first five syssys-tems use ML detection in the receiver, whereas the last two systems use QR-SIC detection. They are as follows: 1) jointly precoded system with Fp,1 and FR (JP-Fp,1-FR-ML-PA); 2) jointly precoded system with Fp,2 and FR (JP-Fp,2-FR-ML-PA); 3) jointly precoded system with Fp,2 and FR without source power allocation (JP-Fp,2-FR-ML-WPA); 4) source precoded system

(11)

Fig. 7. BLER performance comparisons for MIMO relay systems with 4-QAM.

with Fp,1 (SP-Fp,1-ML-PA); 5) unprecoded system with ML detection (UP-ML); 6) jointly precoded system (JP-QR-SIC); and 7) source precoded system (SP-QR-SIC). Both the JP-QR-SIC and SP-JP-QR-SIC are obtained from [36] without con-sidering the source-to-destination link. In addition, let SNRsr and SNRrd be defined as SNRsr = PS,T/σn,r2 and SNRrd= PR,T/σn,d2 , respectively.

Fig. 7 shows the performance comparison for 4-QAM. Here, we let Ns= 4, Nre= 4, Nd= 4, SNRrd= 25 dB, and SNRsr be varied. In addition, let I = 3 in Algorithm 2 for reasonable precoding complexity. As seen, the proposed JP-Fp,1-FR-ML and JP-Fp,2-FR-ML schemes indeed outperform other precod-ing methods. It is also observed that about 1-dB performance loss (at the BLER of 10−3) will be induced when source power-allocation matrix Υ is not included. Compared with the UP-ML scheme, the proposed methods can provide more than 4-dB performance improvement when the BLER is 10−3. Note that the detection complexity of the UP-ML scheme can be much higher than the other methods due to the requirement of 4× 4 ML detection. The SP-Fp,1-ML-PA scheme suffers from performance loss since only source precoding is considered in the system. As for the JP-QR-SIC and SP-QR-SIC schemes, their performances significantly degrade for a high SNR. Note that in QR-SIC detection, the detection complexity at each receive antenna isO(L). Moreover, a complete GMD operation is required for finding the source precoder in either the JP-QR-SIC or the SP-QR-JP-QR-SIC scheme. It is clear that the proposed precoding methods are more efficient. We also compare the detection complexity of all methods; the result is shown in Table III. As we can see, the computational complexity of the proposed methods is similar to that of QR-SIC.

Fig. 8 shows the performance comparison for 16-QAM. Here, we let Ns= 4, Nre= 4, Nd= 4, I = 5, SNRrd= 35 dB, and SNRsr be varied. It can be observed that the proposed methods still provide significant performance im-provement. Comparing with the results in Fig. 7, we can see that the performance improvement of the proposed methods is slightly reduced. This result can be explained by the fact that

TABLE III

DETECTIONCOMPLEXITYCOMPARISON INMIMO RELAYSYSTEMS

Fig. 8. BLER performance comparisons for MIMO relay systems with 16-QAM.

the free distance yielded by the proposed subprecoders will be rapidly decreased for small values of γ when a large QAM is considered. In other words, the improvement obtained from the rank-deficient subprecoders is reduced.

VI. CONCLUSION

In this paper, we have proposed a systematic method to construct the X-structured precoder for spatial-multiplexing MIMO systems. The proposed subprecoders do not require any numerical search in the design phase and lookup tables in the application phase. In addition, the proposed subprecoders result in almost the same performance and detection complexity as those in the real-valued subprecoder. Except for MIMO systems, we also consider the joint source/relay precoder design in two-hop AF MIMO relay systems. The proposed methods remain the X-structure of the precoder, and the detection com-plexity is low. This is a great advantage for real-world appli-cations. Simulation results show that the proposed methods can significantly outperform the existing methods. Note that X-structured precoding is suboptimal in the sense that only 2 × 2 subsystems are considered. To approach the optimum solution more closely, we can consider subsystems whose dimensions are larger than 2 × 2. In this case, the design of the subprecoder will become very complicated with the conventional methods. It will be much easier to extend the proposed methods in the design of the subprecoder. Research in this subject is now underway.

(12)

APPENDIXA PROOF OFLEMMA1

Let x be fixed, and define two functions f (y) and g(y) as

f (y) = exp−xw1yw2 (57)

g(y) = w1exp−x+w2exp−y (58)

where we assume y≥ x ≥ 1 and w1+ w2= 1. The problem

now is to prove that f (y)≤ g(y). Assume that both f(y) and

g(y) are continuous and differentiable for y≥ x. Then, we can

rewrite f (y) and g(y) as

f (y) = f (x) + y ' x f(t1)dt1 (59) g(y) = g(x) + y ' x g(t1)dt1 (60)

where f(t1) and g(t1) denote the first derivative of f (t1) and

g(t1), respectively. From (57) and (58), it is easy to verify that

f (x) = g(x). Hence, the problem is equivalent to showing that y ' x f(t1)dt1 y ' x g(t1)dt1. (61)

It is simple to see that if f(t1)≤ g(t1) for t1≥ x, then (61)

holds. Consequently, f(t1) and g(t1) can be expressed as

f(t1) = − w2xw1exp−x

w1tw2

1 tw2−1

1 (62)

g(t1) = − w2exp−t1. (63)

Since −w2< 0, we see that conditions f(t1)≤ g(t1) and

(−1/w2)f(t1)≥ (−1/w2)g(t1) are equivalent. It can be seen

that both (−1/w2)f(t1) and (−1/w2)g(t1) are positive

val-ues. Define two functions f1(t1) and g1(t1) as

f1(t1) Δ = ln−f (t 1) w2 = w1ln x− w1ln t1− xw1tw12 (64) g1(t1) Δ = ln−g (t 1) w2 =−t1. (65)

The problem now is equivalent to proving f1(t1)≥ g1(t1).

Using a method similar to that in (59) and (60), we can first rewrite f1(t1) and g1(t1) as follows:

f1(t1) = f1(x) + t1 ' x f1(t2)dt2 (66) g1(t1) = g1(x) + t1 ' x g1(t2)dt2. (67)

It can be verified that f1(x) = g1(x). Hence, if f1(t2)≥ g1(t2)

for t2≥ x, then f1(t1)≥ g1(t1). From (64) and (65), we have

f1(t2) = w1 t2 − x w1tw2−1 2 w2 (68) g1(t2) = − 1. (69) For t2= x≥ 1, we have f1(t2)|t2=x= w1 x − x w1xw2−1w 2 =−w1 x − w2 ≥ − (w1+ w2) = g1(t2). (70)

For t2> x≥ 1, we observe that (x/t2) < 1 and that

f1(t2)|t2>x= w1 t2 − x w1tw2−1 2 w2 =−w1 t2 − w2 ( x t2 )w1 ≥ − w1− w2 ( x t2 )w1 >− w1− w2= g1(t2). (71)

From (70) and (71), we can see that f1(t2)≥ g1(t2) for any

given t2≥ x. With this result, we can conclude that f(y) ≤

g(y) for y≥ x ≥ 1, which completes the proof of this lemma.

APPENDIXB PROOF OFLEMMA2

Using Lemma 1, we can now use mathematical induc-tion to prove Lemma 2. Without loss of generality, we can assume that dfree,1 ≥ dfree,2 ≥ · · · ≥ dfree,M. Letting w1=

w2= 1/2, y = d2free,1/4, and x = d2free,2/4, we have the

fol-lowing inequality: exp  d2 free,1 4 d2 free,2 4 1 2exp −d2free,1 4 +1 2exp −d2free,2 4 . (72)

From (72), it is obvious that (37) is true for M= 2. Then, we assume that the statement in Lemma 2 is true for M− 1(M> 2). That is exp14  d2 free,1d 2 free,2···d 2 free,M −1  1 M −1 1 M− 1 × exp −d2free,1 4 + exp d2 free,2 4 +· · ·+exp− d2 free,M −1 4  . (73) Now, for the MIMO system with Msubsystems, we can have the following equivalence:

exp14

 d2

free,1d2free,2···d2free,M 

 1 M  = exp  1 4  d2

free,1d2free,2···d2free,M −1

 1 M −1 M −1 M  1 4d2free,M   1 M  . (74) Using Lemma 1 again, we obtain

exp  1 4  d2 free,1d 2 free,2···d 2 free,M −1  1 M −1 M −1 M  1 4d 2 free,M   1 M  ≤M− 1 M exp  M i=1−1d2free,i 4 1 M −1 + 1 Mexp −d2free,M  4 . (75)

(13)

Substituting (73) into (75), we can have the following in-equality: exp14  d2 free,1d 2 free,2···d 2 free,M   1 M  M− 1 M 1 M− 1 M−1 i=1 exp −d2free,i 4 + 1 Mexp −d2free,M  4 (76) = 1 M M−1 i=1 exp d2 free,i 4 + 1 Mexp −d2free,M  4 (77) = 1 M M  i=1 exp d2free,i 4 . (78)

It can be verified that the equality in (76) holds when all dfree,i’s

are equal. From (76)–(78), we can have

Pw= M  i=1 exp d2free,i 4 ≥ Mexp 1 4  M  i=1d 2 free,i 1 M  (79)

which completes the proof of Lemma 2. APPENDIXC PROOF OFPROPERTY1

First, using the fact that the arithmetic mean is greater than or equal to the geometric mean, we can prove (39) as follows:

Pw= M  i=1 exp d2 free,i 4 ≥ M M i=1 exp d2 free,i 4  1 M  = M ( exp14 !M  i=1d 2 free,i ) 1 M  = Mexp 1 4  1 M  !M  i=1d 2 free,i . (80)

Second, using the fact again, we can obtain M i=1 d2free,i  1 M  1 M M  i=1 d2free,i. (81)

From (79)–(81), we then have

Pw= M  i=1 exp d2 free,i 4 ≥ Mexp 1 4  M  i=1d 2 free,i 1 M  ≥ Mexp14  1 M  !M  i=1d 2 free,i . (82) As seen in (82), the lower bound in (37) is tighter than that in (39).

REFERENCES

[1] G. J. Foschini, “Layered space–time architecture for wireless commu-nication in a fading environment when using multi-element antennas,”

Bell Labs. Tech. J., vol. 1, no. 2, pp. 41–59, 1996.

[2] P. W. Wolniansky, G. J. Foschini, G. D. Golden, and R. A. Valenzuela, “V-BLAST: An architecture for realizing very high data rates over the

rich-scattering wireless channel,” in Proc. URSI Int. Symp. Signals, Syst.,

Electron., Pisa, Italy, Sep. 1998, pp. 295–300.

[3] L. Zheng and D. N. C. Tse, “Diversity and multiplexing: A fundamental tradeoff in multiple-antenna channels,” IEEE Trans. Inf. Theory, vol. 49, no. 5, pp. 1073–1096, May 2003.

[4] A. Lozano, A. M. Tulino, and S. Verdu, “Optimum power allocation for parallel Gaussian channels with arbitrary input distributions,” IEEE Trans.

Inf. Theory, vol. 52, no. 7, pp. 3033–3051, Jul. 2006.

[5] F. Perez-Cruz, M. R. D. Rodrigues, and S. Verdu, “MIMO Gaussian channels with arbitrary inputs: Optimal precoding and power allocation,”

IEEE Trans. Inf. Theory, vol. 56, no. 3, pp. 1070–1084, Mar. 2010.

[6] C. Xiao, Y. R. Zheng, and Z. Ding, “Globally optimal linear precoders for finite alphabet signals over complex vector Gaussian channels,” IEEE

Trans. Signal Process., vol. 59, no. 7, pp. 3301–3314, Jul. 2011.

[7] W. Zeng, C. Xiao, and J. Lu, “A low-complexity design of linear pre-coding for MIMO channels with finite-alphabet inputs,” IEEE Wireless

Commun. Lett., vol. 1, no. 1, pp. 38–41, Feb. 2012.

[8] P. Stoica and G. Ganesan, “Maximum-SNR spatial–temporal formatting designs for MIMO channels,” IEEE Trans. Signal Process., vol. 50, no. 12, pp. 3036–3042, Dec. 2002.

[9] H. Sampath, P. Stoica, and A. Paulraj, “Generalized linear precoder and decoder design for MIMO channels using the weighted MMSE criterion,”

IEEE Trans. Commun., vol. 49, no. 12, pp. 2198–2206, Dec. 2001.

[10] A. Scaglione, P. Stoica, S. Barbarossa, G. B. Giannakis, and H. Sampath, “Optimal designs for space–time linear precoders and decoders,” IEEE

Trans. Signal Process., vol. 50, no. 5, pp. 1051–1064, May 2002.

[11] D. P. Palomar, J. M. Cioffi, and M. A. Lagunas, “Joint Tx–Rx beam-forming design for multicarrier MIMO channels: A unified framework for convex optimization,” IEEE Trans. Signal Process., vol. 51, no. 9, pp. 2381–2401, Sep. 2003.

[12] D. Wubben, R. Bohnke, J. Rinas, V. Kuhn, and K. D. Kammeyer, “Effi-cient algorithm for decoding layered space–time codes,” Electron. Lett., vol. 37, no. 22, pp. 1348–1350, Oct. 2001.

[13] Y. Jiang, J. Li, and W. W. Hager, “Joint transceiver design for MIMO com-munications using geometric mean decomposition,” IEEE Trans. Signal

Process., vol. 53, no. 10, pp. 3791–3803, Oct. 2005.

[14] J.-K. Zhang, A. Kavcic, and K. M. Wong, “Equal-diagonal QR decompo-sition and its application to precoder design for successive-cancellation detection,” IEEE Trans. Inf. Theory, vol. 51, no. 1, pp. 154–172, Jan. 2005.

[15] Y. Jiang, J. Li, and W. W. Hager, “Uniform channel decomposition for MIMO communications,” IEEE Trans. Signal Process., vol. 53, no. 11, pp. 4283–4294, Nov. 2005.

[16] L. Collin, O. Berder, P. Rostaing, and G. Burel, “Optimal minimum distance-based precoder for MIMO spatial multiplexing systems,” IEEE

Trans. Signal Process., vol. 52, no. 3, pp. 617–627, Mar. 2004.

[17] B. Vrigneau, J. Letessier, P. Rostaing, L. Collin, and G. Burel, “Extension of the MIMO precoder based on the minimum Euclidean distance: A cross-form matrix,” IEEE J. Sel. Topics Signal Process., vol. 2, no. 2, pp. 135–146, Apr. 2008.

[18] S. K. Mohammed, E. Viterbo, Y. Hong, and A. Chockalingam, “MIMO precoding with X- and Y-codes,” IEEE Trans. Inf. Theory, vol. 57, no. 6, pp. 3542–3566, Jun. 2011.

[19] K. P. Srinath and B. S. Rajan, “A low ML-decoding complexity, full-diversity, full-rate MIMO precoder,” IEEE Trans. Signal Process., vol. 59, no. 11, pp. 5485–5498, Nov. 2011.

[20] Q.-T. Ngo, O. Berder, and P. Scalart, “Minimum Euclidean distance based precoders for MIMO systems using rectangular QAM modula-tions,” IEEE Trans. Signal Process., vol. 60, no. 3, pp. 1527–1533, Mar. 2012.

[21] J. N. Laneman, D. N. C. Tse, and G. W. Wornell, “Cooperative diversity in wireless networks: Efficient protocols and outage behavior,” IEEE Trans.

Inf. Theory, vol. 50, no. 12, pp. 3062–3080, Dec. 2004.

[22] B. Wang, J. Zhang, and A. Høst-Madsen, “On the capacity of MIMO relay channels,” IEEE Trans. Inf. Theory, vol. 51, no. 1, pp. 29–43, Jan. 2005. [23] X. Tang and Y. Hua, “Optimal design of non-regenerative MIMO wireless

relays,” IEEE Trans. Wireless Commun., vol. 6, no. 4, pp. 1398–1407, Apr. 2007.

[24] R. Mo and Y. H. Chew, “Precoder design for non-regenerative MIMO relay systems,” IEEE Trans. Wireless Commun., vol. 8, no. 10, pp. 5041– 5049, Oct. 2009.

[25] W. Guan and H. Luo, “Joint MMSE transceiver design in non-regenerative MIMO relay systems,” IEEE Commun. Lett., vol. 12, no. 7, pp. 517–519, Jul. 2008.

[26] F.-S. Tseng, W.-R. Wu, and J.-Y. Wu, “Joint source/relay precoder design in nonregenerative cooperative systems using an MMSE criterion,” IEEE

數據

Fig. 1. System model for a precoded spatial-multiplexing MIMO system.
Fig. 3. Algorithm 1: Joint source/relay precoder design without source power allocation Υ.
Fig. 5 compares the precoding performance for a MIMO sys- sys-tem with N t = 4, N r = 4, and M = 4
Fig. 7. BLER performance comparisons for MIMO relay systems with 4-QAM.

參考文獻

相關文件

„  Exploit antenna diversity to increase the SNR of a single stream. „  Receive diversity and

„  Exploit antenna diversity to deliver multiple streams concurrently. • 

A spoken language understanding (SLU) component requires the domain ontology to decode utterances into semantic forms, which contain core content (a set of slots and slot-fillers)

Matrix Factorization is an effective method for recommender systems (e.g., Netflix Prize and KDD Cup 2011).. But training

Matrix factorization (MF) and its extensions are now widely used in recommender systems.. In this talk I will briefly discuss three research works related to

 積熱電驛(Thermal relay, TH-RY),又稱為積熱過載 電驛(Thermal over-load relay, OL).  電磁開關(Magnetic

where L is lower triangular and U is upper triangular, then the operation counts can be reduced to O(2n 2 )!.. The results are shown in the following table... 113) in

In practice, ρ is usually of order 10 for partial pivot