Complex symmetric stabilizing solution of the matrix equation X plus A(T)X(-1)A = Q

(1)

Contents lists available atScienceDirect

Linear Algebra and its Applications

journal homepage:w w w . e l s e v i e r . c o m / l o c a t e / l a a

Complex symmetric stabilizing solution of the matrix

equation X

+

A

X

−

1 A

=

Q

Chun-Hua Guo

a,1

, Yueh-Cheng Kuo

b,2

, Wen-Wei Lin

c,d,

∗

,3

a_{Department of Mathematics and Statistics, University of Regina, Regina, Saskatchewan, Canada S4S 0A2} b_{Department of Applied Mathematics, National University of Kaohsiung, Kaohsiung 811, Taiwan} c_{Department of Mathematics, National Taiwan University, Taipei 106, Taiwan}

d_{Center of Mathematical Modelling and Scientific Computing, National Chiao Tung University, Hsinchu 300, Taiwan}

A R T I C L E I N F O A B S T R A C T Article history:

Received 5 May 2010 Accepted 16 March 2011 Available online 20 April 2011 Submitted by V. Mehrmann

AMS classification:

15A24 65F30

Keywords:

Nonlinear matrix equation Complex symmetric solution Stabilizing solution Doubling algorithm

We study the matrix equation X+AX−1A=Q , where A is a com-plex square matrix and Q is comcom-plex symmetric. Special cases of this equation appear in Green’s function calculation in nano research and also in the vibration analysis of fast trains. In those ap-plications, the existence of a unique complex symmetric stabilizing solution has been proved using advanced results on linear opera-tors. The stabilizing solution is the solution of practical interest. In this paper we provide an elementary proof of the existence for the general matrix equation, under an assumption that is satisfied for the two special applications. Moreover, our new approach here re-veals that the unique complex symmetric stabilizing solution has a positive definite imaginary part. The unique stabilizing solution can be computed efficiently by the doubling algorithm.

1. Introduction

The matrix equation X

+

A∗X−1A

=

Q , where Q is Hermitian positive definite, arises in several applications. The corresponding real case is the matrix equation X

+

AX−1A

=

Q , where A is real and Q is real symmetric positive definite. In both cases, we may assume without loss of generality that Q

=

I, the identity matrix. These equations have been studied in [1,3,5,10,13,15], for example.

∗ Corresponding author at: Department of Mathematics, National Taiwan University, Taipei 106, Taiwan.

E-mail addresses:[email protected](C.-H. Guo),[email protected](Y.-C. Kuo),[email protected](W.-W. Lin). 1

The work of this author was supported in part by a grant from the Natural Sciences and Engineering Research Council of Canada. 2 _{The work of this author was partially supported by the National Science Council in Taiwan.}

3

The work of this author was partially supported by the National Science Council and the National Center for Theoretical Sciences in Taiwan.

(2)

Recently, there arises the need to consider the matrix equation

X

+

AX−1A

=

Q

,

(1) where A is complex and Q is complex symmetric. First, it is explained in [6] that the computation of the surface Green’s function in nano research [2,8,9] can be reduced to the problem of solving the matrix equation (1), where Q

=

Q1

+

iQ2 with Q1 real symmetric and Q2

= η

I for a

pos-itive scalar

η

, but the matrix A is still a real matrix. And then it is shown in [7] that a quadratic eigenvalue problem arising from the vibration analysis of fast trains [11] can be solved efficiently and accurately by solving a matrix equation of the form (1), where A is complex and Q is complex symmetric.

In those two applications, the existence of a unique complex symmetric stabilizing solution has been proved using advanced results on linear operators (see [4, Chapter XXIV, Theorem 4.1,12]). The stabilizing solution is the solution of practical interest. In Section 2 we provide an elementary proof of the existence for the general matrix equation (1), under an assumption that is satisfied for the two special applications. Moreover, our new approach reveals that the unique complex symmetric stabilizing solution has a positive definite imaginary part. In Section 3 we make some concluding remarks. In particular, we mention that the unique stabilizing solution can be computed efficiently by the doubling algorithm, as for the special case studied in [7].

2. Existence of complex symmetric stabilizing solution

For Eq. (1) we write

A

=

A1

+

iA2

,

Q

=

Q1

+

iQ2 (2)

with A1, A2, Q1

=

Q1, Q2

=

Q2

∈ R

n×n. A solution X of (1) is said to be stabilizing if

ρ(

X−1A

) <

1,

where

ρ(·)

denotes the spectral radius. The assumption we need to guarantee the existence of a stabilizing solution is

Q2

+

eiθA2

+

e−iθA2

>

0

,

for

θ ∈ [

0

,

2

π].

(3)

Here W

>

0 denotes the positive definiteness of a Hermitian matrix W . This assumption is satis-fied for the two applications we mentioned earlier. In particular, the assumption is trivially satissatis-fied for the nano application since A2

=

0 and Q2

= η

I with

η >

0 there. Note that we do not need

any further assumptions on the matrices A1 and Q1. Also, if (3) has been verified for the matrices

A2 and Q2, then it also holds when any positive semi-definite matrix is added to Q2. From [3] we

also know that (3) holds if and only if the matrix equation Y

+

A₂Y−1A2

=

Q2 has a real

sym-metric positive definite stabilizing solution Y . So one way to verify the assumption (3) is to use the doubling algorithm in [10] or the equivalent cyclic reduction algorithm in [13] to find the stabilizing solution Y .

We now assume (3) and let

M

=

⎡ ⎢ ⎣ A 0 Q

−

I ⎤ ⎥ ⎦

,

L

=

⎡ ⎢ ⎣ 0 I A 0 ⎤ ⎥ ⎦

.

(4)

It is easily seen that the matrix pair

(

M

,

L

)

satisfies the relation MJM

=

LJL, where J

=

⎡ ⎣ 0 I

−

I 0 ⎤ ⎦. The matrix pair

(

M

,

L

)

or the matrix pencil M

− λ

L is called

−

symplectic. It holds that

λ

is an eigenvalue of

(

M

,

L

)

if and only if 1

/λ

is an eigenvalue of

(

M

,

L

)

, with the same multiplicity. Here

λ

can be 0 or

∞

.

(3)

Proof. We show that M

−

eiθL is nonsingular for all

θ ∈ [

0

,

2

π]

. Suppose there are a

θ

0

∈ [

0

,

2

π]

and a nonzero vector x

=

x₁

,

x₂ with x1

,

x2

∈ C

nsuch that

(

M

−

eiθ0L

)

x

=

0. This implies that

Ax1

=

eiθ0x2

,

Qx1

−

x2

=

eiθ0Ax1

.

(5) By eliminating x2in (5) we have Hx1

≡

eiθ0_A

−

_Q

+

_e−iθ0_A_x 1

=

0

.

(6)

Write H

=

H1

+

iH2, where H1

=

eiθ0A1

−

Q1

+

e−iθ0A1and H2

=

eiθ0A2

−

Q2

+

e−iθ0A2. It is easily

seen that H1and H2are Hermitian. From assumption (3) it holds that H2is negative definite. By the

classical Bendixson theorem (see [14] for example) H1

+

iH2is invertible. From (6) and (5) it follows

that x1

=

0 and x2

=

0. Thus, M

−

eiθL is nonsingular for all

θ ∈ [

0

,

2

π].

From Lemma1we see that there is a matrix ⎡ ⎣U

V ⎤

⎦

_{∈ C}

2n×n_{of full rank spanning the stable invariant}

subspace of M

− λ

L corresponding to the stable eigenvalue matrix S

∈ C

n×n, i.e., ⎡ ⎣A 0 Q

−

I ⎤ ⎦ ⎡ ⎣U V ⎤ ⎦

₌

⎡ ⎣ 0 I A 0 ⎤ ⎦ ⎡ ⎣U V ⎤ ⎦ S

,

(7)

where

ρ(

S

) <

1. From (7) we get

AU

=

VS

,

(8)

QU

−

V

=

AUS

.

(9)

Multiplying (9) by U∗from the left we get U∗QU

−

U∗V

=

U∗AUS

=

U∗

A∗

+

2iA₂

US

.

(10)

Substituting (8) into (10), we have

U∗QU

−

U∗V

=

S∗V∗US

+

2iU∗A₂US

.

(11) Taking conjugate transposes in (11) and subtracting the result from (11) we obtain

2iU∗Q2U

+ (

V∗U

−

U∗V

) =

S∗

(

V∗U

−

U∗V

)

S

+

2i U∗A₂US

+

S∗U∗A2U

.

(12) Let K

=

i

(

V∗U

−

U∗V

).

(13) Then K is Hermitian. From (12) it follows that K satisfies the equation

K

−

S∗KS

=

2U∗Q2U

−

U∗A2US

−

S∗U∗A2U

.

(14)

Lemma 2. The matrix K in (13) is positive definite.

Proof. From (14), for any positive integer

we have K

− (

S∗

)

KS

=(

K

−

S∗KS

) +

S∗

(

K

−

S∗KS

)

S

+ · · ·

+ (

S∗

)

−1

(

K

−

S∗KS

)

S−1 (15)

=

2U∗Q2U

+

S∗U∗Q2US

+ · · · + (

S∗

)

−1U∗Q2US−1

−

U∗A₂US

−

S∗U∗A₂US2

− · · · − (

S∗

)

−1U∗A₂US

−

S∗U∗A2U

− (

S∗

)

2U∗A2US

− · · · − (

S∗

)

U∗A2US−1

.

(16)

(4)

Since

ρ(

S

) <

1, S

→

0 as

→ ∞

. Hence from (16) we have K

=

2Q2

−

A∗2S

−

S∗A2

,

(17) where Q2

=

∞ =0

(

S∗

)

U∗Q2US

,

A2

=

∞ =0

(

S∗

)

U∗A2US

.

(18)

Note that Q2

+

eiθA2

+

e−iθA2

>

0 for all

θ ∈ [

0

,

2

π]

is equivalent to that

A2

=

⎡ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ Q2

−

A2

−

A2 Q2

−

A2

... ... ...

... ...

⎤ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ (19)

is positive definite. From (17) and (18) it is easy to check that

K

=

2U∗

,

S∗U∗

, · · ·

A2 ⎡ ⎢ ⎢ ⎢ ⎢ ⎣ U US

...

⎤ ⎥ ⎥ ⎥ ⎥ ⎦

.

(20)

We need to show that z∗Kz

>

0 for all z

=

0. SinceA2 is positive definite, it is enough to show

Wz

=

0 for all z

=

0, where W is the rightmost block matrix in (20). Suppose Wz

=

0. Then Uz

=

0 and USz

=

0. It follows from (9) that Vz

=

QUz

−

AUSz

=

0. Thus

⎡ ⎣U

V ⎤

⎦ z

=

0 and then z

=

0 since ⎡

⎣U V ⎤

⎦ is of full rank.

The next result follows readily.

Theorem 3. The matrix U in (7) is invertible.

Proof. Suppose Ux

=

0 with x

∈ C

n. From (13) we have x∗Kx

=

x∗iV∗U

−

U∗V x

=

0

.

So x

=

0 since K is positive definite by Lemma2. Thus U is invertible.

Since U is invertible, we can define X

=

VU−1.

Theorem 4. Let X

=

VU−1. Then (a) X is complex symmetric; (b) X is invertible;

(c) X is a stabilizing solution of (1); (d) X2

≡

Im

(

X

)

is positive definite.

Proof. (a) Multiplying (9) by Ufrom the left we get

(5)

Subtracting the transpose of (21) from (21) and using (8) we have UV

−

VU

=

SUAU

−

UAUS

=

SUVS

−

SVUS

=

SUV

−

VUS

.

(22) Since

ρ(

S

) <

1, UV

=

VU. Then X

=

VU−1

=

U−

(

UV

)

U−1is a complex symmetric matrix.

(b) From (8) and (9), and noting that UV

=

VU, we have

λ

2_A

_{− λ}

_Q

₊

_A

=λ

2_U−_S_U_VU−1

_{− λ}

_VU−1

₊

_U−_S_U_VSU−1

₊

_VSU−1

=

I

− λ

USU−1 VU−1

−λ

I

+

USU−1

.

(23)

Since det

(λ

2A

− λ

Q

+

A

) =

det

(

M

− λ

L

) =

0 for every unimodular

λ

(by Lemma1), we know that X

=

VU−1is nonsingular.

(c) From (8) and (9) we have

A

=

X

(

USU−1

),

Q

−

X

=

A

(

USU−1

).

(24) Eliminating USU−1in (24) gives X

+

AX−1A

=

Q and we also have

ρ(

X−1A

) = ρ(

USU−1

) = ρ(

S

) <

1.

(d) From (13) it follows that

U−∗KU−1

=

iX∗

−

X

=

2Im

(

X

).

(25) So X2

≡

Im

(

X

)

is positive definite by Lemma2.

We have shown that the (unique) stabilizing solution of (1) must be complex symmetric, and that it has a positive definite imaginary part. When A is not a real matrix, it is quite possible that some other complex symmetric solutions of the Eq. (1) also have a positive definite imaginary part. In fact, for a real matrix A2and a real symmetric positive definite matrix Q2satisfying the assumption (3), the

equation Y

+

AT₂Y−1A2

=

Q2may have many positive definite solutions Y (see [3]). So for each such

Y , X

=

iY is a solution of X

+ (

iA2

)

X−1

(

iA2

) =

iQ2with a positive definite imaginary part.

We can also provide an elementary proof for the following statement proved in [3] using advanced results in operator theory: for a real matrix A2and a real symmetric positive definite matrix Q2

satisfy-ing the assumption (3), the equation Y

+

AT₂Y−1A2

=

Q2has a positive definite stabilizing solution Y .

In fact, we have already proved that the equation X

+ (

iA2

)

TX−1

(

iA2

) =

iQ2has a complex symmetric

stabilizing solution X with a positive definite imaginary part. We only need to show that the real part of X must be zero. Since A

=

iA2and Q

=

iQ2now, we have from (10) and (8) that

U∗QU

−

U∗V

= −

U∗A∗US

= −

S∗V∗US

.

(26) Taking conjugate transpose on (26) gives

−

U∗QU

−

V∗U

= −

S∗U∗VS

.

(27) It follows from (26) and (27) that

(

U∗V

+

V∗U

) −

S∗

(

U∗V

+

V∗U

)

S

=

0

.

(28) So U∗V

+

V∗U

=

0 since

ρ(

S

) <

1. Now 2Re

(

X

) =

X

+

X∗

=

U−∗

(

U∗V

+

V∗U

)

U−1

=

0.

3. Conclusions

We have provided an elementary proof of the existence of a (unique) complex symmetric stabilizing solution X for the nonlinear matrix equation (1) with assumption (3). Our new approach here has revealed that the imaginary part of X is positive definite. We also mention that the solution X can be

(6)

found efficiently by a doubling algorithm, as presented in [7, Algorithm 4.1]. A convergence result for the algorithm is given in [7, Theorem 4.1] for the Eq. (1) with the matrices A and Q having special block structures. However, those special structures were not used in the proof of convergence in [7]. So the statements in that theorem are also valid for our general Eq. (1) with assumption (3).

Acknowledgements

The authors thank Beatrice Meini and two referees for their helpful comments. In particular, com-ments by one referee significantly simplified the argucom-ments leading to the conclusion in Theorem3.

References

[1] C.-Y. Chiang, E.K.-W. Chu, C.-H. Guo, T.-M. Huang, W.-W. Lin, S.-F. Xu, Convergence analysis of the doubling algorithm for several nonlinear matrix equations in the critical case, SIAM J. Matrix Anal. Appl. 31 (2009) 227–247.

[2] S. Datta, Nanoscale device modeling: the Green’s function method, Superlattices and Microstructures 28 (2000) 253–278. [3] J.C. Engwerda, A.C.M. Ran, A.L. Rijkeboer, Necessary and sufficient conditions for the existence of a positive definite solution

of the matrix equation X+A∗X−1_A₌_{Q , Linear Algebra Appl. 186 (1993) 255–275.}

[4] I. Gohberg, S. Goldberg, M.A. Kaashoek, Classes of Linear Operators, vol. II, Operator Theory: Advances and Applications, vol. 63, Birkhäuser, 1993.

[5] C.-H. Guo, P. Lancaster, Iterative solution of two matrix equations, Math. Comp. 68 (1999) 1589–1603.

[6] C.-H. Guo, W.-W. Lin, The matrix equation X+AX−1A=Q and its application in nano research, SIAM J. Sci. Comput. 32

(2010) 3020–3038.

[7] C.-H. Guo, W.-W. Lin, Solving a structured quadratic eigenvalue problem by a structure-preserving doubling algorithm, SIAM J. Matrix Anal. Appl. 31 (2010) 2784–2801.

[8] D.L. John, D.L. Pulfrey, Green’s function calculations for semi-infinite carbon nanotubes, Phys. Status Solidi B Basic Solid State Phys. 243 (2006) 442–448.

[9] A. Kletsov, Y. Dahnovsky, J.V. Ortiz, Surface Green’s function calculations: a nonrecursive scheme with an infinite number of principal layers, J. Chem. Phys. 126 (2007) 5 (Article No. 134105).

[10] W.-W. Lin, S.-F. Xu, Convergence analysis of structure-preserving doubling algorithms for Riccati-type matrix equations, SIAM J. Matrix Anal. Appl. 28 (2006) 26–39.

[11] D.S. Mackey, N. Mackey, C. Mehl, V. Mehrmann, Structured polynomial eigenvalue problems: good vibrations from good linearizations, SIAM J. Matrix Anal. Appl. 28 (2006) 1029–1051.

[12] C. van der Mee, G. Rodriguez, S. Seatzu, LDU factorization results for bi-infinite and semi-infinite scalar and block Toeplitz matrices, Calcolo 33 (1996) 307–335.

[13] B. Meini, Efficient computation of the extreme solutions of X+A∗X−1_A₌_{Q and X}₋_A∗_X−1_A₌_{Q , Math. Comp. 71 (2002)} 1189–1204.

[14] H. Wielandt, On eigenvalues of sums of normal matrices, Pacific J. Math. 5 (1955) 633–638. [15] X. Zhan, J. Xie, On the matrix equation X+AX−1_A₌_{I, Linear Algebra Appl. 247 (1996) 337–345.}