Structured Quadraticization and Structure-Preserving Algorithm for Palindromic Polynomial Eigenvalue Problems

(1)

STRUCTURED QUADRATICIZATION AND

STRUCTURE-PRESERVING ALGORITHM FOR PALINDROMIC

POLYNOMIAL EIGENVALUE PROBLEMS∗

TSUNG-MING HUANG†_{, WEN-WEI LIN}‡_, _AND _{WEI-SHUO SU}§

Abstract. In this paper, we mainly propose a structured quadraticization to transform even degree palindromic polynomial eigenvalue problems (PPEPs) into palindromic quadratic eigenvalue problems (PQEPs), instead of the structured linearization to palindromic generalized eigenvalue problems (PGEPs). Then, the structure-preserving algorithm for solving PQEPs based on (S +

S−1_{)-transformation and Patel’s algorithm 1993 can be applied directly to solve PPEPs. Numerical} experiments show that the backward error for PPEP computed by PQEP is better than that by PGEP and the standard GEP (“polyeig” in MATLAB).

Key words. palindromic polynomial eigenvalue problem, palindromic quadratic eigenvalue problem, quadraticization, structure-preserving algorithm, (S + S−1_{)-transformation, Patel’s} algo-rithm, balance

AMS subject classifications. 65F15, 15A18, 15A57

1. Introduction. In this paper, we study the (?, ε)-palindromic polynomial eigenvalue problem (PPEP) with degree 2d of the form

P(λ)x ≡   d−1 X j=0 λ2d−j_A? d−j+ λdA0+ ε d X j=1 λd−j_A j   x = 0, (1.1) where λ ∈ C, x ∈ Cn_{\{0}, A}

j∈ Cn×n, for j = 0, 1, . . . , d, A?0= εA0, ε = ±1 and “?”

denotes either “H” (Hermition/conjugate transpose) or “T” (transpose). The scalar

λ and the nonzero vector x in (1.1) are the eigenvalue and the associated eigenvector

of P(λ), respectively. The equation (1.1) is also called a ?-PPEP if ε = 1 or a ?-anti-PPEP, if ε = −1.

The underlying matrix P(λ) in (1.1) has the property that reversing the order of the coefficients, followed by taking the (conjugate) transpose, leads to the original matrix polynomial (anti-)invariant, which explains the world “(anti-)palindromic”. Consequently, taking the (conjugate) transpose of (1.1), we easily see that the eigen-values of P(λ) satisfy a “symplectic” property, that is, they are paired with respect to the unit circle, containing both eigenvalues λ and 1/λ?_.

The (?, ε)-PPEPs arise in solving numerical solutions of higher order systems of ordinary and partial differential equations. A T-palindromic quadratic eigenvalue problem (T-PQEP) was first raised in the study of the vibration in the structural analysis for fast trains in Germany [5, 6], and then of the behavior of periodic surface acoustic wave (SAW) filters [21]. A H-PQEP was raised in the computation of the Crawford number by bisection and level set methods for detecting a definite Hermitian pair and a hyperbolic or elliptic QEP [4]. A typical even degree ?-PPEP is derived by

∗_{Version December 12, 2008}

†_{Department of Mathematics, National Taiwan Normal University, Taipei 11677, Taiwan;} [email protected]. This work is partially supported by the National Science Council and the National Center for Theoretical Sciences in Taiwan.

‡_{Department of Mathematics,} _{National Chiao-Tung University,} _{Hsinchu 300,} _Taiwan; [email protected]

§_{Department of Mathematics,} _{National Chiao-Tung University,} _{Hsinchu 300,} _Taiwan; [email protected]

(2)

solving linear quadratic discrete-time optimal control problem subject to the higher order discrete system [1, 20].

A standard approach for solving the (?, ε)-PPEP (1.1) is to transform it to a 2dn × 2dn linear eigenvalue problem

             εAd 0 · · · 0 0 −In .. . . .. 0 −In     + λ         εAd−1 · · · A0 · · · A?d In 0 .. . . .. ... .. . . .. ... 0 · · · · · · In 0                        x λx .. . λ2d−2_x λ2d−1_x        = 0 (1.2) and compute its generalized Schur forms (See e.g. [19]). However, the symplectic property of eigenvalues of (1.1) is not preserved by computation, generally, producing large numerical errors ([3, 7, 8]). Recently, some pioneering work [6, 12, 13] proposed a good linearization of the (?, ε)-PPEP which linearizes (1.1) into a palindromic lin-ear pencil λZ?_{+ Z and preserves symplecticity to some extent. This does lead to} a vast improvement over previous approaches. Later, a QR-like algorithm [16] and a Jacobi-type method [6] combined with the Laub’s trick, a postprocessing step of the generalized Schur form [11], have been proposed for solving the palindromic gen-eralized eigenvalue problem (PGEP) (λZT _{+ Z)x = 0. The latter method works} well, if there are no eigenvalues near ±1. The Jacobi-type method typically requires

O(n3_{log(n)) flops and the QR-like algorithm requires O(n}4_{) flops. Recently, a U RV}

-decomposition based structured method of cubic complexity was developed in [17] to solve PGEP, producing eigenvalues which are paired to working precision. On the other hand, for solving a (?, ε)-PQEP, a structure-preserving doubling algorithm was developed in [2, 3] via the computation of a solvent of a nonlinear matrix equation associated with the (?, ε)-PQEP. Recently, a numerically stable structure-preserving algorithm (SPA) based on (S + S−1_{)-transform [9] and Patel’s algorithm [14] was} proposed in [7] to solve the T-PQEP directly. The numerical results obtained by SPA show much promise and the computational cost is about a half of that of the

U RV -based method [17].

The purpose of this paper is to develop a new “quadraticization” which quadrati-cizes the even degree (?, ε)-PPEP (1.1) into a (?, ε)-PQEP. If ? = T and ε = 1, then we can directly apply the SPA algorithm [7] to the quadraticized T-PQEP for solving the T-PPEP. If ? = H and ε = ±1, we shall first transform the quadraticized (H, ε)-PQEP to an enlarged T-ε)-PQEP, and then apply the SPA algorithm for solving the (H, ε)-PPEP. If ? = T and ε = −1, then we apply the U RV -based method [15, 17] to the linearized T-PGEP for solving the (T, −1)-PPEP.

Throughout this paper, Cn×m_{is the set of all n×m complex matrices, C}n _{= C}n×1 and C = C1_{. In} _{and 0n×m} _{(or simply I and 0 if their dimensions are clear from the}

context) denote the n × n and n × m identity and zero matrices, respectively. ej denotes the jth column of an identity matrix.

This paper is organized as follows. In Section 2, we proposed a structured quadraticization to transform a (?, ε)-PPEP with the even degree into a (?, ε)-PQEP. In Section 3, we develop a structure-preserving algorithm for solving the transformed H-PQEP. We derive backward error bounds and the diagonal balance for PPEPs in Section 4 and 5, respectively. The comparisons of the numerical results computed by PQEP, PGEP and the standard GEP are presented in Section 6. Conclusions are

(3)

given in Section 7.

2. Structured Quadraticization of (?, ε)-palindromic polynomial eigen-value problems. Since the SPA algorithm in [7] is well-developed for solving a T-PQEP, it is motivated to propose a new structured “quadraticization” method which can be utilized to quadraticize a (?, ε)-PPEP (1.1) into a (?, ε)-PQEP.

In general, the structured quadraticization of a (?, ε)-PPEP has infinitely many solutions. However, unlike the unstructured linearization, the palindromic quadratic structure has much more feasibility to be kept. In practice, it is not so flexible to find a palindromic quadraticization of (1.1). In the following theorem we shall give a “simplest” form (to our knowledge) of the (?, ε)-PQEPs for the palindromic quadraticization of a (?, ε)-PPEP.

Theorem 2.1. An even degree (?, ε)-PPEP (1.1) with (?, ε) 6= (T, −1) can be

quadratized into a (?, ε)-PQEP of the form

Q(λ)y ≡¡λ2A?1+ λA0+ A1 ¢ y = 0 (2.1) with A? 0= εA0 and y =£ yT 1 · · · ydT ¤T , yi∈ Cn for each i, (2.2) where

(a) for 2d = 4m; A1 and A0 are given by

A1=               A1 0 · · · · · · · · · · · · d?mI ε√εd1A2m 0 · · · · · · · · · · · · 0 A2m−1 d?1I ... 0 −εd2I ... .. . . .. ... A3 d?m−1I ... 0 −εdmI 0               , (2.3a) A0=               A0− √ εI −√εA? 2mA2m 0 A?2m−2 0 · · · A?2 0 0 −√εd1d?1I ... εA2m−2 0 ... 0 0 ... .. . . .. ... εA2 0 0 0 · · · · · · · · · · · · · · · 0               (2.3b)

and {yi}2mi=1 satisfy

y3= λ2m−2y1, y2= 1 d? 1λ ¡ ε√ελ2m_{I + A} 2m ¢ y1, y5= λ2m−4y1, y4= 1 d? 2λ3 ¡ ε√ελ2mI + λ2A2m−2+ λA2m−1+ A2m ¢ y1, .. . y2m−1= λ2y1, y2m= 1 d? mλ2m−1 Ã ε√ελ2m_{I +} 2m−2_X i=0 λi_A 2m−i ! y1; (2.3c) 3

(4)

(b) for 2d = 4m + 2; A1 and A0 are given by A1=                 A1 0 · · · · · · · · · · · · · · · d?mI A2m+1 0 · · · · · · · · · · · · · · · 0 0 −εd1I ... A2m−1 d?1I ... 0 −εd2I ... .. . . .. ... A3 d?m−1I 0 0 0 −εdmI 0                 , (2.4a) A0=                  A0 A?2m 0 A?2m−2 0 · · · A?2 0 εA2m 0 ... 0 0 ... εA2m−2 0 ... 0 0 ... .. . . .. ... εA2 0 0 0 · · · · · · · · · · · · · · · · · · 0                  (2.4b)

and {yi}2m+1i=1 satisfy

y2= λ2my1, y3= 1 d? 1λ2 (λA2m+ A2m−1) y1, y4= λ2m−2y1, y5= 1 d? 2λ4 ¡ λ3_A 2m−2+ λ2A2m−1+ λA2m+ A2m+1 ¢ y1, .. . y2m= λ2y1, y2m+1= 1 d? mλ2m Ã_2m−1 X i=0 λi_A 2m−i+1 ! y1; (2.4c)

in which d1, . . . , dm are some suitable nonzero constants.

Proof. (a) Substituting the partition of y in (2.2) into (2.1),we get

0 =£λ2A?1+ λ ¡ A0− √ εI −√εA?2mA2m ¢ + εA1 ¤ y1+ λ2 √ εd?1A?2my2 + m−1_X j=1 ¡ λ2A?2m−2j+1+ λA?2m−2j ¢ y2j+1+ εd?my2m, (2.5a) 0 = λ2d1y3− λ √ εd1d?1y2+ √ εd1A2my1, (2.5b) 0 = −ελ2_d? jy2j+ ε(λA2m−2j+2+ A2m−2j+3)y1+ εd?j−1y2j−2, for j = 2, . . . , m, (2.5c) 0 = dm−j+1(y2m−2j+1− λ2y2m−2j+3), for j = 2, . . . , m − 1, (2.5d) 0 = dm(y2m−1− λ2y1). (2.5e)

Substituting (2.5e) into (2.5d), it follows that

y2m−2j+1= λ2jy1, for j = 1, . . . , m − 1. (2.6) 4

(5)

Substituting (2.6) with j = m − 1 into (2.5b), it follows that d? 1y2= 1 λ ¡ λ2m_ε√_{εI + A} 2m ¢ y1. (2.7)

Next, we shall show that

d? jy2j= λ1−2j Ã ε√ελ2m_{I +} 2j−2_X i=0 λi_A 2m−i ! y1, for j = 2, 3, . . . , m. (2.8)

Substituting (2.7), and (2.6) with j = m − 1 into (2.5c) with j = 2, we get

d? 2y4= λ−3 ¡ ε√ελ2m_{I + λ}2_A 2m−2+ λA2m−1+ A2m ¢ y1,

which implies that (2.8) holds for j = 2. Assume that (2.8) holds for j = `, i.e.,

d? `y2`= λ1−2` Ã ε√ελ2m_{I +} 2`−2_X i=0 λi_A 2m−i ! y1. (2.9)

Substituting (2.9), and (2.6) with j = m − ` into (2.5c) with j = ` + 1, we get

λ2_d? `+1y2`+2= (λA2m−2`+ A2m−2`+1)y1+ λ1−2` Ã ε√ελ2m_{I +} 2`−2_X i=0 λi_A 2m−i ! y1 = λ1−2` " ε√ελ2mI + 2` X i=0 λiA2m−i # y1,

which implies that (2.8) also holds for j = ` + 1. By mathematical induction, we proved (2.3c).

Finally, substituting (2.6), (2.7), and (2.8) with j = m into (2.5a), we have 0 =£λ2A?1+ λ ¡ A0− √ εI −√εA?2mA2m ¢ + εA1 ¤ y1+ λ2 √ εA?2m · 1 λ ¡ λ2mε√εI + A2m ¢¸ y1 + m−1_X j=1 ¡ λ2A?2m−2j+1+ λA?2m−2j ¢ ¡ λ2m−2jy1 ¢ + ελ1−2m Ã ε√ελ2mI + 2m−2_X i=0 λiA2m−i ! y1 = λ1−2m  2m−1X j=0 λ4m−jA?2m−j+ λ2mA0+ ε 2m X j=1 λ2m−jAj   y1,

or equivalently P(λ)y1= 0 with 2d = 4m. This proved (2.3a) and (2.3b).

(b) Substituting the partition of y in (2.2) into (2.1), we get 0 =£λ2A?1+ λA0+ εA1 ¤ y1+ εd?my2m+1+ m X j=1 ¡ λ2A?2m−2j+3+ λA?2m−2j+2 ¢ y2j, (2.10a) 0 = ε(λA2m+ A2m+1)y1− ελ2d?1y3, (2.10b) 0 = dj(y2j− λ2y2j+2), for j = 1, . . . , m − 1, (2.10c) 0 = ε(λA2m−2j+2+ A2m−2j+3)y1+ εd?j−1y2j−1− ελ2d?jy2j+1, for j = 2, . . . , m, (2.10d) 0 = dm(y2m− λ2y1). (2.10e) 5

(6)

Substituting (2.10e) into (2.10c), it follows that y2m−2j= λ2j+2y1, for j = 1, . . . , m − 1. (2.11) From (2.10b), we have d? 1y3= 1 λ2(λA2m+ A2m+1) y1. (2.12)

Next, we shall show that

d? jy2j+1= λ−2j 2j−1_X i=0 λi_A 2m−i+1y1, for j = 2, 3, . . . , m. (2.13)

Substituting (2.12), and (2.11) with j = m − 2 into (2.10d) with j = 2, we get

d? 2y5= λ−4 ¡ λ3_A 2m−2+ λ2A2m−1+ λA2m+ A2m+1 ¢ y1,

which implies that (2.13) holds for j = 2. Assume that (2.13) holds for j = `, i.e.,

d? `y2`+1= λ−2` 2`−1_X i=0 λi_A 2m−i+1y1. (2.14)

Substituting (2.14), and (2.11) with j = m − ` − 1 into (2.10d) with j = ` + 1, we get

λ2_d? `+1y2`+3= (λA2m−2`+ A2m−2`+1)y1+ λ−2` 2`−1_X i=0 λi_A 2m−i+1y1 = λ−2` 2`+1_X i=0 λi_A 2m−i+1y1,

which implies that (2.13) also holds for j = ` + 1. By mathematical induction, we proved (2.4c).

Finally, substituting (2.11), (2.12), and (2.13) with j = m into (2.10a), we have

0 =£λ2_A? 1+ λA0+ εA1 ¤ y1+ ελ−2m 2m−1_X i=0 λi_A 2m−i+1y1 + m X j=1 ¡ λ2_A? 2m−2j+3+ λA?2m−2j+2 ¢ ¡ λ2m−2j+2_y 1 ¢ = λ−2m   2m X j=0 λ4m−j+2_A? 2m−j+1+ λ2m+1A0+ ε 2m+1_X j=1 λ2m−j+1_A j   y1,

or equivalently P(λ)y1= 0 with 2d = 4m + 2. This proved (2.4a) and (2.4b).

As discussed in the beginning of this section, in general, there are infinitely many structured quadraticizations of a (?, ε)-PPEP with even degree. Theorem 2.1 gives a solution of structured quadraticization for (1.1). As in [12, 13], for a given P(λ) in (1.1), a structured quadraticization Q(λ) of P(λ) can be solved by finding a nonzero

h ∈ Cd _{and H(λ) ∈ C}dn×n _{with degree 2d − 2 such that}

Q(λ)H(λ) = h ⊗ P(λ), (2.15)

(7)

where Q(λ) is a dn × dn (?, ε)-PQEP and ⊗ denotes the Kronecker product. If

z ∈ Cdn_{\{0} is a left eigenvector of Q(λ), we then have}

0 = zTQ(λ)H(λ) = zT(h ⊗ P(λ)) = zT(h ⊗ In)P(λ). (2.16) Thus zT_{(h ⊗ In) is a left eigenvector of P(λ).}

Similarly, a structured quadraticization Q(λ) of P(λ) can also be solved by finding a nonzero g ∈ Cd _{and G(λ) ∈ C}n×dn _{with degree 2d − 2 such that}

G(λ)Q(λ) = gT _{⊗ P(λ).} _(2.17)

If y ∈ Cdn_{\{0} is a right eigenvector of Q(λ), then we have}

0 = G(λ)Q(λ)y = gT _{⊗ P(λ)y = P(λ)(g}T _{⊗ I}_n)y. _(2.18) Thus, (gT _{⊗ I}_{n)y is a right eigenvector of P(λ).}

The structured quadraticization Q(λ) defined in (2.3) and (2.4) must satisfy (2.15) and (2.17), respectively. In fact, from Theorem 2.1 for the case 2d = 4m, we define

h = e1∈ Cd and H(λ) =£H1(λ)T, · · · , H2m(λ)T ¤T (2.19a) with H1(λ) = λ2m−1In, H2`+1(λ) = λ4m−2`−1In, ` = 1, . . . , m − 1, (2.19b) H2`(λ) = λ 2(m−`) d? ` Ã ε√ελ2m_I n+ 2`−2_X i=0 λi_A 2m−i ! , ` = 1, . . . , m (2.19c)

which solves (2.15) with Q(λ) being defined in (2.3); for the case 2d = 4m + 2, we define h = e1∈ Cd and H(λ) =£H1(λ)T, · · · , H2m+1(λ)T ¤T (2.20a) with H1(λ) = λ2mIn, H2`(λ) = λ4m−2`+2In, ` = 1, . . . , m − 1, (2.20b) H2`+1(λ) = λ 2(m−`) d? ` Ã_2`−1 X i=0 λi_A 2m−i+1 ! , ` = 1, . . . , m (2.20c)

which solves (2.15) with Q(λ) being defined in (2.4).

To solve (2.17) for both cases 2d = 4m and 4m + 2, by “palindromic” property of P(λ), (2.19) and (2.20) it holds that

Q?_{(λ) (H}?_(λ))T _{= ed}_{⊗ P}?_(λ). _(2.21) This implies that

HT(λ)Q(λ) = eTd ⊗ P(λ) (2.22)

which solves (2.17).

(8)

Remark 2.1. Theorem 2.1 can not be applied to the case when (?, ε) = (T, −1).

In fact, up to now, there is no structure-preserving algorithm to solve a (T, −1)-PQEP. So, it is useless to transform a (T, −1)-PPEP to a (T, −1)-−1)-PQEP. Thus, for a (T, −1)-PPEP we can apply the structured linearization [12] to transform it into a PGEP: λZT _{+ Z, and then we rewrite it as a T-PQEP}

b

λ2_ZT _{+ b}_{λ0 + Z} _(2.23)

by letting bλ2_{= λ. Then the structure-preserving algorithm proposed by [7] can be used} to solve (2.23).

The following theorem shows that an H-anti-PQEP can be transformed to an H-PQEP.

Theorem 2.2. Given an H-anti-PQEP (λ2_AH

1 + λA0− A1)x = 0, (2.24)

where A1, A0∈ Cn×n with AH0 = −A0. Then (ıω, x) is an eigenpair of (2.24) if and only if (ω, x) is an eigenpair of the H-PQEP

£

ω2(−A1)H+ ω(ıA0) + (−A1)

¤

x = 0.

Proof. The theorem holds by setting λ = ıω and using the fact that (ıA0)H = ıA0.

even and odd polynomial eigenvalue problems. We now consider

?-even and ?-odd PEP with ?-even degree of the form ¡

λ2d_A

2d+ λ2d−1A2d−1+ · · · + λ2A2+ λA1+ A0

¢

x = 0, (2.25)

where Aj ∈ Cn×n _{for j = 0, 1, . . . , 2d and A}

2d6= 0.

The equation (2.25) is called a ?-even PEP, if

A?

2i = A2i, for i = 0, 1, . . . , d, A?2i−1 = −A2i−1, for i = 1, . . . , d,

and the equation (2.25) is called a ?-odd PEP, if

A?

2i= −A2i, for i = 0, 1, . . . , d, A?

2i−1= A2i−1, for i = 1, . . . , d.

By the Cayley transformation

λ = µ + 1

µ − 1, (2.26)

it was shown in [12] that an ?-even/odd PEP can be transformed to a (?, +1/ − 1)-PPEP, respectively. By substituting (2.26) into (2.25) we can derive an explicit

(9)

representation of (?, ε)-PPEP for the transformation as follows: 0 = (µ − 1)2d  X2d j=0 µ µ + 1 µ − 1 ¶j Aj   x =  X2d j=0 (µ + 1)j_{(µ − 1)}k−j_A j   x =   2d X j=0 (−1)2d−j 2d X i=0 µi_(γij_A_j)   x =   2d X j=0 (−1)j 2d X i=0 µi_(γij_A j)   x =   2d X i=0 µi   2d X j=0 (−1)j_γ ijAj     x =    d−1 X i=0 µ2d−i   2d X j=0 (−1)j(γ2d−i,jAj)   + µd   2d X j=0 (−1)j(γd,jAj)   + d−1 X i=0 µi  X2d j=0 (−1)j(γijAj)     x, (2.27) where 2d X i=0 γijµi≡ (1 + µ)j(1 − µ)2d−j= " _j X `=0 µ j ` ¶ µ` # "_2d−j X i=0 µ 2d − j i ¶ (−µ)i # (2.28) with µ j ` ¶ =    j(j−1)···(j−`+1) 1·2···` , if 1 ≤ ` ≤ j, 1, if ` = 0, 0, otherwise, (2.29)

and γij in (2.28) satisfy

γ2d−i,j = (−1)jγij, for j = 0, 1, . . . , 2d, i = 0, . . . , d − 1, (2.30a)

γd,j= 0, for j = 1, 3, . . . , 2d − 1. (2.30b) Therefore, we proved the following theorem.

Theorem 2.3. If (2.25) is a even or a odd PEP, then (2.27) is a ?-palindromic or a ?-anti-?-palindromic PEP, respectively.

We illustrate the relations between various structured PEPs in Figure 2.1. From Figure 2.1, we see that T-even, T-odd, T-anti-palindromic and T-palindromic PEPs with even degree can be reduced to T-PQEPs. Thus, the structure-preserving methods in [7] can be used to solve the associated T-PQEPs. On the other hand, we see that H-even, H-odd, H-anti-palindromic and H-palindromic PEPs with even degree can be reduced to H-PQEPs. Therefore, it is motivated to develop a structure-preserving algorithm in the next section for solving the H-PQEP.

3. H-palindromic quadratic eigenvalue problems. Consider the H-PQEP

Q(λ)x ≡ (λ2AH1 + λA0+ A1)x = 0, (3.1)

where A1, A0 ∈ Cn×n with AH0 = A0. Taking the conjugate transpose of (3.1), we

easily see that the eigenvalues of Q(λ) have a ‘symplectic’ property, that is, they are

(10)

H−even PEP H−palindromic PEP T−even PEP PEP T−palindromic PEP H−anti−palindromic PEP H−palindromic QEP T−palindromic QEP H−anti−palindromic QEP H−palindromic QEP Theorem 2.3 Theorem 2.3 Theorem 2.1 Theorem 2.1 Theorem 2.1 Theorem 2.2 H−odd T−odd PEP Theorem 2.3 T−palindromic QEP T−palindromic GEP PEP Theorem 2.3 T−anti−palindromic [12, 13] (2.23)

Fig. 2.1. Relations between various structured PEPs.

paired with respect to the unit circle, containing both eigenvalues λ and its conjugate reciprocal 1/¯λ.

Classical linearizations of (3.1) in a companion form, generally, cannot preserve the symplectic structure. Fortunately, the special linearization of (3.1) (see [2] or [7])

(M − λL)z ≡ µ· A1 0 −A0 −I ¸ − λ · 0 I AH 1 0 ¸¶ · x y ¸ = 0 (3.2) satisfies MJ MH_{= LJ L}H _(3.3)

which preserves symplecticity, where J ≡ J2n is the 2n × 2n matrix

·

0 In

−In 0 ¸

. A pencil M − λL or the matrix pair (M, L) satisfying (3.3) is called H-symplectic.

For any real symplectic matrix pair (M, L) satisfying (3.3), a structure-preserving (S + S−1_{)-transform for the computation of all its eigenvalues was proposed by [9].} For a H-symplectic matrix pair (M, L), the (S + S−1_{)-transform (Ms}_{, L}_{s) is defined} by

Ms≡ MJ LH+ LJ MH≡ KJ , Ls≡ LJ LH ≡ N J . (3.4) It is easily seen that K and N are both complex conjugate skew-Hamiltonian, i.e.,

KJ = J KH _{and N J = J N}H_{. Hence, if µ is an eigenvalue of (K, N ), so is ¯}_µ. We first give the relationship between eigenpairs of a H-symplectic matrix pair and its (S + S−1_)-transform.

Theorem 3.1. Let (M, L) be H-symplectic and (Ms, Ls) be its (S + S−1

)-transform as in (3.4). Suppose zs is an eigenvector of (Ms, Ls) corresponding to

µ = ν +1

ν. If (LH−ν1MH)zs6= 0 or (LH− νMH)zs6= 0, then (ν, J (LH−1νMH)zs)

or (1/ν, J (LH_{− νM}H_{)zs) is an eigenpair of (M, L).}

Proof. As in [9], since MJ MH_{= LJ L}H_{, by (3.4) it holds that} 0 = [Ms− µLs] zs= · MJ LH_{+ LJ M}H_{− (ν +}1 ν)LJ L H ¸ zs = (M − νL)J (LH₋1 νM H_)zs _(3.5a) = (M −1 νL)J (L H_{− νM}H_)zs_. _(3.5b) 10

(11)

By applying (3.5a), if (LH₋1

νMH)zs6= 0, then J (LH−1νMH)zs is an eigenvector of (M, L) corresponding to ν. By applying (3.5b), if (LH _{− νM}H_)zs _{6= 0, then}

J (LH_{− νM}H_)zs_{is an eigenvector of (M, L) corresponding to 1/ν.}

If (M, L) is of the form in (3.2), we give the relationship between eigenpairs of (Ms, Ls) and Q(λ) in the following theorem.

Theorem 3.2. Let (M, L) be H-symplectic of the form in (3.2) and (Ms, Ls)

be its (S + S−1_{)-transform. Suppose that the columns of Zs} _{≡ [Z}T

s1, Zs2T]T with

Zs1, Zs2 ∈ Cn×p and p ≥ 1 form all linearly independent eigenvectors of (Ms, Ls)

corresponding to the eigenvalue µ 6= ±2. Let ν be a root of the quadratic equation λ +1

λ = µ. Then: (i) if rank(Zs1+1

νZs2) ≥ 1 and rank(Zs1+ νZs2) ≥ 1, then the linearly independent

columns of Zs1+1_νZs2and Zs1+νZs2are eigenvectors of Q(λ) corresponding

to ν and 1

ν, respectively. Furthermore,

Null(Zs1+1

νZs2) ∩ Null(Zs1+ νZs2) = {0}. (3.6) Note that Null(B) denotes the null space of B; and

(ii) if Zs1+ νZs2 = 0, then the columns of Zs1+ 1

νZs2 are linearly independent

eigenvectors of Q(λ) corresponding to ν. Proof. (i). From (3.5a) and (3.5b), it follows that

(M − νL) · Zs1+_ν1Zs2 Xν ¸ = 0 (3.7a) and (M −1 νL) · Zs1+ νZs2 X1/ν ¸ = 0 (3.7b) where Xν≡ 1 νA H 1Zs1− 1 νA0Zs2− A1Zs2, X1/ν ≡ νA H 1 Zs1− νA0Zs2− A1Zs2. (3.8) Since (M, L) is of the form in (3.2), from (3.7a), we have

Xν= 1 νA1(Zs1+ 1 νZs2) (3.9a) and A0(Zs1+1 νZs2) + Xν+ νA H 1 (Zs1+ 1 νZs2) = 0. (3.9b)

Substituting Xν in (3.9a) into (3.9b) and multiplying (3.9b) by ν we get

Q(ν)(Zs1+1

νZs2) = 0.

Similarly, from (3.7b), we have Q(1

ν)(Zs1+ νZs2) = 0.

To show (3.6), suppose that there is a U1 ∈ Cp×m with rank(U1) = m ≥ 1 such

that

(Zs1+1

νZs2)U1= 0 and (Zs1+ νZs2)U1= 0. (3.10) 11

(12)

This implies Zs1U1= Zs2U1= 0, i.e., ZsU1= 0. On the other hand, since Zsand U1

are of full rank, ZsU16= 0. This is a contradiction.

(ii) Since Zs1+ νZs2= 0, from (3.2) and (3.7b), it follows that X1/ν= 0.

There-fore, 1

ν is not an eigenvalue of (M, L) and Q(λ). We now show that Zs1+ 1

νZs2 is of full rank. Suppose not, i.e., there is a U1 ∈ Cp×m _{with rank(U} 1) = m ≥ 1 such that µ Zs1+1 νZs2 ¶ U1= 0. (3.11)

Then, it holds that Zs1U1= −_ν1Zs2U16= 0, because Zsand U1are of full rank. Since ν 6= ±1, we have Zs1U1+νZs2U1= (1−ν2)Zs1U16= 0. From (3.7b) it follows that the

nonzero columns of Zs1U1+νZs2U1are eigenvectors of Q(λ) corresponding to_ν1. This

is a contradiction, since 1

ν is not an eigenvalue of Q(λ). From (i) it follows that the columns of Zs1+1

νZs2form linearly independent eigenvectors of Q(λ) corresponding to ν.

Structure-Preserving Algorithm. In [7], a structure-preserving algorithm (SPA) based on Patel’s algorithm [14] has developed for solving T-PQEPs. In or-der to solve a H-PQEP, we apply the (S + S−1_{)-transform to the H-symplectic pair} (M, L) of the form (3.2) and get the GEP

Kzs= µN zs, (3.12) where K = · A0 AH1 − A1 A1− AH1 A0 ¸ , N = · −A1 0 0 −AH 1 ¸ (3.13) are H-skew-Hamiltonian. However, Patel’s algorithm can only be used to solve (3.12) in the real case, but not be applied to solve (3.12) in the complex conjugate case directly. In the following, we shall convert (K, N ) in (3.13) into an enlarged real T-skew-Hamiltonian pair so that Patel’s algorithm can be applied.

Let

A0= A0R+ ıA0I, A1= A1R+ ıA1I,

where A0R, A0I, A1R, A1I ∈ Rn×n with AT0R= A0R and AT0I = −A0I. Then K and N

in (3.13) can be represented by K = · A0R AT1R− A1R A1R− AT1R A0R ¸ + ı · A0I −(A1I+ AT1I) A1I+ AT1I A0I ¸ ≡ KR+ ıKI (3.14) and N = · −A1R 0 0 −AT 1R ¸ + ı · −A1I 0 0 AT 1I ¸ ≡ NR+ ıNI. (3.15) We extend (K, N ) to a real 4n × 4n matrix pair (K2, N2) by

K2= · KR −KI KI KR ¸ and N2= · NR −NI NI NR ¸ ∈ R4n×4n_. _(3.16) 12

(13)

From (3.16) it is easily seen that if µ is an eigenvalue of (K, N ), then µ and ¯µ are

eigenvalues of (K2, N2).

Theorem 3.3. All eigenvalues of (K2, N2) are double.

Proof. Substituting matrices KR, KI, NR, NI in (3.14) and (3.15) into (3.16), we get K2=     A0R AT1R− A1R −A0I A1I+ AT1I A1R− AT1R A0R −(A1I+ AT1I) −A0I A0I −(A1I + AT1I) A0R AT1R− A1R A1I+ AT1I A0I A1R− AT1R A0R     and N2=     −A1R 0 A1I 0 0 −AT 1R 0 −AT1I −A1I 0 −A1R 0 0 AT 1I 0 −AT1R     . Define Π = diag{In, · 0 In In 0 ¸

, In}. Then it holds that

e K2≡ ΠK2Π =     A0R −A0I AT1R− A1R A1I+ AT1I A0I A0R −(A1I+ AT1I) AT1R− A1R A1R− AT1R −(A1I+ AT1I) A0R −A0I A1I+ AT1I A1R− AT1R A0I A0R     (3.17a) and e N2≡ ΠN2Π =     −A1R A1I 0 0 −A1I −A1R 0 0 0 0 −AT 1R −AT1I 0 0 AT 1I −AT1R     . (3.17b)

It is easy to check that eK2 and eN2 are real skew-Hamiltonian, i.e., ( eK2J4n)T = − eK2J4n and ( eN2J4n)T = − eN2J4n. Therefore, from the result of [9] it follows that

all eigenvalues of ( eK2, eN2) are double.

From (3.4), we see that µ is an eigenvalue of (K, N ) if and only if ¯µ is an eigenvalue

of (K, N ). The relationship between eigenpairs of (K, N ) and (K2, N2) is given in the

following theorem. Theorem 3.4.

(i) If (K, N ) has an eigenvalue α + ıβ associated with the eigenvector x + ıy, where

α, β ∈ R and x, y ∈ R2n_{, then} · x y ¸ ± ı · y −x ¸ is an eigenvector of (K2, N2) corresponding to α ± ıβ.

(ii) If (K2, N2) has an eigenvalue α + ıβ associated with the eigenvector

· x1 x2 ¸ + ı · y1 y2 ¸

, then x + ıy is an eigenvector of (K, N ) corresponding to α + ıβ with x = x1− y2, y = x2+ y1.

(14)

Proof. (i) Since K(x + ıy) = (α + ıβ)N (x + ıy), comparing the real and the

imaginary parts of both sides, we come to

K2 µ· x y ¸ ± ı · y −x ¸¶ = (α ± ıβ)N2 µ· x y ¸ ± ı · y −x ¸¶ . (ii) Since · x1 x2 ¸ + ı · y1 y2 ¸ is an eigenvector of (K2, N2) corresponding to α + ıβ, it holds that KRx1− KIx2= α(NRx1− NIx2) − β(NRy1− NIy2), (3.18a) KRy1− KIy2= β(NRx1− NIx2) + α(NRy1− NIy2), (3.18b) KIx1+ KRx2= α(NIx1+ NRx2) − β(NIy1+ NRy2), (3.18c) KIy1+ KRy2= β(NIx1+ NRx2) + α(NIy1+ NRy2). (3.18d)

Set x = x1− y2and y = x2+ y1. Subtracting (3.18a) from (3.18d) and adding (3.18c)

and (3.18b), we get

KRx − KIy = α(NRx − NIy) − β(NIx + NRy), (3.19a)

KIx + KRy = β(NRx − NIy) + α(NIx + NRy). (3.19b) Thus, x + ıy is an eigenvector of (K, N ) corresponding to α + ıβ.

From Theorem 3.3 and Theorem 3.4, the eigenpairs of Kzs= µN zscan be com-puted from the eigenpairs of eK2z = µ e˜ N2z where e˜ K2 and eN2 are defined in (3.17).

Since eK2 and eN2 are both real skew-Hamiltonian, based on Patel’s approach [7, 14],

the pair ( eK2, eN2) can be reduced to block upper triangular forms

e K2:= QTKe2Z = · K11 K12 0 KT 11 ¸ , Ne2:= QTNe2Z = · N11 N12 0 NT 11 ¸ , (3.20)

where Q, Z ∈ R4n×4n _{are orthogonal satisfying}

Q = J4nTZJ4n, (3.21)

and K11, N11∈ R2n×2n are upper Hessenberg and upper triangular, respectively.

From Theorem 3.3 and (3.20), we see that the pair (K11, N11) has the same

spectrum as ( eK2, eN2). We then apply the QZ algorithm to (K11, N11) for computing

all eigenpairs {(µj, ˜zj)}2n j=1. Consequently, {(µj, ΠZ · ˜ zj 0 ¸ )}2n j=1 are 2n eigenpairs of (K2, N2), where Π is defined in (3.17). Let µj= αj+ ıβj and

ΠZ · ˜ zj 0 ¸ ≡ · x1j x2j ¸ + ı · y1j y2j ¸

with αj, βj∈ R and x1j, x2j, y1j, y2j∈ R2n. From Theorem 3.4, {(αj+ ıβj, JT(x1j− y2j+ı(x2j+y1j)))}2nj=1are eigenpairs of (Ms, Ls). Finally, we compute all eigenvalues and the associated eigenvectors of Q(λ) by Theorem 3.2.

(15)

Algorithm 3.1 (Structure-Preserving Algorithm (SPA) for H-PQEP).

Input: A H-palindromic quadratic pencil Q(λ) ≡ λ2_AH

1 + λA0+ A1 with A0, A1∈ Cn×n and AH0 = A0;

Output: All eigenvalues and eigenvectors of Q(λ). Step 1. Form the matrix pair ( eK2, eN2) as in (3.17);

Step 2. Reduce ( eK2, eN2) to block upper triangular forms as in (3.20); Step 3. Compute eigenpairs {(µj, ˜zj)}2nj=1 of (K11, N11) defined in (3.20)

by the QZ algorithm; Step 4. Compute ΠZ · ˜ zj 0 ¸ ≡ · x1j x2j ¸ + ı · y1j y2j ¸ , j = 1, 2, . . . , 2n;

Step 5. Compute the eigenpair (µj, zj), for j = 1, 2, . . . , 2n, of (Ms, Ls) by

zj = JT(x1j− y2j+ ı(x2j+ y1j)) ≡ [zj1T, zj2T]T;

Step 6. Compute νj and _ν1_j by solving ν2− µjν + 1 = 0;

Compute xj1≡ zj1+_ν1_jzj2 and xj2≡ zj1+ νjzj2for j = 1, 2, . . . , 2n;

Step 7. If xj1is a nonzero vector, then it is an eigenvector of Q(λ)

corresponding to νj; If xj2is a nonzero vector, then it is an eigenvector

of Q(λ) corresponding to 1

νj;

4. Backward error analysis. Let (λ, x) be an approximate eigenpair of a (?, ε)-palindromic polynomial pencil P(λ) in (1.1). We define the backward error of P(λ) by η(λ, x) := _hP kP(λ)xk d j=1(|λ|d+j+ |λ|d−j) kAjk + |λ|dkA0k i kxk (4.1)

essentially as in [18]. For a (?, PPEP with degree 2d, Theorem 2.1 gives a (?, ε)-palindromic quadraticization Q(λ) ≡ λ2_A?

1+ λA0+ A1 of the form (2.3) or (2.4).

Suppose (λ, y ≡ [yT

1, · · · , ydT]T) is an approximate eigenpair of Q(λ) associated with the residual r ≡ [rT

1, · · · , rdT]T. In what follows we shall study the relation between residuals r ≡ Q(λ)y and P2d(λ)y1.

For simplification, we first consider cases 2d = 8 and 10 with ε = 1. For 2d = 8: From (2.3) we have

   λ2     A? 1 d?1A?4 A?3 0 0 0 d1I 0 0 0 0 −d? 2I d2I 0 0 0     + λ     A0− I − A?4A4 0 A?2 0 0 −d1d?1I 0 0 A2 0 0 0 0 0 0 0     +     A1 0 0 d?2I d1A4 0 0 0 A3 d?1I 0 0 0 0 −d2I 0             y1 y2 y3 y4     =     r1 r2 r3 r4     , (4.2)

where d1 and d2are nonzero scalars. It follows that

r1= [λ2A?1+ λ(A0− I − A?4A4) + A1]y1+ λ2A4?d?1y2+ (λ2A?3+ λA?2)y3+ d?2y4, (4.3a) r2= d1A4y1− λd1d?1y2+ λ2d1y3, (4.3b) r3= (λA2+ A3)y1+ d?1y2− λ2d?2y4, (4.3c) r4= λ2d2y1− d2y3. (4.3d) 15

(16)

Then we have y3= λ2y1− d−12 r4, (4.4a) d? 1y2= 1 λ(λ 4_{I + A} 4)y1− λd−12 r4−1 λd −1 1 r2, (4.4b) d?2y4= 1 λ3(λ 4_{I + λ}2_A 2+ λA3+ A4)y1−1 λd −1 2 r4− 1 λ3d −1 1 r2− 1 λ2r3. (4.4c)

Substituting (4.4) into (4.3a) we get

P8(λ)y1= d−11 (λ4A?4+ I)r2+ λ2d−12 (λ4A?4+ λ3A?3+ λ2A?2+ I)r4+ λr3+ λ3r1. (4.5)

For 2d = 10: From (2.4) we have      λ 2       A? 1 A?5 0 A?3 0 0 0 −d? 1I 0 0 0 0 0 d1I 0 0 0 0 0 −d? 2I d2I 0 0 0 0      + λ       A0 A?4 0 A?2 0 A4 0 0 0 0 0 0 0 0 0 A2 0 0 0 0 0 0 0 0 0       +       A1 0 0 0 d?2I A5 0 0 0 0 0 −d1I 0 0 0 A3 0 d?1I 0 0 0 0 0 −d2I 0                   y1 y2 y3 y4 y5      =       r1 r2 r3 r4 r5      , (4.6)

where d1 and d2are nonzero scalars. It follows that

r1= (λ2A?1+ λA0+ A1)y1+ (λ2A?5+ λA?4)y2+ (λ2A?3+ λA?2)y4+ d?2y5, (4.7a)

r2= (λA4+ A5)y1− λ2d?1y3, (4.7b) r3= λ2d1y4− d1y2, (4.7c) r4= (λA2+ A3)y1+ d?1y3− λ2d?2y5, (4.7d) r5= λ2d2y1− d2y4. (4.7e) Then we have y4= λ2y1− d−12 r5, (4.8a) y2= λ4y1− λ2d2−1r5− d−11 r3, (4.8b) d? 1y3= 1 λ2(λA4+ A5)y1− 1 λ2r2, (4.8c) d? 2y5= 1 λ4(λ 3_A 2+ λ2A3+ λA4+ A5)y1− 1 λ4r2− 1 λ2r4. (4.8d)

Substituting (4.8) into (4.7a), we get

P10(λ)y1= λ5d−12 (λ3A?5+ λ2A?4+ λA?3+ A?2)r5+ λ5d−11 (λA?5+ A?4)r3

+(λ4_r

1+ λ2r4+ r2). (4.9)

In general, for 2d = 4m and 4m + 2 we have the following theorem. Theorem 4.1. Let (λ, y ≡ [yT

1, · · · , ydT]T) be an approximate eigenpair of (2.1)

associated with residual r ≡ [rT

1, · · · , rdT]T. Then

(17)

(i) for 2d = 4m, we have P4m(λ)y1= m−1_X i=0 λ2i_d−1 i+1   2i X j=0 λ2m−j_A? 2m−j+ √ εI   r2i+2 +λ2m−1 m−1_X i=1 λ−2i_r 2m−2i+1+ λ2m−1r1≡ ˜r0; (4.10)

(ii) for 2d = 4m + 2, we have

P4m+2(λ)y1= λ2m+1 m X j=1 d−1 j   2m X i=2m−2j+1 λi−2m+2j−1_A? i+1   rj+1 + m X i=1 λ2i−2_r 2i+ λ2mr1≡ ˜r2. (4.11)

Proof. (A sketch) For both cases we consider the equation

(λ2_A?

1+ A0+ A1)y = r, (4.12)

where A0 and A1 are given in (2.3) or (2.4). Let y ≡ [y1T, · · · , ydT]T and r ≡ [rT

1, · · · , rdT]T. As in the derivation of (4.5) and (4.9) we first derive the represen-tations of y2, . . . , yd by {ri}di=2 and y1 from the second to the last equations of (4.12).

Then, substituting y2, . . . , yd into the first equation of (4.12) and arranging the suit-able terms carefully, we get (4.10) and (4.11). Here we omit the details. A tedious proof can be found in Appendix.

On the other hand, suppose (ˆλ, ˆy) with ˆy = [ˆyT

1, · · · , ˆy2dT]T is an approximate

eigenpair of (1.2) with ε = 1 and ˆr = [ˆr1, · · · , ˆrT2d]T is the associated residual. That

is,              Ad 0 · · · 0 0 −In .. . . .. 0 −In     + ˆλ         Ad−1 · · · A0 · · · A?d In 0 .. . . .. ... .. . . .. ... 0 · · · · · · In 0                      ˆ y1 ˆ y2 .. . ˆ y2d     =      ˆ r1 ˆ r2 .. . ˆ r2d     . (4.13) After computation, we also have the following theorem.

Theorem 4.2. Let (ˆλ, ˆy) be an approximate eigenpair of (1.2) associated with the residual ˆr satisfying (4.13). Then it holds

P2d(ˆλ)ˆy1= ˆr1− d−2 X k=0 Ak d−k X j=2 λd−k−j+1_r_ˆ j− d X k=1 A? k d+k X j=2 ˆ λd+k−j+1_r_ˆ j. (4.14)

In the following, we shall analyze the backward error of the structured lineariza-tion (λZ?_{+ Z)y = 0 proposed in [12]. If (λ, [λ}2d−1_xT_{, λ}2d−2_xT_{, · · · , λx}T_{, x}T_]T_{) is an}

(18)

eigenpair of λZ?_{+ Z , then} (λZ?_{+ Z)}        λ2d−1_x λ2d−2_x .. . λx x        =        P2d(λ)x 0 .. . 0 P2d(λ)x        = 0. (4.15)

Thus, (λ, x) is an eigenpair of P2d(λ). For simplification, we consider that 2d = 6.

Then Z?=         A? 3 A?2− A3 A?1 A0 A1 A2 0 A? 3− A2 A?2− A3 A?1 A0 A1 0 −A1 A?3− A2 A?2− A3 A?1 A0 0 −A0 −A1 A?3− A2 A?2− A3 A?1 0 −A? 1 −A0 −A1 A?3− A2 A?2− A3 A? 3 0 0 0 0 A?3         .

Let (˜λ, ˜y) be an approximate eigenpair of (λZ?_{+ Z)y = 0 associated with the residual} ˜

r, i.e.,

(˜λZ?+ Z)˜y = ˜r. (4.16)

From (4.16) it follows that ˜ r1= ˜λ [A?3y˜1+ (A?2− A3)˜y2+ A?1y˜3+ A0y˜4+ A1y˜5+ A2y˜6] + A3y˜1+ A3y˜6,(4.17a) ˜ r2= ˜λ [(A?3− A2)˜y2+ (A?2− A3)˜y3+ A1?y˜4+ A0y˜5+ A1y˜6] +(A2− A?3)˜y1+ (A3− A?2)˜y2− A?1y˜3− A0y˜4− A1y˜5, (4.17b) ˜ r3= ˜λ [−A1y˜2+ (A?3− A2)˜y3+ (A2?− A3)˜y4+ A?1y˜5+ A0y˜6] +A1y˜1+ (A2− A?3)˜y2+ (A3− A2?)˜y3− A?1y˜4− A0y˜5, (4.17c) ˜ r4= ˜λ [−A0y˜2− A1y˜3+ (A?3− A2)˜y4+ (A?2− A3)˜y5+ A?1y˜6] +A0y˜1+ A1y˜2+ (A2− A?3)˜y3+ (A3− A?2)˜y4− A?1y5, (4.17d) ˜ r5= ˜λ [−A?1y˜2− A0y˜3− A1y˜4+ (A?3− A2)˜y5+ (A?2− A3)˜y6] +A1?y˜1+ A0y˜2+ A1y˜3+ (A2− A?3)˜y4+ (A3− A?2)˜y5, (4.17e) ˜ r6= ˜λ [A?3y˜1+ A?3y˜6] + A?2y˜1+ A?1y˜2+ A0y˜3+ A1y˜4+ (A2− A?3)˜y5+ A3y˜6.(4.17f)

Multiplying (4.17b), (4.17c), (4.17d), (4.17e) and (4.17f) by ˜λ, ˜λ2_{, ˜}_λ3_{, ˜}_λ4 _{and ˜}_λ5_,

respectively, and adding them to (4.17a), we get

P6(˜λ)(˜y1+ ˜y6) = 6 X i=1 ˜ λi−1r˜i.

In general, we have the following theorem.

Theorem 4.3. Suppose (4.15) holds and let (˜λ, [˜yT

1, · · · , ˜yT2d]T) be an approximate eigenpair of (λZ?_{+ Z)y = 0 associated with the residual [˜}_rT

1, · · · , ˜r2dT]T. Then P2d(˜λ)(˜y1+ ˜y2d) = 2d X i=1 ˜ λi−1_r_˜ i. 18

(19)

Remark 4.1. From Theorem 4.1 we see that the residual P2d(λ)y1 derived from the residual r = Q(λ)y of the quadraticization Q(λ) depends on the norms of A2, · · · , Ad, but is independent of the norms of A0 and A1. On the other hand, from Theorem 4.2 the residual P2d(ˆλ)ˆy1 derived from the residual ˆr = L(ˆλ)ˆy of the linearization L(ˆλ) depends on the norms of A0, A1, · · · , Ad. Generally, if the norms

of A0 and A1 are much dominant to the norms of A2, · · · , Ad, then the norms of

P2d(λ)y1 should be much smaller than that of P2d(ˆλ)ˆy1.

5. Balancing for PPEP and PQEP. Scaling is a commonly used technique for standard eigenvalue problems to improve the sensitivity of the eigenvalues. In this section, we shall discuss the diagonal scaling of the PPEP (1.1). In addition, the free parameters d1, . . . , dm in (2.3) and (2.4) will be chosen so as to scale the PPEP so that the backward errors of the eigenpairs for the PPEP will be improved. The discussed strategy for scaling, deduced from our numerical experiments, are not be unique or optimal.

First, consider the diagonal scaling of the PPEP (1.1). In order to balance the entries of coefficient matrices in the PPEP, we define a complex diagonal matrix

D ≡ diag(2α1_{, 2}α2+ıβ2_{, · · · , 2}αn+ıβn₎

with αi, βi∈ R so that the magnitude of the entries of coefficient matrices in the new PPEP D  d−1X j=0 λ2d−jA?d−j+ λdA0+ ε d X j=1 λd−jAj   D?_{x = 0}_˜

are close to one as much as possible. That is, we shall determine α1, . . . , αn and

β2, . . . , βn so that

2αj+ıβj_a(k)

j` 2α`−ıβ` ≈ 1, (5.1)

for j, ` = 1, 2, . . . , n and k = 0, 1, . . . , d, where a(k)_j` is the (j, `)-th entry of Ak. By taking logarithm of (5.1), the parameters, α1, . . . , αnand β2, . . . , βncan be determined by solving the least square problems

αj+ α`= −Re(log2(a(k)j`)) and βj− β`= −Im(log2(a(k)j` )),

where Re(c) and Im(c) represent the real and the imaginary parts of c, respectively. Then, {α1, . . . , αn} is solved by the normal equation

AT_A    α1 .. . αn    = ATa

where a is the vector formed by −Re(log₂(a(k)_j` )), the nonzero entries of A are equal to 1 or 2 and AT_{A ≡ [a} ij] with aij= ½ 4(d + 1) + (2d + 1)(n − 1), if i = j, 2d + 1, if i 6= j. 19

(20)

Furthermore, AT_{A is invertible and (A}T_A)−1_{≡ [ˆa} ij] with ˆaij= ½ (4dn + 2n − 2d + 1)/δ, if i = j, −(2d + 1)/δ, if i 6= j, where δ = a2 11+ (n − 2)a11a12− (n − 1)a2126= 0.

Similarly, {β2, . . . , βn} is solved by the normal equation

BT_B    β2 .. . βn    = BTb

where b is the vector formed by −Im(log2(a(k)j`)), the nonzero entries of B are equal to 1 or −1 and (BT_B)−1 _{≡ [ˆb}_{ij] with}

ˆbij= ½

2/(n(2d + 1)), if i = j, 1/(n(2d + 1)), if i 6= j. It only requires O(n2_{) flops to determine the diagonal matrix D.}

Next, we shall determine d1, . . . , dmin (2.3) and (2.4). Let

η(1)_i = max{max{kA2m−jk1; j = 0, 1, . . . , 2i − 2}, 1} (for 2d = 4m), η(2)_i = max{max{kA2m−j+1k1; j = 0, 1, . . . , 2i − 1}, 1} (for2d = 4m + 2),

for i = 1, 2, . . . , m. If we take di = η(1)_i and di= η(2)_i for i = 1, 2, . . . , m in (2.3) and (2.4), respectively, then from Theorem 4.1, we have

kP4m(λ)y1k∞≤ m−1_X i=0 |λ|2i   2i X j=0 |λ|2m−j+ 1   kr2i+2k∞ +|λ|2m−1 m−1_X i=1 |λ|−2ikr2m−2i+1k∞+ |λ|2m−1kr1k∞ and kP4m+2(λ)y1k∞≤ |λ|2m+1 m X j=1   X2m i=2m−2j+1 |λ|i−2m+2j−1   krj+1k∞ + m X i=1 |λ|2i−2_kr 2ik∞+ |λ|2mkr1k∞

which imply that the upper bound of the residual P2d(λ)y1 is independent of the

norms of A0, . . . , Ad. However, such a choice for {di} may destroy the balance of the

coefficient matrices in (2.3) and (2.4) for the PQEP.

(21)

In order to balance dk = η(`)_k and the entries of A2m−2k+1(k = 1, . . . , m − 1) as

well as dm= η(`)m and the entries of A1 for A1 in (2.3) and (2.4), we re-set dk = ξ(`)_k

with ξ(`)_k = v u u u tη(`)_k  Xn i=1 n X j=1 |A2m−2k+1(i, j)|/n2  , k = 1, . . . , m − 1, ξ(`)m = v u u u tη(`)m   n X i=1 n X j=1 |A1(i, j)|/n2  ,

for ` = 1, 2. Here Ak(i, j) represents the (i, j)-th entry of Ak.

To balance the entries of A0and A1 in (2.3) and (2.4), we define

τ(1)= max   0≤k≤dmax    n X i=1 n X j=1 |Ak(i, j)|/n2   , n X i=1 n X j=1 | ˜A0(i, j)|/n2, ρ(1)   , τ(2)= max   0≤k≤dmax    n X i=1 n X j=1 |Ak(i, j)|/n2   , ρ (2)   , where ˜A0= A0− √ εI −√εA? 2mA2mand ρ(`)= max{ξ_k(`); k = 1, . . . , m}, for ` = 1, 2. We take dk= %(`)= p ρ(`)_τ(`)_{, for k = 1, . . . , m, ` = 1, 2.}

Finally, for the case 2d = 4m, −√εd2

1I is a submatrix of A0in (2.3). In order to

balance this submatrix with other submatrices in A0, we let

χ = max   max    n X i=1 n X j=1 |A2k(i, j)|/n2; k = 1, . . . , m − 1   , n X i=1 n X j=1 | ˜A0(i, j)|/n2   , and modify d1 by d1= ( ¡ %(1)¢1/1.5_{, if χ <}¡_%(1)¢2_, %(1)_, _otherwise.

6. Numerical results. A standard approach for solving the (?, ε)-PPEP (1.1) is to transform it to a 2dn×2dn linear eigenvalue problem and compute the eigenpairs by the QZ algorithm. In this paper, we have proposed a structured quadraticization to transform (1.1) into a dn × dn PQEP, to which we apply the SPA (Algorithm 3.1). In this section, we compare the computational costs of these two approaches. Moreover, we also compare the backward errors and symplectic reciprocities of the eigenpairs from the SPA and QZ approaches as well as the direct application of “polyeig” in MATLAB on the (?, ε)-PPEP (1.1).

(22)

6.1. Computational cost. For a more efficient SPA, we reorder the submatrices in the matrix pair ( eK2, eN2) of (3.17) by the permutation matrices

Π1=       In 0 0 0 0 0 0 I(d−1)n 0 0 0 0 0 0 I(d−2)n 0 0 0 In 0 0 In 0 0 0      , Π2=         0 0 In 0 0 0 0 I(d−2)n 0 0 0 0 0 0 0 0 0 In 0 0 0 0 I(d−2)n 0 0 0 0 In 0 0 In 0 0 0 0 0         .

For the case 2d = 4m, we substitute A1 of (2.3a) into eN2of (3.17b), and get

e N2:= · Π1 0 0 Π2 ¸ e N2 · ΠT 2 0 0 ΠT 1 ¸ =     D11 0 −V11 V21 0 D11 V21 V11 0 0 − Ã2m,R − Ã2m,I 0 0 A˜2m,I − Ã2m,R     ⊕     D11 0 0 0 0 D11 0 0 −VT 11 V21T − ÃT2m,R A˜T2m,I VT 21 V11T − ÃT2m,I − ÃT2m,R     , where D11= diag ¡ −dmIn −d1In εd2In −d2I εd3I · · · −dm−1In εdmIn ¢ , VT 11= £ −AT

1,I −AT2m−1,I 0 · · · −AT3,I 0

¤ , VT 21= £ −AT 1,R −AT2m−1,R 0 · · · −AT3,R 0 ¤ , ˜ A2m= ε √ εd1A2m.

Here Ak,Rand Ak,I represent the real and the imaginary parts of Ak. Similarly, for the case 2d = 4m + 2, we have

e N2:= · Π1 0 0 Π2 ¸ e N2 · ΠT 2 0 0 ΠT 1 ¸ =     D12 0 −V12 V22 0 D12 V22 V12

0 0 −A2m+1,R −A2m+1,I

0 0 A2m+1,I −A2m+1,R     ⊕     D12 0 0 0 0 D12 0 0 −VT 12 V22T −AT2m+1,R AT2m+1,I VT

22 V12T −AT2m+1,I −AT2m+1,R

    , where D12= diag ¡ −dmIn εd1In −d1In εd2In −d2In · · · −dm−1In εdmIn ¢ , VT 12= £ −AT

1,I 0 −AT2m−1,I 0 · · · −AT3,I 0

¤ , VT 22= £ −AT 1,R 0 −AT2m−1,R 0 · · · −AT3,R 0 ¤ . Set e K2:= · Π1 0 0 Π2 ¸ e K2 · ΠT 2 0 0 ΠT 1 ¸ .

Now, we use the following steps to reduce the new pair ( eK2, eN2) to block upper

triangular forms as in (3.20):

(23)

• (QR factorization and updating with real arithmetic operations) Compute Q1 and Z1 such that QT1Ne2Z1 =

"

N₁₁(1) 0 0 (N₁₁(1))T

#

, where N₁₁(1) is upper triangular. In this step, it requires 32/3n3 _{flops for the QR factorization of}

an 2n × 2n real matrix and 48dn3_{+ 24dn}2 _{flops to update Q}T

1Ke2Z1.

• (Given’s rotations and updating with real arithmetic operations) For each j-th columns of ( eK2, eN2) with j = 1, . . . , 2dn − 2, we need to

(i) annihilate eK2(2nd+i, j) and eK2(2nd+j, i) with i > j by using rotations of

the (2nd+i+1, j)-th and (2nd+i, j)-th entries, and the (2nd+j, i+1)-th and (2nd + j, i)-th entries of eK2, respectively. It requires 6(4dn − j) + 5

flops. Updating the corresponding rows and columns of eN2 requires

6(i + 1) flops. Subsequently, annihilating the corresponding fill-in for e

N2 by Given’s rotations requires 6(2dn − i) + 5 flops. Updating the

corresponding matrix eK2 requires 6(4dn − j + 1) flops. Consequently,

annihilating the entries from 2nd + j + 1 to 4dn − 1 in the j-th column of e

K2and preserving the upper triangular N11(1)require (2dn−j −2)(60dn−

12j + 22) flops.

(ii) annihilate eK2(4dn, j) and eK2(2dn + j, 2nd) by using the rotation of the

(2nd, j)-th and (4nd, j)-th entries, (2nd+j, 4nd)-th and (2nd+j, 2nd)-th entries of eK2, respectively, and update eN2 which require 6(4dn − j) +

5 + 6(2dn) flops.

(iii) annihilate eK2(i, j) and eK2(2nd + j, 2nd + i) with i > j + 1 by using the

rotations of (i − 1, j)-th and (i, j)-th entries, (2nd + j, 2nd + i − 1)-th and (2nd + j, 2nd + i)-th entries of of eK2, respectively. It requires 6(4dn − j) + 5. Updating the corresponding rows and columns of eN2 requires

6(4nd − i + 2) flops. On the other hand, annihilating the corresponding filled in for eN2by using Given’s rotations requires 6i + 5 flops.

Updat-ing the correspondUpdat-ing matrix eK2 needs 6(4dn − j) flops. Consequently,

annihilating the entries from 2nd to j + 2 of the j-th column for eK2and

preserving the upper triangular N₁₁(1)require (2dn−j−1)(72dn−12j+22) flops.

From above steps, we see that reducing the pair ( eK2, eN2) in (3.17) to block upper

triangular forms of (3.20) requires

48dn3_{+ 24dn}2₊ 2dn−2_X j=1 £ 264d2_n2_{− (180j + 68)dn + 24j}2_{− 14j − 61}¤ = (232d3_{+ 48d)n}3_{− (296d}2_{− 24d)n}2_{− 20dn + 84 flops.}

The QZ step for the real upper Hessenberg and triangular pair (K11, N11) in (3.20)

requires 22(2dn)3_{= 176d}3_n3_{flops to obtain the upper quasi-triangular and triangular}

pair, i.e., all eigenvalues of (K11, N11). Computing the corresponding eigenvectors

by using inverse power method requires 4 × (2dn) × (2dn)2 _{= 32d}3_n3 _{flops. The}

eigenvectors of ( eK2, eN2) can be computed by an additional (376d3+32d)n3−(332d2−

16d)n2_{+56dn flops. Therefore, computed all eigenvalues of ( e}_K

2, eN2) requires (408d3+

48d)n3_{− (1400d}2_{− 24d)n}2_{flops, and an additional (408d}3_{+ 32d)n}3_{− (332d}2_{− 16d)n}2

flops are required for the corresponding eigenvectors. For the QZ algorithm applied to a 2dn×2dn palindromic linearization λZ +Z∗_{, it requires approximately 120(2dn)}3₌

(24)

Eigenvalues Eigenvectors Total Algorithm 3.1 (408d3_{+ 48d)n}3 _(408d3_{+ 32d)n}3 _(816d3_{+ 80d)n}3

QZ algorithm 960d3_n3 _736d3_n3 _1696d3_n3

Table 6.1

Computational flops of Algorithm 3.1 and QZ algorithm on (Z?_{, −Z).}

960d3_n3 _{flops for the eigenvalues and an additional 92(2dn)}3_{= 736d}3_n3 _{flops for the}

eigenvectors.

We summarize the computational flops of Algorithm 3.1 and the QZ algorithm in Table 6.1

6.2. Numerical experiments. In Sections 2 and 3, we propose a structured quadraticization to transform a PPEP into a PQEP, to which we apply Algorithm 3.1 (SPA). Here, we use “SQ SPA” to denote the proposed method. We use “SL QZ” to denote the method asscoiated with the structured linearization (λZ?_{+ Z)y = 0,} proposed in [12] for the PPEP, to which the QZ algorithm is applied. In the following, we show the numerical comparisons of the backward errors and symplectic reciprocities of eigenpairs, computed by SQ SPA, “polyeig” in MATLAB (applied directly to (1.1)) and SL QZ.

As mentioned before, theoretically, the eigenvalues of H-PPEP appear in recipro-cal pairs (λ, 1/¯λ). So, if we sort the eigenvalues in ascending order by modulus, the

product of the i-th eigenvalue and the conjugate of the (2dn + 1 − i)-th eigenvalue should be one. Therefore, we define the reciprocities of computed eigenvalues by

ri≡ |λi¯λ2dn+1−i− 1|, i = 1, . . . , dn. (6.1)

All numerical experiments are carried out using MATLAB 2008a with the machine precision eps ≈ 2.22 × 10−16_.

Let Cn,m,rdenote the set of random complex n × m matrices with normal distri-bution which real and imaginary parts of the entries in [−r, r].

Example 6.1. Consider the discrete-time optimal control problem

min {uj} ∞ X j=0 · xj uj ¸H· Q S SH _R ¸ · xj uj ¸

with QH _{= Q, Mi} _{∈ C}n×n_{, S, B ∈ C}n×m _{and R}H _{= R ∈ C}m×m _{subject to the}

discrete-time control

2d

X i=0

Mixi+d= Bui

for given x0, x1, · · · , x2d−1. This system corresponds to a H-palindromic polyno-24

(25)

mial [1, 10] P (λ) = λ2d   _M0H M0 0 2d 0 0 0 SH ₀   + λ2d−1   _MH0 M1 0 2d−1 0 0 0 0 0   + · · · +λd+2   _M0H Md−2 0 d+2 0 0 0 0 0   + λd+1   _M0H Md−1 0 d+1 0 0 −BH ₀ ₀   +λd   _M0H Md 0 d Q 0 0 0 R   + λd−1   M0H Md+1 −B d−1 0 0 0 0 0   +λd−2   _M0H Md+2 0 d−2 0 0 0 0 0   + · · · + λ   _M0H M2d−1 0 1 0 0 0 0 0   +   _M0H M2d 0 0 0 S 0 0 0   . −3 −2 −1 0 1 2 3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 2.5

real part of eigenvalue

imaginary part of eigenvalue

Fig. 6.1. The distribution of nonzero finite eigenvalues of Example 6.1.

Take n = 5, m = 1 , d = 3, and Q, Mi ∈ Cn,n,10, S, B ∈ Cn,m,10 and R ∈

Cm,m,10. The corresponding nonzero finite eigenvalues are shown in Figure 6.1. The H-palindromic PEP has infinite and zero eigenvalues which can be computed by the SQ SPA and polyeig, but not by the SL QZ. For the SL QZ, the eigenvalue nearest zero is |λ| = 4.4 × 10−6_{. The backward errors and symplectic reciprocities ri} _{of the} eigenpairs are shown in Figure 6.2 (a) and (b). We use the notations “×”, “4” and “o” to denote the results computed by the polyeig, SL QZ and SQ SPA, respectively, and some zero reciprocity riof the eigenpairs produced by the SQ SPA are not shown. From Figure 6.1, we see that the SQ SPA not only preserve the reciprocal property of the eigenvalues but also provides better accuracy than that by the polyeig and SL QZ. In the following, we construct three examples with higher degrees and larger matrix sizes than that of Example 6.1.

Example 6.2. Consider the H-palindromic PEP with d = 5 and Ai ∈ Cn,n,100

for i = 0, . . . , 5.

Example 6.3. Consider the H-palindromic PEP with d = 4, A1, A3, A4 ∈ 25