多維陣列上的Perron-Frobenius定理及計算

(1)

國立高雄大學應用數學學系

碩士論文

多維陣列上的

Perron-Frobenius 定理及計算

Perron-Frobenius theorem and computation on

multidimensional arrays

研究生：林冠宇撰

指導教授：郭岳承

(2)

(3)

致謝

首先，我要感謝指導教授郭岳承老師，在我的碩士期間教導

我許多矩陣理論及計算的相關知識，也在我遇到困難時給予適當

的建議及鼓勵我參與許多校外活動。感謝吳宗芳老師大學及碩士

期間的照顧，能在大學時期上你的課以及擔任你兩年的大學入門

助教可以說是十分的幸運。感謝系辦雅鳳姐及佩君姊這些年的行

政上的各種協助。感謝我的家人一路上精神的支持讓我得以完成

碩士的學位。感謝學弟妹平常與我聊天及分享，陪我度過在高大

應數最後的時光。最後感謝系上的一切，讓我在這裡留下許多美

好的回憶。

林冠宇謹於

國立高雄大學應用數學系

(4)

Perron-Frobenius theorem and

computation on multidimensional

arrays

by

Guan-Yu Lin

Advisor

Yueh-Cheng Kuo

Department of Applied Mathematics,

National University of Kaohsiung

Kaohsiung, Taiwan 811, R.O.C.

June 2018

(5)

多維陣列上的

Perron-Frobenius 定理及計算

指導教授：郭岳承教授國立高雄大學應用數學系學生：林冠宇國立高雄大學應用數學系摘要 Perron-Frobenius 定理在非負不可約矩陣上展示了一些重要的結果。此定理目前已有許多的應用及推廣。在本論文中，我們著重在非負不可約矩陣上。在建構出一線性同倫後，我們分析解曲線並證明任意一個非負不可約矩陣有一正特徵對。這個是 Frobenius 定理主要的結果。此證明技巧可被推廣用來證明張量上的 Perron-Frobenius 定理。除此之外，我們也呈現了延拓法在非負不可約矩陣及張量上的數值結果。關鍵字： Perron-Frobenius 定理、不可約矩陣及張量、延拓法、隱函數定理

(7)

Perron-Frobenius theorem and computation on

multidimensional arrays

Advisor: Dr. Yueh-Cheng Kuo Department of Applied Mathematics

National University of Kaohsiung

Student: Guan-Yu Lin

Department of Applied Mathematics National University of Kaohsiung

ABSTRACT

The Perron-Frobenius theorem shows some important results on nonnegative irreducible matrices. This theorem has various applications and extensions. In this paper, we focus on the nonnegative irreducible matrices. After constructing a linear homotopy, we analyze the solution curve of the linear homotopy and prove that any nonnegative irreducible matrix has the positive eigenpair. This is the main result of the Perron-Frobenius theorem. The skill of the proof can be extended to prove the Perron-Frobenius theorem on tensors. Furthermore, the numerical results computed by the homotopy continuation method on nonnegative irreducible matrices and tensors are presented.

Keywords: Perron-Frobenius theorem, Nonnegative irreducible matrices and tensors, Continuation method, Implicit function theorem

(8)

1 Introduction

Perron-Frobenius theorem is a fundamental and useful theorem for nonnegative irreducible matrices. It guarantees that the spectral radius is a simple eigenvalue corresponding to a positive eigenvector for any nonnegative irreducible matrix. The theorem has various applications, e.g., Markov chains, graph theories, and population models. The result of the theorem is elegent but the proof of the theorem is not familiar to the beginner of learning linear algebra. The origin proof has lots of matrix limits and it can be found in [10]. So, how to give a simpler proof is the first problem that what we concern and we will re-prove it with a new way.

The multidimensional arrays are often called the tensors. In fact, there are many properties of matrices that have been extended on tensors, including the eigenvalue problem. There are many researches about the spectral theories and computation of the eigenvalues on tensors, including the extension of the Perron-Frobenius theorem. W. Ding and Y. Wei [15] organized the knowledges of tensors, including the notations and applications. Chang, Pearson, and Zhang used the Brower’s fixed point theorem to extend the Perron-Frobenius theorem on tensors [5]. They even surveyed the spectral theories and applications of tensors in [4]. Kuo, Lin, and Liu [14] used the continuation method to compute the positive eigenpair for nonnegative weakly irreducible tensors. Kuo’s research almost proved the Perron-Frobenius theorem on tensors, except the spectral radius. So, we will continue the work of [14] and use the skill of the proof in [14] to prove the Perron-Frobenius theorem on tensors.

The main contributions are that re-proving the Perron-Frobenius theorem by a new way and extending the skill of the proof on tensors. This paper is organized as follow. In section 2 is about the preliminaries, including the irreducible matrices, the Gershgorin’s disk theorem and the implicit function theorem. In section 3, we analyze the nonnegative irreducible matrices first and re-prove the Perron-Frobenius theorem by obtaining the positive eigenpair. In section 4, the homotopy continuation method is presented to compute the positive eigenpair for nonnegative irreducible matrices. Given an initial condition at t = 0 and construct a linear

homotopy F1(x, λ, t) = 0. We compute the predict vector of possible solution and

use the Newton’s method to correct the solution. Furthermore, we compare the numerical results with other algorithms. Section 5 is about the preliminaries for tensors. In section 6, we analyze the nonnegative irreducible tensors first and prove the Perron-Frobenius theorem on tensors by obtaining the positive eigenpair of any nonnegative irreducible tensor. Section 7 is the numerical results on tensors. We

construct a linear homotopy F2(x, λ, t) = 0 for tensor and use the same homotopy

continuation method in section 4 to compute the positive eigenpair on tensors. Final section is about the conclusion.

(9)

2 Preliminaries for matries

In this section, some useful theorems will be introduced. For the consistency of symbols, we use the capital letters to denote matrices and lowercase letters to denote the scalars, respectively. Bold letters are denoted as the vectors. Let Rn×n>0 (R

n×n

≥0 ) is the set of all positive(nonnegative) matrices and ρ(.) is the spectral

radius of a matrix.

2.1 Irreducible matries

Definition 2.1. A ∈ Cn×n _{is reducible if there is a permutation matrix P ∈ C}n×n

such that

PTAP = B C

O D

, (2.1)

where B ∈ Cr×r, C ∈ Cr×(n−r), and D ∈ C(n−r)×(n−r). If A is not reducible, then

A is irreducible.

On matrices, the reducibility is an important property. We can find that if a matrix A is reducible then the eigenvalues of A are the collection of the eigenvalues of B and D and det(A) = det(B)det(D). Furthermore, the less computations will be required if a matrix of linear system or eigenvalue problem is reducible.

2.2 Gershgorin’s disk theorem

The Gershgorin’s disk theorem is a fundamental theorem for the upper-bound of eigenvalues of matrices. The statement of the theorem is as follows.

Definition 2.2. Let A ∈ Cn×n and ρi(A) =Pn_j=1|Ai,j| for i = 1, 2, ...n. Then the

ith Gerschgorin’s disk Ci is defined

Ci = {z ∈ C : |z − Aii| ≤ ρi(A) − |Aii|}. (2.2)

Theorem 2.1. (Gershgorin’s Disk Theorem) Let A ∈ Cn×n_{. Then every}

eigen-value of A is contained in a Gershgorin disk. We give an example to explain this theorem.

Example 1. Let A = 2 + 2i 2

1 −3i

, then the eigenvalues λ1, λ2 are λ1 =

2.1648 + 1.6463i, λ2 = −0.1648 − 2.6463i. We can see that there are two

(10)

-1 0 1 2 3 4 real axis -4 -3 -2 -1 0 1 2 3 4 imaginary axis 2.1648 + 1.6463i -0.1648 - 2.6463i

Figure 1: Eigenvalues of A in each Gershgorin Disk

2.3 Implicit function theorem

In this subsection, we will introduce the implicit function theorem. The proofs can be found in [3]. Following definitions are about the critical value and regular value.

Definition 2.3. Let F : Rm _{→ R}n _{be continuously differentiable and let}

C = {x ∈ Rm | rank(DxF (x)) < min(m, n)},

then

(a). Let x ∈ C and x is called the singular point of G.

(b). Let x ∈ Rm_{\C and x is called the regular point of G.}

(c). F(C) is the set of critical values of F.

(d). Rn_{\F (C) is the set of regular values of F.}

We give a simple example to explain.

Example 2. Let F (x, t) = (x − 2)(2x2 _{− t), where x, t ∈ R. Then}

DF (x, t) = [6x2− 8x − t, 2 − x],

(11)

1. (2,8) is the critical point.

2. R2\{(2, 8)} is the set of regular point.

3. 0 is the critical value.

4. R\{0} is the set of regular value.

Next, we introduce the implicit function theorem

Theorem 2.2. (Implicit function theorem) Let F : Rn_{× R → R}n _{and it satisfies}

(a). F (u0, t0) = 0, where u0 ∈ Rn, t ∈ R.

(b). DuF (u0, t0) is invertible.

(c). F and DuF (u, t) are smooth near u0.

Then there is a smooth function u(t),u : Bρ1(t0) → Bρ2(u0) such that

1. F (u(t), t) = 0, for all t ∈ Bρ1(t0) and u(t0) = u0,

2. The solution of F (u, t) = 0 in Bρ2(u0) is only u(t).

We see the example 2 and obtain that there is no smooth function near (2, 8)(See the figure 2). Theorem 2.2 says that if a function F satisfies the conditions in The-orem 2.2 and there will be a unique solution curve that can be parametrized by t.

-5 -4 -3 -2 -1 0 1 2 3 4 5 x -20 0 20 40 60 80 100 t (2,8)

(12)

3 Perron-Frobenius theorem

3.1 Analysis of nonnegative irreducible matrices

First, we introduce the Perron-Frobenius theorem.

Theorem 3.1. (Perron-Frobenius Theorem) A matrix A ∈ Rn×n≥0 is irreducible,

then the following statements are hold (a). ρ(A) > 0 is a simple eigenvalue.

(b). There is a positive eigenvector x ∈ Rn

>0 such that Ax = ρ(A)x .

(c). If λ is an eigenvalue with a nonnegative eigenvector, then λ = ρ(A).

Our goal is to prove the Theorem 3.1. The most noticeable feature of any nonnegative irreducible matrices is the positive eigenpair. In fact, it is sufficient to prove the Perron-Frobenius theorem by obtaining a positive eigenpair for any nonnegative irreducible matrix. First, there is no nonnegative eigenpair (λ, x) ∈

∂(Rn+1≥0 ) for any nonnegative irreducible matrix, where ∂(Rn+1≥0 ) is the boundary of

Rn+1≥0 .

Lemma 3.1. Let A ∈ Rn×n≥0 be irreducible. Suppose x ∈ Rn≥0 is an eigenvector of

A coeersponding to λ, then x ∈ Rn

>0 and λ > 0.

Proof. Suppose x ≥ 0, x is nonzero and there is a nonempty set S ⊂ {1, 2, ..., n}

such that xi = 0, where i ∈ S. Then there is a permutation matrix P ∈ Rn×n such

that PT_{x =} x∗

0

∈ Rn

≥0, where x∗ ∈ Rk>0, 0 < k < n. And we obtain

λPTx = PTλx = PTAx = PTAP PTx. (λ, PT_{x) is an eigenpair of (P}T_{AP ). So,} PTAP = B C D E x∗ 0 = λ x∗ 0 , where B ∈ Rk×k≥0 , C ∈ R k×(n−k) ≥0 , D ∈ R (n−k)×(n−k) ≥0 , and E ∈ R (n−k)×(n−k) ≥0 . Hence,

Dx∗ = 0. Since x∗ > 0, we get D is a zero matrix and A is a reducible matrix,

a contradiction. Since A is nonnegative irreducible matrix and x ∈ Rn

>0, λx =

Ax > 0. So, λ > 0.

By Lemma 3.1, if there is an eigenvector x ∈ Rn≥0 of an irreducible matrix

A ∈ Rn×n≥0 , then x must be positive. And the following lemma can be obtained if

(13)

Lemma 3.2. Let A ∈ Rn×n≥0 be irreducible. Suppose x ∈ Rn>0 is an eigenvector of

A corresponding to λ ∈ R>0, then

(a). nullity(A − λI) = 1.

(b). The left eigenvector is also positive. (c). λ is a simple eigenvalue.

(d). λ = ρ(A).

(e). The matrix M = A − λI −x

2xT 0

∈ R(n+1)×(n+1) _{is invertible.}

Proof. (a). Suppose (λ, y) ∈ Rn+1 is also an eigenpair of A. Then there exists

β ∈ R such that

v = x − βy ≥ 0.

If y ∈ Rn

>0, β = min1≤i≤n(xi/yi). Otherwise, β = max1≤i≤n(xi/yi) < 0. Vector

v is a nonnegative vector with at least an element is 0. If v 6= 0, then (λ, v) is an eigenpair of A. But it is a contradiction to Lemma 3.1. So, v = 0 and nullity(A − λI) = 1.

(b). Let a nonzero y ∈ Rn _{be a left eigenvector of A such that y}T_{A = λy}T_{. So,}

λ | yT |=| λyT |=| yTA |≤| yT || A |=| yT | A,

and let zT _{=| y}T _{| (A − λI) ≥ 0. Since (λ, x) is an eigenpair of A. If z 6= 0,}

then zT_{x =| y}T _{| (A − λI)x = 0. But it is a contradiction, because z ≥ 0, x > 0}

, zTx 6= 0. So, z = 0, and (λ, | y |) is the left eigenpair of A. We know the left

eigenvector of A is the eigenvector of AT. By Lemma 3.1, the elements of | y | are

all positive .

(c). Suppose λ is not simple, then there is a x1 ∈ Rn such that

(A − λI)x1 = x ∈ Rn>0.

There is also a left positive eigenpair (λ, y) ∈ Rn+1>0 of A by Lemma 3.2(b). Then

yT_{(A − λI)x}

1 = 0 = yTx, a contradiction. So, λ is simple.

(d). λ ≤ ρ(A) is clearly. Suppose u is an eigenvalue of A and |u| = ρ(A). Then

there is a Gershgorin’s disk Ci such that ρ(A) ∈ Ci with some i ∈ {1, 2, ...n}. And

|u| ≤ |u − Aii| + |Aii| ≤Pn_j=1Aij. Then

ρ(A) = |u| ≤ max

1≤i≤n n

X

j=1

(14)

Since A is similar to Q−1AQ, where Q is the diagonal matrix with Qii = vi and

vi > 0 for i = 1, 2, ...n. It implies

ρ(A) = |u| ≤ max

1≤i≤n 1 vi n X j=1 Aijvj (3.2)

for any positive vector v ∈ Rn_{. Take (λ, x) in the inequality (3.2), and we get}

ρ(A) ≤ λ. So, λ = ρ(A).

(e). Suppose M is singular, then there exist a nonzero vector zT = zT₁, z2

T ∈ Rn+1. such that M z = A − λI −x 2xT ₀ z1 z2 = 0.

By (b), there is a left eigenvector y ∈ Rn>0 such that

yT(A − λI)z1 = z2yTx = 0.

It implies

z2 = 0, z1 = αx,

where α ∈ R and α 6= 0. And then z1x = 2αxTx 6= 0, a contradiction.

Let A ∈ Rn×n≥0 be irreducible. By Lemma 3.2, if we can obtain a positive

eigenpair (λ, x) ∈ R>0× Rn>0, then λ = ρ(A) is a simple eigenvalue. And x is the

unique positive eigenvector. By Lemma 3.2(a)-(d), the Perron-Frobenius theorem can be proved by obtaining a positive eigenvector for any nonnegative irreducible matrix. Lemma 3.2(e) is an useful result for later analysis.

3.2 Proof of Theorem 3.1

Let A0 = x1xT2 be a rank-1 matrix, where x1, x2 ∈ Rn>0. And we let

λ0 = xT1x2, x0 =

x1

||x1||

. (3.3)

By Lemma 3.2, we know that (λ0, x0) is the unique positive eigenpair of A0. Now,

we give an irreducible matrix A ∈ Rn×n≥0 and let

A(t) = (1 − t)A0+ tA, t ∈ [0, 1]. (3.4)

Let F1 : Rn+1 × R → Rn+1 be continuous and differentiable. We consider the

following linear homotopy

F1(x, λ, t) =

A(t)x − λx

xTx − 1

(15)

From equation (3.5), (x0, λ0, 0) is the unique positive solution of F1(x, λ, t) = 0

and (λ0, x0) is the positive eigenpair of A0. Let (x∗, λ∗, 1) be the solution of

F1(x, λ, 1) = 0 as t = 1 and it is clearly to see that (λ∗, x∗) is the eigenpair of A.

By Lemma 3.2, if (λ∗, x∗) is the positive eigenpair of A, then the proof of Theorem

3.1 will be finished. We differentiate F1(x, λ, t) and get the Jacobian matrix of

F1(x, λ, t), DF1(x, λ, t) = [Dx,λF1(x, λ, t)|DtF1(x, λ, t)] ∈ R(n+1)×(n+2), where Dx,λF1(x, λ, t) = A(t) − λI −x 2xT 0 ∈ R(n+1)×(n+1), (3.6a) DtF1(x, λ, t) = (A − A0)x 0 ∈ Rn+1. (3.6b)

So, the following lemma can be obtained.

Lemma 3.3. If (x∗, λ∗, t∗) ∈ Rn+1≥0 × [0, 1) is an solution of F1(x, λ, t) = 0, then

the Jacobian matrix Dx,λF1(x∗, λ∗, t∗) is invertible.

Proof. For t∗ ∈ [0, 1), A(t∗) ∈ Rn×n_>0 is irreducible. (λ∗, x∗) ∈ Rn+1_>0 is an eigenpair

of A(t∗) by Lemma 3.1. By Lemma 3.2(e), Dx,λF1(x∗, λ∗, t∗) is invertible.

Since A0 is a positive rank-1 matrix and (λ0, x0) that defined in (3.3) is the

eigenpair of A. By Lemma 3.2(d) and the implicit function theorem, there is a

solution curve s1(t) = (x(t), λ(t), t) ∈ Rn+1≥0 × [0, 1) of F1(x, λ, t) = 0 that is

parametrized by t, where s1(0) = (x0, λ0, 0). For t ∈ [0, 1), we can get s(t) ∈

Rn+1>0 × [0, 1) and (λ(t), x(t)) /∈ ∂(R n+1

≥0 ) by Lemma 3.1. By Lemma 3.2(d), λ(t) =

ρ(A(t)) and we get ||(λ(t), x(t))|| is bounded. We can obtain the nonnegative

solution set of F1(x, λ, t) = 0,

C1 = {(x(t), λ(t), t)|F1(x(t), λ(t), t) = 0} ⊆ Rn+1≥0 × [0, 1), (3.7)

where (x0, λ0, 0) ∈ C1. So, the following theorem can be obtained.

Theorem 3.2. Let A ∈ Rn×n≥0 be irreducible. Then the nonnegative solution of

F1(x, λ, 1) = 0 is isolated.

Proof. Let (x∗, λ∗, 1) be the accumulation point of C1. (λ∗, x∗) is the eigenpair of

A. Since A is irreducible. So, (λ∗, x) ∈ Rn+1_>0 . λ∗ is the root of the characteristic

polynomial of A and λ∗ is isolated. By Lemma 3.2(c), we get the nonnegative

solution of F1(x, λ, 1) = 0 is isolated.

Theorem 3.2 shows that any irreducible matrix A ∈ Rn×n≥0 has the positive

eigenpair (λ∗, x∗) of A by tracing the solution curve s1(t) of F1(x, λ, t) = 0 in (3.5)

with initial (λ0, x0) in (3.3). By Lemma 3.2(a)-(d), the Theorem 3.1 has been

(16)

4 Numerical experiments

In this section, the homotopy continuation method will be introduced for

comput-ing the positive solution of F1(x, λ, 1) = 0 which is defined in (3.5) first. The other

numerical methods will be introduced and the experiments results will be shown.

4.1 Homotopy Continuation method

In section 3, the Perron-Frobenius theorem has been proved and we know that the positive eigenpair of nonnegative irreducible matrices can be obtained by tracing the solution curve of the linear homotopy in (3.5). So, we use the homotopy continuation method to experiment. The homotopy continuation method includes

the predictor process and the correction process for each iteration. Let x0, λ0 in

(3.3) be the initial condition of F1(x, λ, t) = 0 in (3.5) as t = 0. Let (xTi , λi, ti)T

be the i-th step which satisfies F1(x, λ, t) = 0 for i = 1, 2, ... . Following is about

the prediction process and the correct process.

Predict process

We use the tangent vector as the predict vector. We differentiate the equation

F1(x, λ, t) = 0 at (xi, λi, ti) and get Dx,λF1(xi, λi, ti) ˙ xi ˙ λi + DtF1(xi, λi, ti) ˙ti = 0.

Let ˙ti = ∆ti, where ∆ti is the step length and ti+1= ti+ ∆ti ≤ 1. And we get

˙ xi ˙λi = −Dx,λF1(xi, λi, ti)−1DtF1(xi, λi, ti). So, h x(1)T_i+1, λ(1)_i+1 iT

is the predict vector, where " x(1)_i+1 λ(1)_i+1 # = xi λi + ∆ti ˙ xi ˙ λi .

(17)

Correct process

For computing the approximate solution of F1(xi+1, λi+1, ti+1) = 0, we use the

Newton method with the initial x(1)_i+1, λ(1)_i+1 and ti+1 is fixed. Each iteration of

Newton method satisfies " x(l+1)_i+1 λ(l+1)_i+1 # = " x(l)_i+1 λ(l)_i+1 # + δl, where δl satisfies Dx,λF1(x (l) i+1, λ (l) i+1, ti+1)δl = −F1(x (l) i+1, λ (l)

i+1, ti+1) for l = 2, 3, . . ..

We use the Newton method until that the (x(l+1)_i+1 , λ(l+1)_i+1 , ti+1) converges to (xi+1,

λi+1, ti+1) and satisfies F1(xi+1, λi+1, ti+1) = 0 approximately. The convergent

behavior of ||F1(x

(l+1) i+1 , λ

(l+1)

i+1 , ti+1)|| will be quadratic. After we get (xi+1, λi+1,

ti+1), we continue the predict-correct process until t = 1.

Adjustment of the step length

An iteration is composed by a predict and a correct process. ”∆t” is the step length for the predict process and ”times” is the numbers of the successful iterations. After the predict process, we execute the correct process. If the correct process

is not convergent to the possible solution of F1(x, λ, t) = 0, then ∆t will be cut

half. If ∆t < 10−16, the homotopy continuation method will be stop. If we get

a successful iteration, times will be added 1. While the iteration successes twice, the step length will be enlarged to accelerate the iteration. Then we set times = 0 and continue the iteration. Let the initial ”acc”= 1 and following is the algorithm of the homotopy continuation method for computing the nonnegative eigenpair of a nonnegative irreducible matrix.

(18)

Algorithm 1 Homotopy continuation method

Step 1. Set initial condition A0, x0, λ0, t0 = 0, times = 0

Set count = the number of possible solution

Step 2. Compute the tangent vector ( ˙xi, ˙λi)

Compute the predict vector (x(1)_i+1, λ(1)_i+1) = (xi, λi) + ∆t( ˙xi, ˙λi)

ti+1= ti+ ∆t Step 3. Compute δ = −Dx,λF1(x (1) i+1, λ (1) i+1, ti+1)−1F1(x (1) i+1, λ (1) i+1, ti+1)

(x(2)_i+1, λ(2)_i+1) = (x(1)_i+1, λ(1)_i+1) + δ

If (||F1(x (1) i+1, λ (1) i+1, ti+1)||/||F1(x (2) i+1, λ (2)

i+1, ti+1)|| <10 and

||F1(x

(2) i+1, λ

(2)

i+1, ti+1)|| > 10−8acc and ||δ|| > 10−7)

∆t = ∆t/2, back to (xi, λi, ti), times=0

if (∆t < 10−16)

stop the algorithm end go to step 2 else go to step 4 end Step 4. While(||F1(x (l) i+1, λ (l)

i+1, ti+1)|| > 10−8acc)

Compute δ = −Dx,λF1(x (l) i+1, λ (l) i+1) −1_F 1(x (l) i+1, λ (l) i+1)

(x(l+1)_i+1 , λ(l+1)_i+1 , ti+1) = (x(l)i+1, λ (l) i+1, ti+1) + δ If(||F1(x (l) i+1, λ (l) i+1, ti+1)||/||F1(x (l+1) i+1 , λ (l+1) i+1 , ti+1)|| < 5) If ||δ|| < 10−7

break the while loop and go to step 5 else

∆t = ∆t/2 ,back to (xi, λi, ti),times=0

if (∆t < 10−16)

stop the algorithm end go to step 2 end end end Step 5. If (times≥ 2) times=0, ∆t = 2∆t else times = times+1 end

(xi+1, λi+1, ti+1) = (x

(l) i+1, λ

(l) i+1, ti+1)

acc = p||Dx,λF1(xi+1, λi+1, ti+1)||||(xi+1, λi+1)||

If (acc < 1) acc=1 end If t = 1

stop the algorithm

(19)

4.2 Other numerical method

In this subsection, other algorithms [8] that compute the positive eigenpair of nonnegative irreducible matrices will be introduced.

Power method

Power method is a simple iteration method to compute the eigenpair of matrices.

It is developed by the matrix limits. Let A ∈ Cn×n _{and (λ}

i, xi) ∈ C × Cn is the

eigenpair of A, where i = 1, 2, . . . , n and |λ1| > |λ2| ≥ |λ3| ≥ . . . ≥ |λn|. We can

use the following algorithm to compute (λ1, x1). And the convergent rate is base

on |λ2/λ1|.

Algorithm 2 Power method

1. Set q(0) _{∈ C}n×n 2. For i=1,2,. . . z(k)= Aq(k−1) q(k)_{= z}(k)_/||z(k)_|| λ(k) _{= [q}(k)_]H_Aq(k) end

Noda’s algorithm

The main concept of Noda’s algorithm [8] is the inverse iteration. For a

nonnega-tive irreducible matrix A ∈ Rn×n, we let λ(0) be an approximation eigenvalue of A

with λ(0) _{> ρ(A) and x}(0) _{∈ R}n

>0be a unit random vector. Let (λ(i), x(i)) be the

ap-proximation eigenpair of A. For each iteration, Noda’s algorithm keeps λ(i)_{> ρ(A)}

and λ(i) is decreasing. Otherwise, (λ(i)I − A)−1x keeps positive. Finally, (λ(i), x(i))

(20)

Algorithm 3 Noda’s algorithm

1. Set initial eigenpair (λ(0)_{, x}0_{) ∈ R × R}n_{, where λ}(0) _{> ρ(A). And let ¯}_λ(0) _{= λ}(0)_.

2. For k = 0,1,2,. . . x∗(n+1) = (¯λ(n)I − A)−1x(n) τ(n+1) _{=< x}∗(n+1)_{, v > / < x}(n)_{, v >} x(n+1) _{= x}∗(n+1)_/τ(n+1) ¯ τ = maxk((x∗(n+1))k/x (n) k ), τ = mink((x∗(n+1))k/x (n) k ) ¯ λ(n+1) _{= ¯}_λ(n)_{− 1/¯}_τ(n+1)_{, λ}(n+1) _{= ¯}_λ(n)_{− 1/τ}(n+1) λ(n+1) _{= ¯}_λ(n)_{− 1/τ}(n+1)

if ¯λ(n+1)− λ(n+1) < stop the algorithm end

Prakash’s algorithm

Prakash Chanchana presented the algorithm in 2007 [8]. The main concept of the algorithm are Collatz’s formula and the inverse iteration. The most interesting is that Prakash’s algorithm used the LU-factorization to solve the linear system twice.

Algorithm 4 Prakash’s algorithm

1. Let x(0)_{∈ R}n

>0, λ(0) = maxi(

Pn

j=1aij) and B(i) = (λ(0)I − A)−1

2. For i = 1,2,. . .

(λ(i)_{I − A) = L}(i)_U(i)

L(i)_U(i)_x_ˆ(i) _{= x}(i)

L(i)U(i)x(n+1) = ˆx(i) ¯

λ(i) = λ(i)− minj(

ˆ x(i)_j B(i)_ˆ_x(i)

j

), λ(i) = λ(i)− maxj(

ˆ x(i)_j B(i)_ˆ_x(i) j ) λ(n+1) _{= ¯}_λ(n)

If ¯λ(i)− λ(i) < stop the algorithm

end

4.3 Compare the numerical results

In this subsection, we will present the numerical results of homotopy continuation method, including the behavior of the residual and the numbers of the positive

elements of x of F1(x, λ, t) = 0 for t ∈ [0, 1]. Each iteration includes a

predic-tion process and a correct process. The step length is ∆t = 0.1. Each iterapredic-tion

is required to stop by the stop criteria with tolerance = 10−10. Otherwise, we

also compare the numerical results with other algorithms in section 4.2. For con-venience, ”Continuation” is the homotopy continuation method, ”Power” is the

(21)

power method, ”Noda” is the Noda’s algorithm, and ”PC” is the Prakash’s al-gorithm. ”Count” is the number of solving linear system. Following numerical results are experimented by Matlab 2015b on an Intel(R) Core(TM) i5-8250 CPU @1.80 GHz with 4GB RAM and NVIDIA GeForce MX150.

We will construct the table to compare the different algorithms. The first row is the number of iterations. If the number of iterations is bigger than 2000, than we will stop the algorithm. The second row is the number of computing linear system.

The third row is the residual of F1(x, λ, 1). The fourth row is the subtraction with

the solution of each algorithm and the solution that be computed by the command ”eig” in Matlab.

Example 3. Let A, A0 ∈ Rn×n≥0 be random matrix, where A0 = x1xT2 with x1, x2 ∈

Rn>0. A is the case dwt 992 in [2] and A is a sparse symmetric matrix with

n = 992 and 16744 nonnzero elements. In Figure 3(a), it shows that the behavior

of the residual of F1(x, λ, t) = 0 for t ∈ [0, 1] and the homotopy continuation

method can compute the positive eigenvector correctly. In Figure 3(b), it shows that the numbers of the positive elements of the eigenvector for each iteration.

Figure 3: Compute for case in dwt 992

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 t 0 0.5 1 1.5 2 2.5 3 3.5 Residual of F1 ×10-14 (a) Residual of F1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 t 991 993

numbers of positive elements

(b) Numbers of positive elements of eigenvec-tor.

We can see that the homotopy continuation method can get the positive

eigen-pair successfully with the tolerance 10−10 but it need more computations in solving

linear systems. The power method presents the poor behavior of convergence

be-cause the rate of |λ2/λ1| = 0.9904.

(22)

Table 1: Compare for case in dwt 992

Continuation Power Noda PC

Iteration 5 816 8 3

Count 22 0 8 3

Residual 3.2041e−15 9.8707e−11 4.3471e−11 6.6069e−13

Difference 1.7091e−14 2.1754e−10 4.3478e−11 1.1316e−12

a p-cycle block matrix if it satisfies

A =        0 A1 0 . . . 0 0 0 A2 . . . 0 .. . ... ... . .. . . . 0 0 0 . . . Ap−1 Ap 0 0 . . . 0        , (4.1)

where Ai ∈ Cn×n for i = 1 . . . p. Let λ be the eigenvalue of A and

βk = ei2kπ/p, k = 0, 1, . . . p. (4.2)

In [6], it says that βkλ is also the eigenvalue of A. So the eigenvalue of A is

peri-odic with period 2π/p. It is clearly to see that the power method can not compute

the eigenpair of A. Now we let Ai ∈ Rn×n>0 for i = 1, 2 . . . , p and A be constructed in

(4.1). A is irreducible, there is a positive eigenpair of A by the Perron-Frobenius theorem. We let n = 50, p = 20 and get the numerical result like in example 3. Excluding the power method, each iteration can compute the positive eigenpair suc-cessfully and steadily.

Table 2: Compare for block-cyclic matrix

Continuation Power Noda PC

Iteration 5 > 2000 7 4

Count 17 0 7 4

Residual 2.0944e−15 0.3298 3.8447e−15 3.2600e−15

Difference 2.1936e−14 0.0647 2.2235e−14 2.2136e−14

Remark 4.1. The eigenvalue of A in (4.1) is the eigenvalue of Qp

i=1Ai with a

(23)

Figure 4: Compute for block-cyclic matrix 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 t 0 0.5 1 1.5 2 2.5 3 3.5 4 Residual of F1 ×10-13 (a) Residual of F1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 t 999 1001

numbers of positive elements

(b) Numbers of positive elements of eigen-vector.

So, the homotopy continuation method can compute the nonnegative eigen-pair of nonnegative irreducible matrices and the solution of each iteration keeps positive.

5 Preliminaries for tensors

Here we use calligraphic letters, like A, to denote tensors. A mth-order tensor

A ∈ Cn1×n2×...×nm _{is a multidimensional array. In fact, a vector and a matrix are}

the first-order tensor and second-order tensor, respectively. If m ≥ 3, A is called

the higher-order tensor. If n1 = n2 = ... = nm = n, we called A is a mth-order,

n-dimensional tensor and denote the set of all mth-order, n-dimensional tensor

over real(complex) field R(C) by R[m,n]_(C[m,n]_{). We let A ∈ C}n1×n2×...×nm _{be a}

tensor and the element (i1, i2, ..., im) of A is denoted by ai1i2...im.

5.1 Tensor notations and operations

Both matrices and tensors can operate with vectors. However, the computation of tensors is more complex than matrix. So, we usually transform a tensor into a matrix and the following definition is very useful for us [13].

Definition 5.1. Let A ∈ Cn1×n2×...×nm _{be a tensor. The mode-k fiber is that fixing}

every index of ai1i2...im except the k-th element. The mode-k unfolding is denoted

by A(k) ∈ Cnk×(

Qm

i6=kni) _{and arranges the k-fibers to be the columns of the matrix}

(24)

Example 5. We use the example in [13]. Let a tensor A ∈ R3×4×2 _{exist, where} A::1 =   1 4 7 10 2 5 8 11 3 6 9 12  , A::2 =   13 16 19 22 14 17 20 23 15 18 21 24  .

Following are the mode-k unfoldings for k = 1, 2, 3.

A(1)=   1 4 7 10 13 16 19 22 2 5 8 11 14 17 20 23 3 6 9 12 15 18 21 24   A(2) =     1 2 3 13 14 15 4 5 6 16 17 18 7 8 9 19 20 21 10 11 12 22 23 24     A(3) = 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24

Next, we introduce the Kronecker product. Kronecker is an useful tool to compute on tensors [13].

Definition 5.2. Let A ∈ Cm×n, B ∈ Cs×t, then the Kronecker product of A and

B is A ⊗ B =      a11B a12B . . . a1nB a21B a22B . . . a2nB .. . ... . .. ... am1B am2B . . . amnB      ∈ Cms×nt (5.1)

Next, it is about the k-mode product with vector of tensors.

Definition 5.3. For a tensor A ∈ Cn1×n2×...×nm_{, the k-mode product of A with a}

vector x ∈ Cnk _{is denoted by A ×} kx ∈ Cn1×n2×...×nk−1×nk+1...×nm, and we have (A ×kx)i1...ik−1ik+1...im = nk X ik=1 ai1i2...imxik. (5.2)

Example 6. We use the tensor A in example 5. Let x = [1, 2, 3, 4]T. Then

A ×2x =   70 190 80 200 90 210  = A(1)(I2⊗ x),

(25)

5.2 Eigenvalue of tensors

Let A ∈ C[m,n] _{and x ∈ C}n_{. We denote Ax}m−1 _{= A ×}

2x ×3x . . . ×mx. Following

definition is about the eigenvector and eigenvalue on tensors [12][15].

Definition 5.4. Let A ∈ C[m,n]_{. If the pair (λ, x) ∈ C × C}n_{\ {0} satisfies}

Axm−1 _{= λx}[m−1]_, _(5.3)

then λ is called the eigenvalue of A corresponding to eigenvector x. Where x =

[x1, x2, ...xn]T and x[m−1] = [x1m−1, xm−12 , ..., xm−1n ]T. If (λ, x) ∈ R × Rn, then we

call (λ, x) is H-eigenpair.

Next definition is about the spectral radius of tensors.

Definition 5.5. Let A ∈ C[m,n]. We let

%(A) = max{ |λ| |λ is an eigenvalue of A.} (5.4)

and %(A) is called the spectral radius of A.

Solving an eigenvalue problem on tensors is a homogeneous problem. In [12], it defined the characteristic polynomial of a tensor A as det(A − λI) by using the resultant, where each element of I is

ii1i2...im =

1, if i₁ = i2 = ...im

0, others. In [12], it also showed the following theorem.

Theorem 5.1. Let A ∈ C[m,n] be a tensor. And the det(A − λI) can be written

as

det(A − λI) = Y

λi∈σ(A)

(λ − λi)mi, (5.5)

where σ(A) = {λ|λ is the eigenvalue of A.} and mi is the algebraic multiplicities

of λi.

In addition to algebraic multiplicity, the geometric multiplicity on tensors is also defined. Next definition is about the geometric multiplicity of tensors [5].

Definition 5.6. Let A ∈ C[m,n] and λ be the eigenvalue of A. Then the

geomet-ric multiplicity of λ is the maximum number of linear independent eigenvector corresponding to λ.

(26)

Example 7. Let A ∈ R[3,2]≥0 , where a111 = a222 = 1, a122 = a211 = for 0 < < 1

and other aijk = 0. The eigenvalue of A will satisfy

x2₁+ x2₂ = λx2₁ x2₁+ x2₂ = λx2₂

We obtain det(A − λI) = (λ − 1 + )2_{(λ − 1 − )}2_{. And we get the following results:}

(a). λ = 1 + with eigenvector x1 = [1, 1]T, x2 = [1, −1]T. The geometric

multi-plicity of λ is 2. And x1, x2 are the H-eigenvectors corresponding to λ.

(b). λ = 1 − with eigenvector x1 = [1, i]T, x2 = [1, −i]T. The geometric

multi-plicity of λ is 2. But x1, x2 are complex eigenvectors.

5.3 Symmetric and semi-symmetric tensors

Symmetric tensors are very useful for computing the eigenvalue problems on ten-sors [15].

Definition 5.7. Let A ∈ C[m,n] _{be a tensor and A is symmetric if it satisfies}

ai1i2...im = aj1j2...jm, (5.6)

where (j1, j2, . . . , jm) ∈ Θ(i1, i2, . . . , im) and Θ(i1, i2, . . . , im) is the set of all the

permutation of (i1, i2, . . . , im).

Following definition is the semi-symmetric tensor. The difference between sym-metric tensor and semi-symsym-metric tensor is about the first index.

Definition 5.8. Let A ∈ C[m,n] be a tensor and A is semi-symmetric if it satisfies

ai1i2...im = ai1j2...jm, (5.7)

where (j2, j3, . . . , jm) ∈ Θ(i2, i3, . . . , im) and Θ(i2, . . . , im) is the set of all the

per-mutation of (i2, . . . , im)

Remark 5.1. Let A ∈ C[m,n] _{be tensor. If A is symmetric then A is}

semi-symmetric.

In [9] [1], let A ∈ C[m,n] _{and there is a semi-symmetric tensor A}

s∈ C[m,n] such that Axm−1 _{= A} sxm−1. It is efficient to compute Dx(Axm−1) = m X k=2 A ×2x... ×k−1x ×k+1... ×mx ∈ Rn×n. (5.8)

(27)

We can get Dx(Axm−1) = Dx(Asxm−1) and Dx(Asxm−1) = m X k=2 As×2x... ×k−1x ×k+1... ×mx (5.9) = (m − 1)As×3x ×4 x . . . ×mx ∈ Rn×n. (5.10)

The equation (5.8) is equivalent to equation (5.10).

5.4 Irreducible tensors

Following definition is about the irreducible tensors[15] [5].

Definition 5.9. A tensor A ∈ R[m,n] is called reducible if there is a nonempty

proper subindex I ⊂ {1, 2, ...n} such that

a1i2...im = 0, ∀i1 ∈ I, ∀i2, i3...im ∈ I./

If A is not reducible, then A is irreducible.

The tensor A in Example 7 is an irreducible tensor. If m = 2, the definition of irreducible tensors is equivalent to the definition of irreducible matrices.

5.5 Rank-1 tensors

Next definition is about the rank-1 tensors. Rank-1 tensors play an important role in tensor decomposition and rank computing [13] .

Definition 5.10. A tensor A ∈ Rn1×n2×...×nm _{is rank-1 if it can be written as the}

outer product of m nonzeros vectors xk ∈ Rnk for k = 1, 2, ...m, and it is denoted

by A = x1◦ x2... ◦ xm. Each element of A is

ai1,i2,...im = x1,i1x2,i2...xm,im, (5.11)

where xk,ik is the ik-th element of vector xk.

In [14], it recorded the theorem about the H-eigenvalues of rank-1 tensors.

Lemma 5.1. Let A0 = x1 ◦ x2◦ ... ◦ xm ∈ R [m,n] ≥0 , where x1, x2, ...xm ∈ Rn≥0. Let λ0 = Qm k=2x T kx [1/(m−1)] 1 and x0 = x[1/(m−1)]1

kx[1/(m−1)]₁ k. Then (λ0, x0) and (0, w) with w ∈

Sm

k=2span{xk}

⊥ _{are H-eigenpairs. In addition, if x}

1 > 0, then the eigenvalue

λ0 > 0 with positive eigenvector x0. Furthermore, if x1, x2, ...xm ∈ Rn>0, then

cx ∈ Rn

>0 with c > 0 is the unique nonnegative eigenvector of A0.

So, we can find an unique positive eigenpair of a positive rank-1 tensor. To construct the linear homotopy like section 3, it is useful for us to set the initial condition.

(28)

6 Perron-Frobenius theorem on tensors

In this section, we will prove the Perron-Frobenius theorem on tensors. Let A ∈ R[m,n]≥0 be an irreducible tensor. And we focus on finding the positive H-eigenpair

of A. Here we let R[m,n]>0 (R

[m,n]

≥0 ) be the set of all real positive(nonnegative)

mth-order, n-dimension tensors.

6.1 Analysis of nonnegative irreducible tensors

Following theorem is the Perron-Frobenius theorem on tensors [5].

Theorem 6.1. (Perron-Frobenius Theorem on tensors) Let A ∈ R[m,n] _be

irre-ducible, then

(a). There is a H-eigenvalue λ0 = %(A) > 0.

(b). There is a H-eigenvector x ∈ Rn

>0 corresponding to λ0.

(c). If λ is an eigenvalue with nonnegative eigenvector y, then λ = λ0. Moreover,

y = kx, where k is a constant.

Remark 6.1. Some different of Perron-Frobenius theorem on tensors is that λ0 in

Theorem 6.1 may not be simple. In Example 7, A is irreducible and we can find the λ = 1 + is the %(A) with the algebraic multiplicity 2 and two H-eigenvectors. No matter the Perron-Frobenius theorem on matrices or tensors, the positive eigen-vector is unique up to a constant.

In fact, we can prove Theorem 6.1 by obtaining the positive eigenpair of any

nonnegative irreducible tensors. First, there is no H-eigenpair in ∂(Rn+1≥0 ), where

∂(Rn+1≥0 ) is the boundary of Rn+1≥0 .

Lemma 6.1. Let A ∈ R[m,n]≥0 be irreducible. If x ∈ Rn≥0 is a H-eigenvector of A,

then x ∈ Rn>0. Furthermore, there is a λ > 0 and (λ, x) is the H-eigenpair of A.

Proof. Let x ∈ Rn

≥0 be a H-eigenvector with some subset I ⊂ {1, 2, ...n} such that

xi = 0 ∀i ∈ I and xj 6= 0 ∀j /∈ I. We have

n

X

i2,i3,...im=1

ai1i2...imxi2xi3...xim = 0 ∀i1 ∈ I.

Since xi 6= 0 ∀ i /∈ I and it implies ai1,i2,...im = 0 with ∀ i1 ∈ I. It implies

ai1,i2,...im = 0 with ∀i1 ∈ I, ∀ i2. . . im ∈ I. We obtain A is reducible, a contradic-/

tion. So, x ∈ Rn

>0and we can get that there is a H-eigenvalue λ > 0 corresponding

(29)

Next, we discuss the H-eigenvalue of A. Here we use a theorem in [5] and the proof can also be found in [5].

Theorem 6.2. Let A ∈ R[m,n]≥0 be irreducible. If (λ, x), (µ, y) ∈ Rn+1≥0 satisfies

Axm−1 _{= λx}[m−1] _{and Ay}m−1 _{≥ µy}[m−1]_(Aym−1 _{≤ µy}[m−1]_{), then λ ≥ µ(λ ≤ µ).}

So, the following theorem can be obtained.

Theorem 6.3. Let A ∈ R[m,n]≥0 be irreducible. If x ∈ Rn>0 is a H-eigenvector of A

corresponding to λ, then λ = %(A) > 0 is a H-eigenvalue of A.

Proof. Let (u, y) ∈ Rn+1 _{be an H-eigenpair of A, where |u| = %(A) and y is a}

nonzero vector. Then

%(A)|y|[m−1]= |uy[m−1]| = |Aym−1_{| ≤ A|y|}m−1_.

So, by Theorem 6.2, %(A) ≤ λ. Since λ ≤ %(A) is clearly to see, and we have done λ = %(A).

Now, we know that there is no H-eigenpair on ∂(Rn+1≥0 ) for nonnegative

ir-reducible tensors. If there is a H-eigenvector x ∈ Rn

>0 of an irreducible tensor

A ∈ R[m,n]≥0 , then λ = %(A) is the H-eigenvalue corresponding to x. The

unique-ness of positive H-eigenvector will be discussed in later subsection by the implicit function theorem.

6.2 Proof of Theorem 6.1

We use the skill in [14]. Let A0 = x1◦ x2◦ ... ◦ xm ∈ R

[m,n] >0 be a rank-1 tensor, where x1, x2, ..., xm ∈ Rn>0. Let x0 = x[1/(m−1)] ||x[1/(m−1)]_||, λ0 = m Y k=2 (xTx1/(m−1)₁ ). (6.1)

By Lemma 5.1, (λ0, x0) is the unique positive H-eigenpair of A0. Now, we let

A(t) = (1 − t)A0+ tA, ∀t ∈ [0, 1]. (6.2)

We consider the following linear homotopy

F2(x, λ, t) =

A(t)xm−1 _{− λx}[m−1]

xT_{x − 1}

= 0, ∀t ∈ [0, 1]. (6.3)

Differentiating the F2(x, λ, t), and the Jacobian matrix of F2(x, λ, t) can be

ob-tained. The Jacobian matrix is denoted as DF2(x, λ, t) = [Dx,λF2(x,λ,t)|DtF2(x, λ, t)]

(30)

Dx,λF2(x, λ, t) = At− (m − 1)λ[|x[m−2]|] −x[m−1] 2xT 0 ∈ R(n+1)×(n+1) (6.4a) DtF2(x, λ, t) = (A − A0)xm−1 0 ∈ Rn+1_. _(6.4b)

Here [|x|] is the square diagonal matrix with the elements of x on the main diagonal and At ≡ Dx(A(t)xm−1) = m X k=2 A(t) ×2x... ×k−1x ×k+1... ×mx ∈ Rn×n. (6.5)

Suppose (x∗, λ∗, t∗) be the solution of F2(x, λ, t) = 0. The fact is

At∗x∗ = (m − 1)A(t∗)x m−1 ∗ = (m − 1)λ∗x[m−1]∗ , and (At∗− (m − 1)λ∗[|x [m−2] ∗ |])x∗ = 0. (6.6)

So, the following lemma can be obtained.

Lemma 6.2. Let (x∗, λ∗, t∗) ∈ Rn+1≥0 × [0, 1) be a solution F2(x, λ, t) = 0. Then

the Jacobian matrix Dx,λF2(x∗, λ∗, t∗) is invertible.

Proof. For t∗ ∈ [0, 1), A(t∗) > 0 is irreducible. So, (λ∗, x∗) must be positive. And

the Jacobian matrix Dx,λF2(x, λ, t) can be get as follow

Dx,λF2(x∗, λ∗, t∗) = At∗− (m − 1)λ[|x [m−2] ∗ |] −x[m−1]∗ 2xT∗ 0 ∈ R(n+1)×(n+1)_.

Since A∗ > 0 and x∗ > 0, we can get At∗ > 0 directly. Let Q =

1 m−1[|x [m−2] ∗ |]−1 And Q 0 0T ₁ At∗− (m − 1)λ[|x [m−2] ∗ |] −x[m−1]∗ 2xT ∗ 0 = QAt∗ − λ∗I − 1 m−1x∗ 2xT ∗ 0 = T ∈ R(n+1)×(n+1).

Q is the diagonal matrix and At∗ is positive matrix. So, QAt∗ > 0 and (λ∗, x∗) is

the positive eigenpair of QAt∗. By Lemma (3.2)(e), T is invertible.

Q 0

0T 1

is

(31)

For t = 0, A(0) = A0 and (x0, λ0, 0) is the positive solution of F2(x, λ, t) = 0.

By Lemma 6.2 and the implicit function theorem, there is a solution curve s2(t) =

(x(t), λ(t), t) ∈ Rn+1≥0 × [0, 1) that is parametrized by t for t ∈ [0, 1). We let the

following set

C2 = {(x(t), λ(t), t)|t ∈ [0, 1)}, (x(0), λ(0), 0) = (x0, λ0, 0) (6.7)

be the set of s2(t). The following theorem can be obtained.

Theorem 6.4. C2 is the solution set of H2(x, λ, t) = 0 in Rn+1≥0 × [0, 1).

Proof. If (x∗, λ∗, t∗) ∈ Rn+1≥0 ×[0, 1) is a solution of F1(x, λ, t) = 0 and (x∗, λ∗, t∗) 6=

(x(t∗), λ(t∗), t∗) ∈ C2. By Lemma 6.2 and the implicit function theorem, there is

a solution curve s∗₂ = (x∗(t), λ∗(t), t) can be parameterized by t for t ∈ [0, t∗] with

(x∗(t∗), λ∗(t∗), t∗) = (x∗, λ∗, t∗). Let C₂∗ be the set of s1∗(t), where

C₂∗ = {(x∗(t), λ∗(t), t)|t ∈ [0, t∗]} with (x∗(t∗), λ∗(t∗), t∗) = (x∗, λ∗, t∗).

We can obtain C2∩ C2∗ = ∅ by Lemma 6.2 and the implicit function theorem. By

Lemma 6.1, C2 ⊆ Rn+1≥0 ×[0, 1). Since (x∗(0), λ∗(0), 0) is the solution F2(x, λ, t) = 0

and the rank-1 matrix A0 ∈ Rn×n>0 is irreducible, we obtained (λ∗(0), x∗(0)) is the

eigenpair of A0 and (λ∗(0), x∗(0)) = (λ0, x0) by Lemma 5.1. It is a contradiction

because C2 ∩ C2∗ = ∅.

For t ∈ [0, 1), ||x(t)|| = 1 and A(t) is bounded. Since A(t) ∈ R[m,n]≥0 is

irre-ducible and (λ(t), x(t)) ∈ Rn+1≥0 is the eigenpair of A(t) for t ∈ [0, 1). By Lemma

6.3, λ(t) = %(A(t)) and we get ||(λ(t), x(t))|| is bounded for t ∈ [0, 1). Suppose

(x∗, λ∗, 1) is an accumulation point for C2 and we know (λ∗, x∗) is the H-eigenpait

of A. The H-eigenvalues of tensors had been shown are the roots of nonzero

polynomial by Theorem 5.1. So, λ∗ is isolated and limt→1−λ(t) = λ_∗. Let

Γ2 = {x ∈ Rn≥0|(x∗, λ∗, 1) is an accumulation point of C2.}. (6.8)

We get Γ2 is connected and the following theorem.

Theorem 6.5. Let A ∈ R[m,n]≥0 be irreducible. Then the nonnegative solution of

F2(x, λ, 1) = 0 is isolated.

Proof. Let x∗ ∈ Γ2 and we know that (x∗, λ∗, 1) is the nonnegative solution of

F2(x, λ, t) = 0. Since A is irreducible, we obtain (λ∗, x∗) ∈ Rn+1_>0 . The Jacobian

matrix Dx,λF2(x∗, λ∗, 1) is Dx,λF2(x∗, λ∗, 1) = A1− (m − 1)λ[|x [m−2] ∗ |] −x[m−1]∗ 2xT∗ 0 ∈ R(n+1)×(n+1)_.

(32)

Now, we prove A1is irreducible. Suppose A1is reducible , then there is a nonempty

subset S ⊂ {1, 2, . . . , n} such that (A1)i,j = 0, ∀i ∈ S, ∀j /∈ S. We obtain

(A1)i,j = ( m X k=2 A ×2x∗. . . ×k−1x∗×k+1x∗. . . ×mx∗)ij = m X k=2 n X i2,i3,...,im=1

aii2...i_ik−1ji_ik+1...imxi2. . . xik−1xik+1. . . xim = 0,

∀ i ∈ S, ∀j /∈ S. Since x∗ ∈ Rn>0, we get

aii2...im = 0, ∀ i ∈ S.

And it implies

aii2...im = 0, ∀ i ∈ S, ∀ i2, . . . im ∈ S./

So, A is reducible, a contradiction. Next, we let Q = _m−11 [|x[m−2]∗ |]−1 and we can

know Q 0 0T ₁ A1− (m − 1)λ[|x [m−2] ∗ |] −x[m−1]∗ 2xT ∗ 0 = QA1− λ∗I − 1 m−1x∗ 2xT_∗ 0 = T ∈ R(n+1)×(n+1).

Since Q is diagonal matrix and A1 is irreducible matrix. We obtain QA1 is

irre-ducible and (λ∗, x∗) is the positive eigenpair of QA1. By Lemma (3.2)(e), T is

invertible. Since

Q 0

0T ₁

is invertible, and it implies Dx,λF2(x∗, λ∗, 1) is

invert-ible. We know that the nonnegative solution of F2(x, λ, 1) = 0 is isolated. And

limt→1−(λ(t), x(t)) = (λ_∗, x_∗).

Theorem 6.5 shows that given a irreducible tensor A ∈ R[m,n]≥0 and we can obtain

the unique positive H-eigenpair (λ∗, x∗) of A by tracing the solution curve with

the initial rank-1 tensor A0 and the (λ0, x0) that is defined in (6.1). So, we have

proven the Theorem 6.1.

7 Numerical experiments on tensors

In this section, the homotopy continuation method will be used to compute the positive eigenpair on the nonnegative irreducible tensors. We use the algorithm

4.1 and replace F1(x, λ, t) by F2(x, λ, t) that is defined in (6.3). The equipment of

(33)

7.1 Jacobian matrix

The difficulty is to present the Jacobian matrices of F2(x, λ, t) that are defined

in (6.4a) and (6.4b). In fact, we always translate a tensor into matrix. Not only is is more familiar to understand, but also it is more efficient to execute tensor operations. So, we can use the Kronecker product to get

Axm−1 _{= A}

(1)(x ⊗x . . . ⊗

| {z }

m−2 times

x) ∈ Rn. (7.1)

At= Dx(A(t)xm−1) in (6.5) and we know there is a semi-symmetric As(t) ∈ R

[m,n] ≥0

such that A(t)xm−1 = As(t)xm−1. From (5.10), we can get

At= Dx(A(t)xm−1) = (m − 1)As(t) ×3x ×4 x . . . ×mx (7.2)

= (m − 1)As(1)(t)(I ⊗x . . . ⊗

| {z }

m−2 times

x). (7.3)

There is a simple way to get the semi-symmetric tensor As(t) [1]. Each element

of As(t) can be obtained by as(t)i1,i2,...,im = 1 (m − 1)! X j2,j3,...jm a(t)i1j2,...,jm, (7.4)

where (j2, j3, . . . , jm) is any permutation of (i2, i3, . . . , im).

7.2 Other algorithms for tensors

NQZ algorithm

The NQZ was developed by M. Ng, L. Qi and G. Zhou in 2009 [7]. It is like a power method with Collatz formula on tensors and it guarantees to compute the positive eigenpair for nonnegative irreducible tensors. The convergence of NQZ algorithm is linear.

Newton-Noda iteration (NNI)

We let r(x, λ) = λx[m−1]− Axm−1, f (x, λ) = −r(x, λ) 1 2(1 − x T_x) . (7.5)

The idea of NNI [1] is that using the Newton method to solve f (x, λ) = 0 and keeping the positive of each solution like Noda’s algorithm. The Jacobian matrix of f (x, λ) is Df (x, λ) = − Dxr(x, λ) x [m−1] xT 0 , (7.6)

(34)

Algorithm 5 NQZ algorithm

Step 1. For a irreducible tensor A ∈ R[m,n]≥0 , choosing x0 ∈ Rn>0. Let y0 = Axm−1

Step 2. For k = 0,1,... Compute xk+1 = _||yyk k|| yk+1= Axm−1 λ = minxi>0( yi xi) ¯ λ = maxxi>0( yi xi) Step 3. If ¯λ − λ <

stop the algorithm else

go to step 2. end

where

Dxr(x, λ) = (m − 1)λ[|x[m−2]|] − Dx(Axm−1) (7.7)

and Dx(Axm−1) is defined in (6.5). The key of the algorithm is the chosen of θ ∈

(0, 1] and we let θ = 1 in the following tests. It guarantees the global convergence and the convergence of NNI is quadratic near the solution.

Algorithm 6 NNI

Step 1. Given a unit random vector x0 ∈ Rn>0, ¯λ0 = max(

Axm−1₀ x[m−1]₀ ), and = 10 −10 Step 2. For k = 0, 1, 2... Solve Dxr(xk, ¯λk)wk= x [m−1] k yk= wk/||wk|| Choose θk > 0 ˆ xk+1= (m − 2)xk+ θkyk xk+1= ˆxk+1/||ˆxk+1|| ¯ λk+1 = max( Axm−1 k+1 x[m−1]_k+1 ), λk+1 = min( Axm−1 k+1 x[m−1]_k+1 ) Step 3. If ||¯λk+1− λk+1||/¯λk+1 <

stop the algorithm else

go to Step 2. end

7.3 Compare the numerical results on tensors

Following table is the numerical results for each size of tensor. (m,n) is the order and the dimension of tensor A, respectively. ”Continuation” is the homotopy

(35)

Table 3: Random tensor

Size Continuation NQZ NNI

m n Iter Num Res Num Res Num Res

3 50 5 20 6.4874e−11 6 1.2648e−12 4 6.0991e−12

60 5 20 1.8723e−11 6 1.1783e−12 4 1.7212e−11

70 5 20 2.3022e−11 6 1.1715e−12 4 1.5664e−12

80 5 20 8.5636e−11 6 8.5676e−13 4 3.0801e−11

90 5 20 1.7804e−11 6 5.8902e−13 4 3.1113e−12

100 5 20 6.6102e−11 6 6.6187e−13 4 4.7704e−12

4 50 5 19 2.2976e−11 6 1.5369e−12 4 3.2836e−11

60 5 19 5.2278e−11 6 1.832e−12 4 4.7681e−11

70 5 19 9.1175e−11 6 1.9992e−12 4 8.5072e−11

80 5 19 1.4789e−11 6 2.5718e−12 4 9.9697e−11

90 5 19 2.2007e−11 6 3.1720e−12 4 2.9086e−11

100 5 19 2.9606e−11 6 3.1736e−12 4 2.9932e−11

continuation method. ”NQZ” is the NQZ algorithm. ”NNI” is the Newton-Noda iteration. ”Iteration” is the numbers of iteration. ”Res” is the residual. ”Num” is

the number of execution of Axm−1_{. If Num≥2000, we will stop the algorithm.}

Example 8. Let A ∈ R[m,n] _{be a random tensor. We examine the numerical}

results with m = 3, 4 and n = 50, 60, 70, 80, 90, 100. We choose A0 ∈ R

[m,n] >0

with all the elements of A0 are 1, x0 = √1_n[1, 1, . . . 1]T ∈ Rn and λ0 = nm−1 as

the initial condition for each algorithm. Each algorithm can compute the positive eigenpair successfully. In Table 3, the homotopy continuation method can compute

the solution accurately but it costs more evaluations of Axm−1 _{than NQZ and NNI.}

Because there are many predict and correct processes in homotopy continuation method.

Example 9. Let A ∈ R[m,n] = D + L be a signless Laplacian tensor with the

edge set E = {{i − m + 2, i − m + 3, . . . , i, i + 1}, i = m − 1, . . . , n} [11] [1] and

n+1 is defined with 1. The elements of D are zeros except the di1i2...im 6= 0 for

i1 = i2 = . . . = im. So, A is a sparse tensor. We choose A0 = _nm−11 E ∈ R

[m,n] >0 ,

where all the elements of E ∈ R[m,n]>0 are 1, x0 = √1_n[1, 1, . . . 1]T ∈ Rn and λ0 = 1

as the initial condition for each algorithm. We examine the size of tensor by m = 3, 4 and n = 50, 60, 70, 80, 90, 100. In Table 4, it is clear to see that the number of iterations of NQZ is more than the homotopy continuation method and NNI. The convergence of NQZ is poor. The NNI and the homotopy continuation method can compute to the positive solution successfully.

(36)

Table 4: A = D + L

Size Continuation NQZ NNI

m n Iter Num Res Num Res Num Res

3 50 11 64 1.7402e−16 1402 9.9359e−11 8 2.6009e−16

60 13 79 2.0582e−16 1971 9.9329e−11 8 4.3628e−16

70 13 79 2.6795e−16 2000 1.3644e−8 8 5.4009e−16

80 15 93 1.6959e−16 2000 3.8976e−7 8 1.6211e−12

90 15 93 1.8075e−16 2000 3.7315e−6 8 1.9874e−142

100 15 94 1.1344e−15 2000 1.8135e−6 9 7.4063e−16

4 50 9 54 2.8566e−11 757 9.881e−11 9 1.8967e−15

60 12 71 1.2655e−16 1069 9.9096e−11 9 1.2254e−14

70 12 72 6.2383e−16 1431 9.9514e−11 9 1.1820e−13

80 14 86 1.2461e−16 1842 9.9996e−11 9 4.1863e−14

90 13 82 5.1522e−12 2000 1.1576e−8 9 7.1314e−15

100 13 82 3.4720e−11 2000 2.0444e−7 10 1.5562e−15

8 Conclusion

We have proven the Perron-Frobenius theorem by a new way. By constructing

a linear homotopy with a nonnegative irreducible matrix A ∈ Rn×n_{, we solve it}

to obtain the positive eigenpair of A and the positive eigenpair of A is sufficient to prove the Perron-Frobenius theorem. Furthermore, the skill of the proof has been extended to prove the Perron-Frobenius theorem on tensors. Both numerical results on matrices and on tensors are presented. Whether on matrices or on tensors, we can compute the positive eigenpair by the homotopy continuation method.

References

[1] C.S. Liu, C.H. Guo, and W. W. Lin, NewtonNoda iteration for finding the Perron pair of a weakly irreducible nonnegative tensor, preprint, (2017). [2] DIMACS10: DIMACS10 test set and the University of Florida Sparse Matrix

Collection. https : //sparse.tamu.edu/DIM ACS10

[3] H.B. Keller, Lectures on Numerical Methods in Bifurcation problems, 1987. [4] K.C. Chang, K. Pearson, and T. Zhang, A survey on the spectral theory of

(37)

[5] K.C. Chang, K. Pearson, and T. Zhang, Perron-Frobenius theorem for non-negative tensors, Commum. Math. Sci., 6(2008) no. 1, 507-520.

[6] K Kontovasilis, RJ Plemmons and WJ Stewart, Block Cyclic SOR for Markov Chains With p-cyclic Infinitesimal Generator, 1991.

[7] M. Ng, L. Qi, and G. Zhou, Finding the largest eigenvalue of a nonnegative tensor, SIAM J. Matrix Anal. Appl., 31 (2009), pp. 10901099.

[8] P. Chanchana, An algorithm for computing the Perron root of a nonnegative irreducible matrices, 2007.

[9] Q. Ni and L. Qi, A quadratically convergent algorithm for finding the largest eigenvalue of a nonnegative homogeneous polynomial map, J. Global Optim., 61 (2015) pp. 627641

[10] R. A. Horn, and C. R. Johnson, (2012)Matrix analysis, Cambridge University Press.

[11] S. Hu, L. Qi and J. Xie, The largest Laplacian and signless Laplacian H-eigenvalues of a uniform hypergraph, Linear Algebra Appl., 469 (2015), pp. 127.

[12] S. Hu, Z.-H. Huang, C. Ling, and L. Qi, On determinants and eigenvalue theory of tensors, J. Symb. Comput., 50 (2013), pp. 508531.

[13] T.G. Kolda and B.W. Bader, Tensor Decompositions and Applications, SIAM Review, vol. 51, no. 3, pp. 455-500, September 2009.

[14] Y.C. Kuo, W.W. Lin and C.S. Liu, Continuation Method For Computing Z-/H-Eigenpairs of Nonnegative Tensors, arXiv preprint arXiv:1702.05841.

[15] Y. Wei, and W. Ding, Theory and Computation of Tensors :

多維陣列上的Perron-Frobenius定理及計算

國立高雄大學應用數學學系

碩士論文

多維陣列上的

Perron-Frobenius 定理及計算

Perron-Frobenius theorem and computation on

multidimensional arrays

研究生：林冠宇 撰

指導教授：郭岳承

致謝

首先，我要感謝指導教授郭岳承老師，在我的碩士期間教導

我許多矩陣理論及計算的相關知識，也在我遇到困難時給予適當

的建議及鼓勵我參與許多校外活動。感謝吳宗芳老師大學及碩士

期間的照顧，能在大學時期上你的課以及擔任你兩年的大學入門

助教可以說是十分的幸運。感謝系辦雅鳳姐及佩君姊這些年的行

政上的各種協助。感謝我的家人一路上精神的支持讓我得以完成

碩士的學位。感謝學弟妹平常與我聊天及分享，陪我度過在高大

應數最後的時光。最後感謝系上的一切，讓我在這裡留下許多美

好的回憶。

林冠宇 謹於

國立高雄大學應用數學系

Perron-Frobenius theorem and

computation on multidimensional

arrays

by

Guan-Yu Lin

Advisor

Yueh-Cheng Kuo

Department of Applied Mathematics,

National University of Kaohsiung

Kaohsiung, Taiwan 811, R.O.C.

June 2018

Contents

多維陣列上的

Perron-Frobenius 定理及計算

Perron-Frobenius theorem and computation on

multidimensional arrays

1

Introduction

2

Preliminaries for matries

2.1

Irreducible matries

2.2

Gershgorin’s disk theorem

2.3

Implicit function theorem

3

Perron-Frobenius theorem

3.1

Analysis of nonnegative irreducible matrices

3.2

Proof of Theorem 3.1

4

Numerical experiments

4.1

Homotopy Continuation method

Predict process

Correct process

Adjustment of the step length

4.2

Other numerical method

Power method

Noda’s algorithm

Prakash’s algorithm

4.3

Compare the numerical results

5

Preliminaries for tensors

5.1

Tensor notations and operations

5.2

Eigenvalue of tensors

5.3

Symmetric and semi-symmetric tensors

5.4

Irreducible tensors

5.5

Rank-1 tensors

6

研究生：林冠宇撰

林冠宇謹於