A neural network based on the generalized FB function for nonlinear convex programs with second-order cone constraints
Xinhe Miao (a,1), Jein-Shan Chen (b,*,2), Chun-Hsu Ko (c)

a Department of Mathematics, School of Science, Tianjin University, Tianjin 300072, PR China
b Department of Mathematics, National Taiwan Normal University, Taipei 11677, Taiwan
c Department of Electrical Engineering, I-Shou University, Kaohsiung 840, Taiwan

* Corresponding author. E-mail addresses: xinhemiao@tju.edu.cn (X. Miao), jschen@math.ntnu.edu.tw (J.-S. Chen), chko@isu.edu.tw (C.-H. Ko).
1 The author's work is also supported by the National Young Natural Science Foundation (No. 11101302) and the National Natural Science Foundation of China (No. 11471241).
2 The author's work is supported by the Ministry of Science and Technology, Taiwan.
Article info

Article history: Received 24 May 2015; received in revised form 26 January 2016; accepted 22 April 2016; available online 10 May 2016. Communicated by Ligang Wu.

Keywords: Neural network; Generalized FB function; Stability; Second-order cone

Abstract
This paper proposes a neural network approach to efficiently solve nonlinear convex programs with second-order cone constraints. The neural network model is designed via the generalized Fischer–Burmeister function associated with the second-order cone. We study the existence and convergence of the trajectory of the considered neural network. Moreover, we also establish its stability properties, including Lyapunov stability, asymptotic stability, and exponential stability. Illustrative examples give a further demonstration of the effectiveness of the proposed neural network. Numerical performance under perturbation of the parameter and numerical comparisons with other neural network models are also provided. Overall, our model performs better than the two comparative methods.
1. Introduction
The nonlinear convex program with second-order cone constraints (abbreviated as SOCP throughout this paper) is given as below:

$$\min\ f(x) \quad \text{s.t.}\quad Ax = b,\quad g(x) \in \mathcal{K}, \tag{1}$$

where $A \in \mathbb{R}^{m\times n}$ has full row rank, $b \in \mathbb{R}^m$, $f : \mathbb{R}^n \to \mathbb{R}$ is a twice continuously differentiable convex mapping, and $g = [g_1, \dots, g_l]^T : \mathbb{R}^n \to \mathbb{R}^l$ is a twice continuously differentiable $\mathcal{K}$-convex mapping, which means that for every $x, y \in \mathbb{R}^n$ and $t \in [0,1]$,

$$t\,g(x) + (1-t)\,g(y) - g\big(tx + (1-t)y\big) \in \mathcal{K},$$

and $\mathcal{K}$ is a Cartesian product of second-order cones (also called Lorentz cones), expressed as

$$\mathcal{K} = \mathcal{K}^{n_1} \times \mathcal{K}^{n_2} \times \cdots \times \mathcal{K}^{n_N}$$

with $N, n_1, \dots, n_N \ge 1$, $n_1 + \cdots + n_N = l$, and

$$\mathcal{K}^{n_i} := \big\{ (x_{i1}, x_{i2}, \dots, x_{in_i})^T \in \mathbb{R}^{n_i} \;\big|\; \|(x_{i2}, \dots, x_{in_i})\| \le x_{i1} \big\}.$$

Here $\|\cdot\|$ denotes the Euclidean norm and $\mathcal{K}^1$ means the set of nonnegative reals $\mathbb{R}_+$.
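As a concrete illustration of this constraint structure, the following sketch (ours, added for exposition; the function names and tolerance are assumptions, not from the paper) checks membership in a Cartesian product of second-order cones block by block:

```python
import numpy as np

def in_soc(x, tol=1e-12):
    """Check x = (x1, x2) in K^{n_i}, i.e., ||x2|| <= x1 (n_i = 1 means x1 >= 0)."""
    x = np.asarray(x, dtype=float)
    if x.size == 1:
        return x[0] >= -tol
    return np.linalg.norm(x[1:]) <= x[0] + tol

def in_product_cone(x, block_sizes):
    """Check membership in K = K^{n_1} x ... x K^{n_N} with n_1 + ... + n_N = l."""
    x = np.asarray(x, dtype=float)
    offsets = np.cumsum([0] + list(block_sizes))
    return all(in_soc(x[offsets[i]:offsets[i + 1]]) for i in range(len(block_sizes)))

# Example with l = 5 and K = K^3 x K^2:
print(in_product_cone([2.0, 1.0, 1.0, 1.0, 0.5], [3, 2]))  # True:  ||(1,1)|| <= 2 and |0.5| <= 1
print(in_product_cone([1.0, 1.0, 1.0, 1.0, 0.5], [3, 2]))  # False: ||(1,1)|| > 1
```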
It is well known that second-order cone programming problems (SOCP) have a wide range of applications in engineering, control and management science [1,23,26]. For example, the grasping force optimization problem for a multi-fingered robot hand can be recast as an SOCP; see [23, Example 5.3] for real application data. For solving SOCP (1), there also exist many traditional optimization methods such as the interior point method [24], the merit function method [7,18], the Newton method [21,31], the projection method [12], and so on. For a survey of solution methods, refer to [4]. In this paper, we are interested in the so-called neural network approach for solving SOCP (1), which is substantially different from the traditional ones. The main motivation for employing this approach arises from the following reason. In many applications, for example, force analysis in robot grasping and control applications, real-time solutions are usually imperative. For such applications, traditional optimization methods may not be competent due to the problem's stringent requirement on computational time.
Compared with the traditional optimization methods, the neural network method has its advantage in dealing with real-time optimization problems. Hence, many continuous-time neural networks for constrained optimization problems have been widely developed. At present, there are many results on neural networks for solving real-time optimization problems; see [6,9,11,14,16,17,19,22,23,25,27,33,35–39,41] and the references therein.
Neural networks date back to McCulloch and Pitts' pioneering work half a century ago, and they were first introduced to the optimization domain in the 1980s [15,20,34]. The essence of the neural network method for solving optimization problems [8] is to establish a nonnegative Lyapunov function (also called an energy function) and a dynamic system which represents an artificial neural network. Indeed, the dynamic system is usually in the form of first-order ordinary differential equations. When utilizing neural networks for solving optimization problems, we are usually much more interested in the stability of networks starting from an arbitrary point. It is expected that for any initial point, the neural network will approach its equilibrium point, which corresponds to the solution of the considered optimization problem.
In fact, the neural network approach for solving SOCP has been studied in [23,29]. More specifically, the SOCP studied in [23] is

$$\min\ f(x) \quad \text{s.t.}\quad Ax = b,\quad x \in \mathcal{K}, \tag{2}$$

which is a special case of problem (1). Two kinds of neural networks were proposed in [23]. One is based on the cone projection function (also called the NR function), for which only Lyapunov stability is guaranteed. The other is based on the Fischer–Burmeister function (FB function), for which both Lyapunov stability and asymptotic stability are proved. Moreover, when solving problem (2), it was observed that the neural network based on the NR function has better performance than the one based on the FB function in most cases (except for some oscillating cases). However, compared to the FB function, the NR function has a remarkable drawback, namely its nondifferentiability. In light of this phenomenon, the authors employed a neural network model based on the "smoothed" NR function for solving the more general SOCP (1); see [29]. In addition, all three kinds of stability, including Lyapunov stability, asymptotic stability, and exponential stability, are proved for that model in [29]. Moreover, a neural network based on the generalized FB function can be regulated appropriately by perturbing its parameter p. A previous study [6] has demonstrated its efficiency for solving nonlinear complementarity problems, which also motivates us to further explore its numerical performance for solving the SOCP. In view of the above discussions and the existing literature, we wish to keep tracking the performance of neural networks based on the "smoothed" FB function, which is the main motivation of this paper. In particular, we consider a more general function, called the generalized FB function. In other words, we propose a neural network model based on the "smoothed" generalized FB function, which includes the FB function as a special case. With this function, we perturb the parameter p associated with the generalized FB function to see how it affects the numerical performance. In addition, all three aforementioned types of stability are guaranteed for our proposed neural network. A numerical comparison between the model based on the smoothed NR function and the model based on the smoothed generalized FB function is provided.
The organization of this paper is as follows. In Section 2, we introduce concepts about stability and recall some background materials. In Section 3, based on the smoothed generalized FB function, the neural network architecture is proposed for solving problem (1). In Section 4, we study the convergence and stability results of the proposed neural network. Simulation results of the new method are reported in Section 5. Section 6 gives the conclusion of this paper.
2. Preliminaries
For a given mapping $H : \mathbb{R}^n \to \mathbb{R}^n$, consider the first-order ordinary differential equation (ODE)

$$\frac{du}{dt} = H(u(t)), \quad u(t_0) = u_0 \in \mathbb{R}^n. \tag{3}$$
In general, the issues of most concern regarding ODE (3) are the existence and uniqueness of the solution, as well as the convergence of the solution trajectory. To this end, concepts regarding equilibrium points and stability are needed. Below, we recall background materials about ODE (3) as well as stability concepts for its solution. All these materials can be found in standard ODE textbooks, e.g., [30].
Lemma 2.1 (Existence and uniqueness). Assume that $H : \mathbb{R}^n \to \mathbb{R}^n$ is a continuous mapping. Then, for arbitrary $t_0 \ge 0$ and $u_0 \in \mathbb{R}^n$, there exists a local solution $u(t)$, $t \in [t_0, \tau)$ to (3) for some $\tau > t_0$. Furthermore, if $H$ is locally Lipschitz continuous at $u_0$, then the solution is unique; and if $H$ is Lipschitz continuous on $\mathbb{R}^n$, then $\tau$ can be extended to $\infty$.

Proof. See [25, Theorem 2.5]. □
Remark 2.1. For Eq. (3), if a local solution defined on $[t_0, \tau)$ cannot be extended to a local solution on a larger interval $[t_0, \tau_1)$, where $\tau_1 > \tau$, then it is called a maximal solution, and the interval $[t_0, \tau)$ is the maximal interval of existence. It is obvious that an arbitrary local solution has an extension to a maximal one.
Lemma 2.2. Let $H : \mathbb{R}^n \to \mathbb{R}^n$ be a continuous mapping. If $u(t)$ is a maximal solution and $[t_0, \tau)$ is the maximal interval of existence associated with $u_0$ and $\tau < +\infty$, then $\lim_{t \uparrow \tau} \|u(t)\| = +\infty$.

Proof. See [25, Theorem 2.6]. □
For ODE (3), a point $u^* \in \mathbb{R}^n$ is called an equilibrium point of (3) if $H(u^*) = 0$. If there is a neighborhood $\Omega \subseteq \mathbb{R}^n$ of $u^*$ such that $H(u^*) = 0$ and $H(u) \ne 0$ for any $u \in \Omega \setminus \{u^*\}$, then $u^*$ is called an isolated equilibrium point. The following are definitions of various notions of stability. More related materials can be found in [25,30,33].
Definition 2.1. Let $u(t)$ be a solution of ODE (3).

(a) An isolated equilibrium point $u^*$ is Lyapunov stable (or stable in the sense of Lyapunov) if for any $u_0 = u(t_0)$ and $\varepsilon > 0$, there exists a $\delta > 0$ such that

$$\|u_0 - u^*\| < \delta \;\Longrightarrow\; \|u(t) - u^*\| < \varepsilon \quad \text{for } t \ge t_0.$$

(b) Under the condition that an isolated equilibrium point $u^*$ is Lyapunov stable, $u^*$ is said to be asymptotically stable if it has the property that $\|u_0 - u^*\| < \delta$ implies $u(t) \to u^*$ as $t \to \infty$.

(c) An isolated equilibrium point $u^*$ is exponentially stable for (3) if there exist $\omega < 0$, $\kappa > 0$ and $\delta > 0$ such that an arbitrary solution $u(t)$ of ODE (3) with the initial condition $u(t_0) = u_0$, $\|u_0 - u^*\| < \delta$, is defined on $[0, \infty)$ and satisfies

$$\|u(t) - u^*\| \le \kappa\, e^{\omega t}\, \|u(t_0) - u^*\|, \quad t \ge t_0.$$

Definition 2.2 (Lyapunov function). Let $\Omega \subseteq \mathbb{R}^n$ be an open neighborhood of $\bar u$. A continuously differentiable function $g : \mathbb{R}^n \to \mathbb{R}$ is said to be a Lyapunov function (or energy function) at the state $\bar u$ (over the set $\Omega$) for Eq. (3) if

$$\begin{cases} g(\bar u) = 0, \\ g(u) > 0 \quad \forall u \in \Omega \setminus \{\bar u\}, \\ \dfrac{dg(u(t))}{dt} \le 0 \quad \forall u \in \Omega. \end{cases}$$
From the above definition, it is obvious that exponential stability implies asymptotic stability. The next results show the relationship between stability and a Lyapunov function; see [5,10,40].
Lemma 2.3.

(a) An isolated equilibrium point $u^*$ is Lyapunov stable if there exists a Lyapunov function over some neighborhood $\Omega$ of $u^*$.

(b) An isolated equilibrium point $u^*$ is asymptotically stable if there exists a Lyapunov function over some neighborhood $\Omega$ of $u^*$ satisfying

$$\frac{dg(u(t))}{dt} < 0 \quad \forall u \in \Omega \setminus \{u^*\}.$$

To close this section, we briefly review some properties of the spectral factorization with respect to the second-order cone, which will be used in the subsequent analysis. Spectral factorization is one of the basic concepts in Jordan algebra; for more details, see [7,13,31]. For any vector $z = (z_1, z_2) \in \mathbb{R} \times \mathbb{R}^{l-1}$ $(l \ge 2)$, its spectral factorization with respect to the second-order cone $\mathcal{K}$ is defined as

$$z = \lambda_1(z)\, e_1(z) + \lambda_2(z)\, e_2(z),$$

where $\lambda_i(z) = z_1 + (-1)^i \|z_2\|$ $(i = 1, 2)$ are called the spectral values of $z$, and

$$e_i(z) = \begin{cases} \dfrac{1}{2}\Big(1,\ (-1)^i \dfrac{z_2}{\|z_2\|}\Big), & z_2 \ne 0, \\[6pt] \dfrac{1}{2}\big(1,\ (-1)^i w\big), & z_2 = 0, \end{cases}$$

with $w \in \mathbb{R}^{l-1}$ being an arbitrary element such that $\|w\| = 1$. Here $e_1(z)$ and $e_2(z)$ are called the spectral vectors of $z$. It is well known that for any $z \in \mathbb{R}^l$, we have

$$\lambda_1(z) \le \lambda_2(z) \quad \text{and} \quad \lambda_1(z) \ge 0 \iff z \in \mathcal{K}.$$

Note that any closed convex cone always induces a partial order.
Suppose that the partial order "$\succeq_{\mathcal{K}}$" is induced by $\mathcal{K}$, i.e., $z \succeq_{\mathcal{K}} 0 \iff z \in \mathcal{K}$. The following technical lemma is helpful for the subsequent analysis.

Lemma 2.4 (Pan et al. [32, Lemma 2.2]). For any $0 \le r \le 1$ and $z \succeq_{\mathcal{K}} w \succeq_{\mathcal{K}} 0$, we have $z^r \succeq_{\mathcal{K}} w^r$.
For any $x = (x_1, x_2) \in \mathbb{R} \times \mathbb{R}^{n-1}$ and $y = (y_1, y_2) \in \mathbb{R} \times \mathbb{R}^{n-1}$, the Jordan product $x \circ y$ is defined as

$$x \circ y = \begin{bmatrix} \langle x, y\rangle \\ x_1 y_2 + y_1 x_2 \end{bmatrix}.$$
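Both definitions translate directly into code. The following sketch (ours, not from the paper) implements the spectral factorization and the Jordan product, and verifies the reconstruction $z = \lambda_1(z)e_1(z) + \lambda_2(z)e_2(z)$ together with the membership test $\lambda_1(z) \ge 0$:

```python
import numpy as np

def spec_decomp(z):
    """Spectral values and vectors of z = (z1, z2) with respect to the SOC."""
    z = np.asarray(z, dtype=float)
    z1, z2 = z[0], z[1:]
    nz2 = np.linalg.norm(z2)
    w = z2 / nz2 if nz2 > 0 else np.eye(z2.size)[0]       # arbitrary unit w when z2 = 0
    lam = np.array([z1 - nz2, z1 + nz2])                  # lambda_i = z1 + (-1)^i ||z2||
    e = [0.5 * np.concatenate(([1.0], (-1.0)**i * w)) for i in (1, 2)]
    return lam, e

def jordan_product(x, y):
    """x o y = (<x, y>, x1*y2 + y1*x2)."""
    x, y = np.asarray(x, dtype=float), np.asarray(y, dtype=float)
    return np.concatenate(([x @ y], x[0] * y[1:] + y[0] * x[1:]))

z = np.array([3.0, 1.0, 2.0])
lam, e = spec_decomp(z)
print(np.allclose(lam[0] * e[0] + lam[1] * e[1], z))  # True: z = lam1*e1 + lam2*e2
print(lam[0] >= 0)                                    # True, so z is in K
print(jordan_product(e[0], e[1]))                     # ~0: the spectral vectors annihilate
```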
Based on the Jordan product and the spectral factorization with respect to the second-order cone $\mathcal{K}$, we often employ the following vector-valued functions (also called SOC-functions) associated with $|t|^p$ $(t \in \mathbb{R})$ and $\sqrt[p]{t}$ $(t \ge 0)$, respectively, which are expressed as

$$|x|^p = |\lambda_1(x)|^p\, e_1(x) + |\lambda_2(x)|^p\, e_2(x) \quad \forall x \in \mathbb{R}^n,$$
$$\sqrt[p]{x} = \sqrt[p]{\lambda_1(x)}\, e_1(x) + \sqrt[p]{\lambda_2(x)}\, e_2(x) \quad \forall x \in \mathcal{K}.$$

In light of the expressions of $|x|^p$ and $\sqrt[p]{x}$ above, for any $p > 1$, the generalized FB function $\phi_p : \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}^n$ associated with the second-order cone is defined in [32] as

$$\phi_p(x, y) := \sqrt[p]{|x|^p + |y|^p} - (x + y).$$

In particular, the authors of [32] have shown that $\phi_p(x,y)$ is an SOC-complementarity function, i.e.,

$$\phi_p(x, y) = 0 \iff x \in \mathcal{K},\ y \in \mathcal{K} \text{ and } \langle x, y\rangle = 0.$$

This also yields that the function $\Phi_p : \mathbb{R}^n \to \mathbb{R}$ given by

$$\Phi_p(x) := \tfrac{1}{2}\|\phi_p(x, F(x))\|^2$$

(with $F : \mathbb{R}^n \to \mathbb{R}^n$ the mapping defining the complementarity problem) is a merit function for second-order cone complementarity problems.
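To make these SOC-functions concrete, the following sketch (ours; it reuses spec_decomp from the previous sketch) evaluates $|x|^p$, $\sqrt[p]{x}$ and $\phi_p$, and illustrates the complementarity characterization on a boundary pair with $\langle x, y\rangle = 0$:

```python
import numpy as np
# reuses spec_decomp from the previous sketch

def soc_abs_pow(x, p):
    """|x|^p = |lam1|^p e1 + |lam2|^p e2."""
    lam, e = spec_decomp(x)
    return abs(lam[0])**p * e[0] + abs(lam[1])**p * e[1]

def soc_root(x, p):
    """p-th root of x in K: lam1^{1/p} e1 + lam2^{1/p} e2 (requires lam_i >= 0)."""
    lam, e = spec_decomp(x)
    return lam[0]**(1.0 / p) * e[0] + lam[1]**(1.0 / p) * e[1]

def phi_p(x, y, p):
    """Generalized FB function: (|x|^p + |y|^p)^{1/p} - (x + y)."""
    return soc_root(soc_abs_pow(x, p) + soc_abs_pow(y, p), p) - (np.asarray(x) + np.asarray(y))

# x and y on the boundary of K with <x, y> = 0:
x = np.array([1.0, 1.0, 0.0])
y = np.array([1.0, -1.0, 0.0])
print(np.allclose(phi_p(x, y, p=2.5), 0.0, atol=1e-10))  # True: x, y in K and <x,y> = 0
```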
Moreover, the following conclusions are obtained in [32].

Lemma 2.5. For any $p > 1$, let $w := w(x,y) := |x|^p + |y|^p$, $t = t(x,y) := \sqrt[p]{w}$, and denote $g^{soc}(x) := |x|^p$. Then $t(x,y)$ is continuously differentiable at any $(x,y)$ with $w \in \mathrm{int}(\mathcal{K})$, and

$$\nabla_x t(x,y) = \nabla g^{soc}(x)\, \nabla g^{soc}(t)^{-1} \quad \text{and} \quad \nabla_y t(x,y) = \nabla g^{soc}(y)\, \nabla g^{soc}(t)^{-1},$$

where

$$\nabla g^{soc}(x) = \begin{cases} p\, \mathrm{sign}(x_1)\, |x_1|^{p-1}\, I, & x_2 = 0, \\[6pt] \begin{bmatrix} b(x) & c(x)\, \bar x_2^T \\ c(x)\, \bar x_2 & a(x)\, I + (b(x) - a(x))\, \bar x_2 \bar x_2^T \end{bmatrix}, & x_2 \ne 0, \end{cases}$$

with $\bar x_2 = \dfrac{x_2}{\|x_2\|}$ and

$$a(x) = \frac{|\lambda_2(x)|^p - |\lambda_1(x)|^p}{\lambda_2(x) - \lambda_1(x)},$$
$$b(x) = \frac{p}{2}\Big( \mathrm{sign}(\lambda_2(x))\, |\lambda_2(x)|^{p-1} + \mathrm{sign}(\lambda_1(x))\, |\lambda_1(x)|^{p-1} \Big),$$
$$c(x) = \frac{p}{2}\Big( \mathrm{sign}(\lambda_2(x))\, |\lambda_2(x)|^{p-1} - \mathrm{sign}(\lambda_1(x))\, |\lambda_1(x)|^{p-1} \Big).$$

Proof. See [32, Lemma 3.2]. □
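The gradient matrix in Lemma 2.5 can be assembled directly from $a(x)$, $b(x)$ and $c(x)$. The sketch below (ours, reusing spec_decomp from the earlier sketch) builds $\nabla g^{soc}(x)$ and sanity-checks it against a finite-difference Jacobian of $|x|^p$:

```python
import numpy as np
# reuses spec_decomp from the earlier sketch

def grad_gsoc(x, p):
    """Gradient matrix of g^soc(x) = |x|^p as given in Lemma 2.5."""
    x = np.asarray(x, dtype=float)
    n = x.size
    if np.linalg.norm(x[1:]) == 0.0:
        return p * np.sign(x[0]) * abs(x[0])**(p - 1) * np.eye(n)
    (l1, l2), _ = spec_decomp(x)
    xb = x[1:] / np.linalg.norm(x[1:])                    # \bar{x}_2 = x_2 / ||x_2||
    a = (abs(l2)**p - abs(l1)**p) / (l2 - l1)
    b = 0.5 * p * (np.sign(l2) * abs(l2)**(p - 1) + np.sign(l1) * abs(l1)**(p - 1))
    c = 0.5 * p * (np.sign(l2) * abs(l2)**(p - 1) - np.sign(l1) * abs(l1)**(p - 1))
    M = np.empty((n, n))
    M[0, 0] = b
    M[0, 1:] = c * xb
    M[1:, 0] = c * xb
    M[1:, 1:] = a * np.eye(n - 1) + (b - a) * np.outer(xb, xb)
    return M

def gsoc(x, p):
    """|x|^p via the spectral definition, for a finite-difference cross-check."""
    lam, e = spec_decomp(x)
    return abs(lam[0])**p * e[0] + abs(lam[1])**p * e[1]

x0, p0, h = np.array([0.8, 0.3, -0.5]), 2.0, 1e-6
J = np.array([(gsoc(x0 + h * np.eye(3)[i], p0) - gsoc(x0 - h * np.eye(3)[i], p0)) / (2 * h)
              for i in range(3)])
print(np.allclose(J, grad_gsoc(x0, p0), atol=1e-5))       # True (the matrix is symmetric)
```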
Lemma 2.6. Let $\Phi_p$ be defined as $\Phi_p(x,y) := \frac{1}{2}\|\phi_p(x,y)\|^2$, and denote $w(x,y) := |x|^p + |y|^p$ and $g^{soc}(x) := |x|^p$. Then the function $\Phi_p$ for $p \in (1,4)$ is differentiable everywhere. Moreover, for any $x, y \in \mathbb{R}^n$,

(a) if $w(x,y) = 0$, then $\nabla_x \Phi_p(x,y) = \nabla_y \Phi_p(x,y) = 0$;

(b) if $w(x,y) \in \mathrm{int}(\mathcal{K})$, then

$$\nabla_x \Phi_p(x,y) = \big( \nabla g^{soc}(x)\, \nabla g^{soc}(t)^{-1} - I \big)\, \phi_p(x,y) = \big( \nabla g^{soc}(x) - \nabla g^{soc}(t) \big)\, \nabla g^{soc}(t)^{-1}\, \phi_p(x,y),$$
$$\nabla_y \Phi_p(x,y) = \big( \nabla g^{soc}(y)\, \nabla g^{soc}(t)^{-1} - I \big)\, \phi_p(x,y) = \big( \nabla g^{soc}(y) - \nabla g^{soc}(t) \big)\, \nabla g^{soc}(t)^{-1}\, \phi_p(x,y);$$

(c) if $w(x,y) \in \partial\mathcal{K} \setminus \{0\}$, where $\partial\mathcal{K}$ means the boundary of $\mathcal{K}$, then

$$\nabla_x \Phi_p(x,y) = \left( \frac{\mathrm{sign}(x_1)\, |x_1|^{p-1}}{\big(\sqrt[p]{|x_1|^p + |y_1|^p}\big)^{p-1}} - 1 \right) \phi_p(x,y),$$
$$\nabla_y \Phi_p(x,y) = \left( \frac{\mathrm{sign}(y_1)\, |y_1|^{p-1}}{\big(\sqrt[p]{|x_1|^p + |y_1|^p}\big)^{p-1}} - 1 \right) \phi_p(x,y).$$

Proof. See [32, Proposition 3.1]. □
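The interior-case gradient formula in Lemma 2.6(b) can be cross-checked numerically. In the following sketch (ours, reusing phi_p, soc_abs_pow, soc_root and grad_gsoc from the sketches above), the analytic gradient agrees with a finite-difference gradient of $\Phi_p$ at a point where $w(x,y) \in \mathrm{int}(\mathcal{K})$:

```python
import numpy as np
# reuses phi_p, soc_abs_pow, soc_root and grad_gsoc from the sketches above

def Phi(x, y, p):
    r = phi_p(x, y, p)
    return 0.5 * r @ r

p = 2.5
x, y = np.array([1.5, 0.2, -0.3]), np.array([0.7, 0.1, 0.4])
t = soc_root(soc_abs_pow(x, p) + soc_abs_pow(y, p), p)   # here w(x,y) is interior to K
# analytic gradient from the interior case of Lemma 2.6(b)
gx = (grad_gsoc(x, p) @ np.linalg.inv(grad_gsoc(t, p)) - np.eye(3)) @ phi_p(x, y, p)
# central finite-difference gradient of Phi_p in x
h, e = 1e-6, np.eye(3)
fd = np.array([(Phi(x + h * e[i], y, p) - Phi(x - h * e[i], y, p)) / (2 * h) for i in range(3)])
print(np.allclose(gx, fd, atol=1e-5))                    # True
```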
3. Generalized FB neural network model
In this section, we explain how we form the dynamic system. As mentioned earlier, the key points for the neural network method lie in constructing the dynamic system and the Lyapunov function. To this end, we first look into the KKT conditions of problem (1), which are presented as below:

$$\begin{cases} \nabla f(x) - A^T y - \nabla g(x)\, z = 0, \\ z \in \mathcal{K},\quad g(x) \in \mathcal{K},\quad z^T g(x) = 0, \\ Ax - b = 0, \end{cases} \tag{4}$$

where $y \in \mathbb{R}^m$, $z \in \mathbb{R}^l$, and $\nabla g(x)$ denotes the gradient matrix of $g$. It is well known that if problem (1) satisfies Slater's condition, meaning that there exists a strictly feasible point for problem (1), i.e., there exists an $x \in \mathbb{R}^n$ such that $g(x) \in \mathrm{int}(\mathcal{K})$ and $Ax = b$, then for the nonlinear convex program (1), $x^*$ is a solution of problem (1) if and only if there exist $y^*$ and $z^*$ such that $(x^*, y^*, z^*)$ satisfies the KKT conditions (4); see [2]. Hence, we assume throughout this paper that problem (1) satisfies Slater's condition.
Lemma 3.1. For $z = (z_1, z_2) \in \mathbb{R} \times \mathbb{R}^{n-1}$ and $x = (x_1, x_2) \in \mathbb{R} \times \mathbb{R}^{n-1}$ with $z \succeq_{\mathcal{K}} x$, we have $\lambda_i(z) \ge \lambda_i(x)$ for $i = 1, 2$.

Proof. Since $z \succeq_{\mathcal{K}} x$, we may express $z = x + y$, where $x = (x_1, x_2) \in \mathbb{R} \times \mathbb{R}^{n-1}$, $y = (y_1, y_2) \in \mathbb{R} \times \mathbb{R}^{n-1}$ and $y = z - x \succeq_{\mathcal{K}} 0$. This implies $y_1 \ge \|y_2\|$ and

$$\lambda_1(z) = (x_1 + y_1) - \|x_2 + y_2\| \ge (x_1 + y_1) - \|x_2\| - \|y_2\| \ge x_1 - \|x_2\| = \lambda_1(x).$$

Furthermore, we have

$$\lambda_2(z) = (x_1 + y_1) + \|x_2 + y_2\| \ge (x_1 + y_1) + \big|\, \|x_2\| - \|y_2\| \,\big| = \begin{cases} x_1 + y_1 + \|x_2\| - \|y_2\|, & \text{if } \|x_2\| \ge \|y_2\|, \\ x_1 + y_1 - \|x_2\| + \|y_2\|, & \text{if } \|x_2\| < \|y_2\|, \end{cases}$$
$$\ge \begin{cases} x_1 + \|x_2\|, & \text{if } \|x_2\| \ge \|y_2\|, \\ x_1 + y_1, & \text{if } \|x_2\| < \|y_2\|, \end{cases} \quad \ge\; x_1 + \|x_2\| = \lambda_2(x),$$

which is the desired result. □
Lemma 3.2. Let $w := w(x,y) = |x|^p + |y|^p$, $t = t(x,y) := \sqrt[p]{w}$ and $g^{soc}(x) := |x|^p$. Then the following three matrices

$$\nabla g^{soc}(t) - \nabla g^{soc}(x), \quad \nabla g^{soc}(t) - \nabla g^{soc}(y), \quad \big(\nabla g^{soc}(t) - \nabla g^{soc}(x)\big)\big(\nabla g^{soc}(t) - \nabla g^{soc}(y)\big)$$

are all positive semidefinite for $p = \frac{n}{2}$ with $n \in \mathbb{N}$.
Proof. From the expression of $\nabla g^{soc}(x)$ in Lemma 2.5 and the proof of [32, Lemma 3.2], we know that the eigenvalues of $\nabla g^{soc}(x)$ for $x_2 \ne 0$ are

$$b(x) - c(x), \quad a(x), \dots, a(x), \quad \text{and} \quad b(x) + c(x).$$

Let $w =: (w_1, w_2) \in \mathbb{R} \times \mathbb{R}^{n-1}$. Then applying [32, Lemma 3.1] gives

$$w_1 = \frac{|\lambda_2(x)|^p + |\lambda_1(x)|^p}{2} + \frac{|\lambda_2(y)|^p + |\lambda_1(y)|^p}{2}, \qquad w_2 = \frac{|\lambda_2(x)|^p - |\lambda_1(x)|^p}{2}\, \bar x_2 + \frac{|\lambda_2(y)|^p - |\lambda_1(y)|^p}{2}\, \bar y_2,$$

where $\bar x_2 = \frac{x_2}{\|x_2\|}$ if $x_2 \ne 0$, and otherwise $\bar x_2$ is an arbitrary vector in $\mathbb{R}^{n-1}$ satisfying $\|\bar x_2\| = 1$; the same convention applies to $\bar y_2$. We proceed with the proof by discussing two cases: $w_2 = 0$ or $w_2 \ne 0$.

Case 1: For $w_2 = 0$, we have $\nabla g^{soc}(t) = p\big(\sqrt[p]{w_1}\big)^{p-1} I$, where

$$w_1 = \frac{|\lambda_2(x)|^p + |\lambda_1(x)|^p}{2} + \frac{|\lambda_2(y)|^p + |\lambda_1(y)|^p}{2}. \tag{5}$$

Under the condition $w_2 = 0$, there are the following two subcases.

(i) If $x_2 = 0$, then $w_1 = |x_1|^p + \frac{|\lambda_2(y)|^p + |\lambda_1(y)|^p}{2}$, which implies that $p\big(\sqrt[p]{w_1}\big)^{p-1} \ge p\,\mathrm{sign}(x_1)|x_1|^{p-1}$. Hence, we see that the matrix $\nabla g^{soc}(t) - \nabla g^{soc}(x)$ is positive semidefinite. Indeed, if $x \ne 0$, then $\nabla g^{soc}(t) - \nabla g^{soc}(x)$ is positive definite.

(ii) If $x_2 \ne 0$, it follows from $w_2 = 0$ that

$$\left| \frac{|\lambda_2(x)|^p - |\lambda_1(x)|^p}{2} \right| = \left| \frac{|\lambda_2(y)|^p - |\lambda_1(y)|^p}{2} \right|. \tag{6}$$

We want to prove that the matrix $\nabla g^{soc}(t) - \nabla g^{soc}(x)$ is positive semidefinite. It is sufficient to show that

$$p\big(\sqrt[p]{w_1}\big)^{p-1} \ge \max\big\{ b(x) - c(x),\ a(x),\ b(x) + c(x) \big\}.$$

It is obvious that $p(\sqrt[p]{w_1})^{p-1} - (b(x) - c(x)) > 0$ when $\lambda_1(x) < 0$. When $\lambda_1(x) \ge 0$, using (5) and $\lambda_2(x) \ge \lambda_1(x)$, we have

$$p\big(\sqrt[p]{w_1}\big)^{p-1} - \big(b(x) - c(x)\big) \ge p\Big(\sqrt[p]{|\lambda_1(x)|^p}\Big)^{p-1} - p\,\mathrm{sign}(\lambda_1(x))|\lambda_1(x)|^{p-1} \ge 0.$$

Next, we verify that $p(\sqrt[p]{w_1})^{p-1} - a(x) \ge 0$. For $|\lambda_1(x)| \ge |\lambda_2(x)|$, it is clear that $p(\sqrt[p]{w_1})^{p-1} - a(x) \ge 0$. For $|\lambda_1(x)| < |\lambda_2(x)|$, it follows from $\lambda_2(x) \ge \lambda_1(x)$ that $x_1 > 0$, which yields

$$a(x) = \frac{|\lambda_2(x)|^p - |\lambda_1(x)|^p}{\lambda_2(x) - \lambda_1(x)} \le \frac{\lambda_2(x)^p - |\lambda_1(x)|^p}{\lambda_2(x) - |\lambda_1(x)|}.$$

Let $p = \frac{n}{m}$ $(n, m \in \mathbb{N})$, $a = \lambda_2(x)^{1/m}$ and $b = |\lambda_1(x)|^{1/m}$. From $p > 1$, it follows that $n > m$. Then we have $0 \le b < a$ and

$$a(x) \le \frac{a^n - b^n}{a^m - b^m} = \frac{a^{n-1} + a^{n-2}b + \cdots + ab^{n-2} + b^{n-1}}{a^{m-1} + a^{m-2}b + \cdots + ab^{m-2} + b^{m-1}}.$$

Now, letting $f(v) = \frac{a^n - v^n}{a^m - v^m}$ with $v \in [0, a)$, we obtain

$$f'(v) = \frac{-n v^{n-1}(a^m - v^m) + m v^{m-1}(a^n - v^n)}{(a^m - v^m)^2}.$$

In addition, it follows from $f'(v) = 0$ that

$$\frac{a^n - v^n}{a^m - v^m} = \frac{n}{m}\, v^{n-m}.$$

Since $f(0) = \frac{a^n}{a^m} = a^{n-m}$ at $v = 0$ and $f(v) \to \frac{n}{m}a^{n-m}$ as $v \to a$, it is easy to verify that $f(b) \le \frac{n}{m}a^{n-m}$ for $0 \le b < a$, i.e.,

$$\frac{|\lambda_2(x)|^p - |\lambda_1(x)|^p}{\lambda_2(x) - \lambda_1(x)} \le p\,|\lambda_2(x)|^{p-1}.$$

Hence, we have

$$p\big(\sqrt[p]{w_1}\big)^{p-1} - a(x) \ge p\Big(\sqrt[p]{\max\{|\lambda_2(x)|^p, |\lambda_1(x)|^p\} + \min\{|\lambda_2(y)|^p, |\lambda_1(y)|^p\}}\Big)^{p-1} - \frac{|\lambda_2(x)|^p - |\lambda_1(x)|^p}{\lambda_2(x) - \lambda_1(x)} \ge p\big(\sqrt[p]{\lambda_2(x)^p}\big)^{p-1} - p\,|\lambda_2(x)|^{p-1} \ge 0,$$

where the first inequality holds due to (6). Lastly, we also see that

$$p\big(\sqrt[p]{w_1}\big)^{p-1} - \big(b(x) + c(x)\big) \ge p\Big(\sqrt[p]{\max\{|\lambda_2(x)|^p, |\lambda_1(x)|^p\} + \min\{|\lambda_2(y)|^p, |\lambda_1(y)|^p\}}\Big)^{p-1} - p\,\mathrm{sign}(\lambda_2(x))|\lambda_2(x)|^{p-1}$$
$$\ge p\Big(\sqrt[p]{\max\{|\lambda_2(x)|^p, |\lambda_1(x)|^p\}}\Big)^{p-1} - p\,\mathrm{sign}(\lambda_2(x))|\lambda_2(x)|^{p-1} \ge 0.$$

To sum up, in the case $x_2 \ne 0$ we have proved that the matrix $\nabla g^{soc}(t) - \nabla g^{soc}(x)$ is positive semidefinite.

Case 2: For $w_2 \ne 0$, from the expression of $t(x,y)$ and the properties of the spectral values of the vector-valued function $|x|^p$ with $p = \frac{n}{2}$ for $n \in \mathbb{N}$, all the eigenvalues of the matrix $\nabla g^{soc}(t)$ satisfy

$$b(t) - c(t) \le a(t) \le b(t) + c(t). \tag{7}$$

When $x_2 = 0$, we note that

$$b(t) - c(t) - p\,\mathrm{sign}(x_1)|x_1|^{p-1} = p\big(\sqrt[p]{\lambda_1(w)}\big)^{p-1} - p\,\mathrm{sign}(x_1)|x_1|^{p-1}$$
$$= p\left( \frac{|\lambda_2(x)|^p + |\lambda_1(x)|^p}{2} + \frac{|\lambda_2(y)|^p + |\lambda_1(y)|^p}{2} - \left\| \frac{|\lambda_2(x)|^p - |\lambda_1(x)|^p}{2}\,\bar x_2 + \frac{|\lambda_2(y)|^p - |\lambda_1(y)|^p}{2}\,\bar y_2 \right\| \right)^{\frac{p-1}{p}} - p\,\mathrm{sign}(x_1)|x_1|^{p-1}$$
$$\ge p\,|x_1|^{p-1} - p\,\mathrm{sign}(x_1)|x_1|^{p-1} \ge 0,$$

where $\bar y_2$ denotes $\bar y_2 = \frac{y_2}{\|y_2\|}$ when $y_2 \ne 0$, and otherwise $\bar y_2$ is an arbitrary vector in $\mathbb{R}^{n-1}$ satisfying $\|\bar y_2\| = 1$. Now, applying the relation of the eigenvalues in (7), we have

$$b(t) + c(t) \ge a(t) \ge b(t) - c(t) \ge p\,\mathrm{sign}(x_1)|x_1|^{p-1},$$

which implies that the matrix $\nabla g^{soc}(t) - \nabla g^{soc}(x)$ is positive semidefinite.

When $x_2 \ne 0$, we also note that

$$b(t) - c(t) - \big(b(x) - c(x)\big) = p\big(\sqrt[p]{\lambda_1(w)}\big)^{p-1} - p\,\mathrm{sign}(\lambda_1(x))|\lambda_1(x)|^{p-1}.$$

For $\lambda_1(x) < 0$, it is clear that $b(t) - c(t) - (b(x) - c(x)) \ge 0$. For $\lambda_1(x) \ge 0$, we have $\lambda_2(x) \ge \lambda_1(x) \ge 0$, which leads to

$$\lambda_1(w) = \frac{|\lambda_2(x)|^p + |\lambda_1(x)|^p}{2} + \frac{|\lambda_2(y)|^p + |\lambda_1(y)|^p}{2} - \left\| \frac{|\lambda_2(x)|^p - |\lambda_1(x)|^p}{2}\,\bar x_2 + \frac{|\lambda_2(y)|^p - |\lambda_1(y)|^p}{2}\,\bar y_2 \right\|$$
$$\ge \left( \frac{|\lambda_2(x)|^p + |\lambda_1(x)|^p}{2} - \frac{\big| |\lambda_2(x)|^p - |\lambda_1(x)|^p \big|}{2} \right) + \left( \frac{|\lambda_2(y)|^p + |\lambda_1(y)|^p}{2} - \frac{\big| |\lambda_2(y)|^p - |\lambda_1(y)|^p \big|}{2} \right)$$
$$\ge |\lambda_1(x)|^p.$$

Thus, it follows that $b(t) - c(t) - (b(x) - c(x)) \ge 0$. Moreover, since $t \succeq_{\mathcal{K}} |x|$ (which follows from Lemma 2.4), by Lemma 3.1 and the eigenvalues of $|x|$ being $|\lambda_1(x)|$ and $|\lambda_2(x)|$, we have

$$\lambda_2(t) \ge \max\{|\lambda_1(x)|, |\lambda_2(x)|\} \quad \text{and} \quad \lambda_1(t) \ge \min\{|\lambda_1(x)|, |\lambda_2(x)|\}. \tag{8}$$

When $p = \frac{n}{2}$ with $n \in \mathbb{N}$, we have

$$a(t) - a(x) = \frac{\lambda_2(t)^{n/2} - \lambda_1(t)^{n/2}}{\lambda_2(t) - \lambda_1(t)} - \frac{|\lambda_2(x)|^{n/2} - |\lambda_1(x)|^{n/2}}{\lambda_2(x) - \lambda_1(x)}.$$

If $|\lambda_2(x)| < |\lambda_1(x)|$, it is obvious that $a(t) - a(x) \ge 0$. If $|\lambda_2(x)| \ge |\lambda_1(x)|$, in light of $\lambda_2(x) \ge \lambda_1(x)$, we obtain that $x_1 \ge 0$ and $\lambda_2(x) \ge 0$. Now, let

$$a := \lambda_2(t)^{1/2}, \quad b := \lambda_1(t)^{1/2}, \quad c := \lambda_2(x)^{1/2} \quad \text{and} \quad d := |\lambda_1(x)|^{1/2}.$$

Then we get

$$a(t) - a(x) = \frac{a^n - b^n}{a^2 - b^2} - \frac{c^n - d^n}{c^2 - d^2}$$
$$= \frac{(a^{n-1} + a^{n-2}b + \cdots + ab^{n-2} + b^{n-1})(c + d) - (a + b)(c^{n-1} + c^{n-2}d + \cdots + cd^{n-2} + d^{n-1})}{(a+b)(c+d)}$$
$$= \frac{a^{n-1}c + bc(a^{n-2} + a^{n-3}b + \cdots + ab^{n-3} + b^{n-2}) + ad(a^{n-2} + a^{n-3}b + \cdots + ab^{n-3} + b^{n-2}) + b^{n-1}d}{(a+b)(c+d)}$$
$$- \frac{ac^{n-1} + ad(c^{n-2} + c^{n-3}d + \cdots + cd^{n-3} + d^{n-2}) + bc(c^{n-2} + c^{n-3}d + \cdots + cd^{n-3} + d^{n-2}) + bd^{n-1}}{(a+b)(c+d)},$$

which together with (8) implies that $a \ge c$, $b \ge d \ge 0$ and $a(t) - a(x) \ge 0$. In addition, we also verify that

$$b(t) + c(t) - \big(b(x) + c(x)\big) = p\,\lambda_2(t)^{p-1} - p\,\mathrm{sign}(\lambda_2(x))|\lambda_2(x)|^{p-1} \ge 0.$$

Therefore, for any $x \in \mathbb{R}^n$, we have

$$x^T\big(\nabla g^{soc}(t) - \nabla g^{soc}(x)\big)x = x^T \nabla g^{soc}(t)\, x - x^T \nabla g^{soc}(x)\, x \ge 0,$$

since each of the eigenvalues $b(t) - c(t)$, $a(t)$ (with multiplicity $n - 2$) and $b(t) + c(t)$ of $\nabla g^{soc}(t)$ dominates the corresponding eigenvalue of $\nabla g^{soc}(x)$. This shows that the matrix $\nabla g^{soc}(t) - \nabla g^{soc}(x)$ is positive semidefinite.

With the same arguments, we can verify that the matrix $\nabla g^{soc}(t) - \nabla g^{soc}(y)$ is also positive semidefinite.

Finally, using the property of the eigenvalues of a product of symmetric matrices, i.e.,

$$\lambda_i(AB) \ge \lambda_i(A)\, \lambda_{\min}(B), \quad i = 1, \dots, n, \quad \forall A, B \in S^{n\times n},$$

where $S^{n\times n}$ denotes the set of $n \times n$ symmetric matrices, we easily obtain that the matrix $(\nabla g^{soc}(t) - \nabla g^{soc}(x))(\nabla g^{soc}(t) - \nabla g^{soc}(y))$ is also positive semidefinite. □
Remark 3.1. From the above proof of Lemma 3.2, when $x \ne 0$ and $y \ne 0$, the matrices $\nabla g^{soc}(t) - \nabla g^{soc}(x)$, $\nabla g^{soc}(t) - \nabla g^{soc}(y)$ and $(\nabla g^{soc}(t) - \nabla g^{soc}(x))(\nabla g^{soc}(t) - \nabla g^{soc}(y))$ are all positive definite.
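Before relying on Lemma 3.2 in the stability analysis, it can be probed numerically. The sketch below (ours, reusing spec_decomp, soc_abs_pow, soc_root and grad_gsoc from the earlier sketches) samples random points and checks the eigenvalues of $\nabla g^{soc}(t) - \nabla g^{soc}(x)$; here $p = 5/2$, which has the form $\frac{n}{2}$ required by the lemma:

```python
import numpy as np
# reuses spec_decomp, soc_abs_pow, soc_root, grad_gsoc from the earlier sketches

rng = np.random.default_rng(0)
p = 2.5                                       # p = n/2 with n = 5
for _ in range(200):
    x, y = rng.standard_normal(4), rng.standard_normal(4)
    w = soc_abs_pow(x, p) + soc_abs_pow(y, p)
    t = soc_root(w, p)                        # t = (|x|^p + |y|^p)^{1/p}
    D = grad_gsoc(t, p) - grad_gsoc(x, p)
    assert np.linalg.eigvalsh(D).min() >= -1e-8   # positive semidefinite up to roundoff
print("no violations of Lemma 3.2 found")         # same check applies with y in place of x
```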
Now, we look into the KKT conditions (4) of problem (1). Let $L(x,y,z) = \nabla f(x) - A^T y - \nabla g(x)\, z$,

$$H(u) := \begin{bmatrix} Ax - b \\ L(x,y,z) \\ \phi_p(z, g(x)) \end{bmatrix} \tag{9}$$

and

$$\Psi_p(u) := \frac{1}{2}\|H(u)\|^2 = \frac{1}{2}\|\phi_p(z, g(x))\|^2 + \frac{1}{2}\|L(x,y,z)\|^2 + \frac{1}{2}\|Ax - b\|^2,$$

where $u = (x^T, y^T, z^T)^T \in \mathbb{R}^n \times \mathbb{R}^m \times \mathbb{R}^l$. From Lemma 2.5 in [32], we know that

$$\phi_p(z, g(x)) = 0 \iff z \in \mathcal{K},\ g(x) \in \mathcal{K},\ z^T g(x) = 0.$$

Hence, the KKT conditions (4) are equivalent to $H(u) = 0$, i.e., $\Psi_p(u) = 0$. It then follows that the KKT conditions (4) are equivalent to the following unconstrained minimization problem with zero optimal value via the merit function approach:

$$\min\ \Psi_p(u) := \frac{1}{2}\|H(u)\|^2. \tag{10}$$
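To see how (9) and (10) are assembled in practice, the following sketch (ours; the toy problem data are illustrative assumptions, not an example from the paper, and phi_p comes from the Section 2 sketch) builds the residual $H(u)$ and the merit function $\Psi_p$ for a small problem with $f(x) = \frac{1}{2}\|x\|^2$, $g(x) = x$ and one equality constraint:

```python
import numpy as np
# reuses phi_p from the Section 2 sketch

# toy data (illustrative): min 0.5*||x||^2  s.t.  x1 + x2 + x3 = 1,  x in K^3
A = np.array([[1.0, 1.0, 1.0]]); b = np.array([1.0]); p = 2.5
n, m, l = 3, 1, 3
grad_f = lambda x: x                      # f(x) = 0.5*||x||^2
g = lambda x: x                           # g(x) = x, so its gradient matrix is I

def H(u):
    x, y, z = u[:n], u[n:n + m], u[n + m:]
    L = grad_f(x) - A.T @ y - z           # L(x,y,z) = grad f(x) - A^T y - grad g(x) z
    return np.concatenate([A @ x - b, L, phi_p(z, g(x), p)])

def Psi(u):
    r = H(u)
    return 0.5 * r @ r                    # Psi_p(u) = 0.5*||H(u)||^2

u0 = np.zeros(n + m + l)
print(Psi(u0))                            # 0.5 > 0: u0 = 0 violates Ax = b
```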
However, the function $\phi_p$ is not $\mathcal{K}$-convex, and the merit function $\Psi_p$ is not a convex function even for $p = 2$, as shown in Example 3.5 of [3].

Theorem 3.1. Let $\Psi_p$ be defined as in (10).

(a) The matrix $\nabla g^{soc}(x)$ is positive definite for all $0 \ne x \in \mathcal{K}$.

(b) The function $\Psi_p$ for $p \in (1,4)$ is continuously differentiable everywhere. Moreover, $\nabla \Psi_p(u) = \nabla H(u)\, H(u)$, where

$$\nabla H(u) = \begin{bmatrix} A^T & \nabla_x L(x,y,z) & \nabla g(x)\, V_1 \\ 0 & -A & 0 \\ 0 & -\nabla g(x)^T & V_2 \end{bmatrix} \tag{11}$$

with

$$V_1 = \begin{cases} 0, & w(z, g(x)) = |z|^p + |g(x)|^p = 0, \\[4pt] \nabla g^{soc}(g(x))\, \nabla g^{soc}(t)^{-1} - I, & w(z, g(x)) \in \mathrm{int}(\mathcal{K}), \\[4pt] \left( \dfrac{\mathrm{sign}(g_1(x))\, |g_1(x)|^{p-1}}{\big(\sqrt[p]{|g_1(x)|^p + |z_1|^p}\big)^{p-1}} - 1 \right) I, & w(z, g(x)) \in \partial\mathcal{K} \setminus \{0\}, \end{cases}$$

and

$$V_2 = \begin{cases} 0, & w(z, g(x)) = |z|^p + |g(x)|^p = 0, \\[4pt] \nabla g^{soc}(z)\, \nabla g^{soc}(t)^{-1} - I, & w(z, g(x)) \in \mathrm{int}(\mathcal{K}), \\[4pt] \left( \dfrac{\mathrm{sign}(z_1)\, |z_1|^{p-1}}{\big(\sqrt[p]{|g_1(x)|^p + |z_1|^p}\big)^{p-1}} - 1 \right) I, & w(z, g(x)) \in \partial\mathcal{K} \setminus \{0\}, \end{cases}$$

with $t := \sqrt[p]{w(z, g(x))}$.
Proof. (a) For all $0 \ne x \in \mathcal{K}$: if $x_2 = 0$, it is obvious that the matrix $\nabla g^{soc}(x) = p\,\mathrm{sign}(x_1)|x_1|^{p-1} I$ is positive definite. If $x_2 \ne 0$, from the expression of $\nabla g^{soc}(x)$ in Lemma 2.5 and $x \in \mathcal{K}$, we have $b(x) > 0$. In order to prove that the matrix $\nabla g^{soc}(x)$ is positive definite, it suffices to show that the Schur complement of $b(x)$ in the matrix $\nabla g^{soc}(x)$ is positive definite. In fact, from the expression of $\nabla g^{soc}(x)$, the Schur complement has the form

$$a(x)\, I + \big(b(x) - a(x)\big)\bar x_2 \bar x_2^T - \frac{c^2(x)}{b(x)}\bar x_2 \bar x_2^T = a(x)\big(I - \bar x_2 \bar x_2^T\big) + b(x)\left(1 - \frac{c^2(x)}{b^2(x)}\right)\bar x_2 \bar x_2^T.$$

Since $x \in \mathcal{K}$, we have $\lambda_2(x) \ge \lambda_1(x) \ge 0$, which implies that $a(x) > 0$ and $b(x) > c(x) \ge 0$. Note that the matrices $I - \bar x_2 \bar x_2^T$ and $\bar x_2 \bar x_2^T$ are positive semidefinite. Thus, the Schur complement is positive definite. Further, we get that $\nabla g^{soc}(x)$ is positive definite for all $0 \ne x \in \mathcal{K}$.

(b) From the proofs of Proposition 3.1 and Lemma 3.2 of [32], we know that the function $\Psi_p$ for $p \in (1,4)$ is continuously differentiable everywhere. Hence, in view of the definition of the function $\Psi_p$ and the chain rule, the expression of $\nabla \Psi_p(u)$ is obtained. □
In light of the main ideas for constructing artificial neural networks (see [8] for details), we now establish a specific first-order ordinary differential equation, i.e., an artificial neural network. More specifically, based on the gradient of the merit function $\Psi_p$ in the minimization problem (10), we propose the following neural network for solving the KKT system (4) of the nonlinear SOCP (1):

$$\frac{du(t)}{dt} = -\rho\, \nabla \Psi_p(u), \quad u(t_0) = u_0, \tag{12}$$

where $\rho > 0$ is a time scaling factor. In fact, if $\tau = \rho t$, then $\frac{du(t)}{dt} = \rho\, \frac{du(\tau)}{d\tau}$. Hence, it follows from (12) that $\frac{du(\tau)}{d\tau} = -\nabla \Psi_p(u)$. For simplicity and convenience, we set $\rho = 1$ in this paper.
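A minimal discrete-time realization of (12) can then be obtained by integrating the gradient flow. The sketch below is ours: the paper integrates with Matlab's ode23, whereas here a forward Euler step and a finite-difference gradient stand in for the closed-form $\nabla \Psi_p$ of Theorem 3.1 (Psi and the toy problem are reused from the previous sketch):

```python
import numpy as np
# reuses Psi and the toy problem from the previous sketch

def num_grad(F, u, h=1e-6):
    """Central finite-difference gradient; Theorem 3.1 supplies the closed form instead."""
    e = np.eye(u.size)
    return np.array([(F(u + h * e[i]) - F(u - h * e[i])) / (2 * h) for i in range(u.size)])

rho, dt = 1.0, 1e-2                       # time scaling factor and Euler step size
u = np.zeros(7)                           # initial state u0 = (x, y, z) = 0
for _ in range(5000):
    u = u - dt * rho * num_grad(Psi, u)   # Euler step on du/dt = -rho * grad Psi_p(u)
print(Psi(u))                             # has decreased from Psi(u0), ideally toward 0
```

In practice one would supply the analytic gradient from Theorem 3.1 and a dedicated ODE solver, as the paper does; the Euler scheme above is only meant to expose the structure of (12).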
4. Stability analysis
In this section, we are interested in the stability analysis of the proposed neural network (12). By these theoretical analyses, the desired optimal solution of SOCP (1) can always be obtained by setting the initial state of the network to an arbitrary value. In order to study the stability of the proposed neural network (12) for solving SOCP (1), we first make an assumption, needed in our subsequent analysis in order to avoid singularity of $\nabla H(u)$.
Assumption 4.1.

(a) The SOCP problem (1) satisfies Slater's condition.

(b) The matrix $[A^T\ \ \nabla g(x)]$ has full column rank, and the matrix $\nabla_x L(x,y,z)$ is positive definite on the null space $\{t \mid At = 0\}$ of $A$.
Here we say a few words about Assumption 4.1(a) and (b). Slater's condition is a standard condition which is widely used in the optimization field. When $g$ is linear, Assumption 4.1(b) is indeed equivalent to the commonly used condition that $\nabla^2 f(x)$ is positive definite.
Lemma 4.1. Let $p = \frac{n}{2} \in (1,4)$ with $n \in \mathbb{N}$. Then, the following hold.

(a) Under Assumption 4.1, $\nabla H(u)$ is nonsingular for $u = (x, y, z) \in \mathbb{R}^n \times \mathbb{R}^m \times \mathbb{R}^l$ with $(z, g(x)) \ne 0$.

(b) Every stationary point of $\Psi_p$ is a global minimizer of problem (10) for $(z, g(x)) \ne 0$.

(c) $\Psi_p(u(t))$ is nonincreasing with respect to $t$.

Proof. (a) Suppose $\xi = (s, t, v) \in \mathbb{R}^m \times \mathbb{R}^n \times \mathbb{R}^l$. From the expression (11) of $\nabla H(u)$ in Theorem 3.1, to show the nonsingularity of $\nabla H(u)$, it is enough to prove that

$$\nabla H(u)\, \xi = 0 \;\Longrightarrow\; s = 0,\ t = 0 \text{ and } v = 0.$$

Indeed, $\nabla H(u)\, \xi = 0$ gives

$$At = 0, \qquad A^T s + \nabla_x L(x,y,z)\, t + \nabla g(x)\, V_1 v = 0 \tag{13}$$

and

$$-\nabla g(x)^T t + V_2 v = 0. \tag{14}$$

From (13) and $At = 0$, it follows that

$$t^T \nabla_x L(x,y,z)\, t + t^T \nabla g(x)\, V_1 v = 0. \tag{15}$$

Moreover, by Eq. (14), we obtain

$$t^T \nabla g(x) = v^T V_2^T. \tag{16}$$

Then, combining (15) and (16) yields

$$t^T \nabla_x L(x,y,z)\, t + v^T V_2^T V_1 v = 0.$$

By Lemma 3.2 and Assumption 4.1(b), it is not hard to see that $t = 0$. In addition, from (13) and (14), we have

$$A^T s + \nabla g(x)\, V_1 v = 0 \quad \text{and} \quad V_2 v = 0.$$

By Assumption 4.1(b) again, we also get that $s = 0$ and $V_1 v = 0$. Thus, combining Lemma 3.2 with the expressions of $V_1$ and $V_2$ in Theorem 3.1, we have $v = 0$. Therefore, $\nabla H(u)$ is nonsingular.

(b) Suppose that $u^*$ is a stationary point of $\Psi_p$. This says $\nabla \Psi_p(u^*) = 0$, and from Theorem 3.1, we have $\nabla H(u^*)\, H(u^*) = 0$. According to part (a), $\nabla H(u^*)$ is nonsingular. Hence, it follows that $H(u^*) = 0$, i.e., $\Psi_p(u^*) = 0$, which says $u^*$ is a global minimizer of (10).

(c) By the definition of $\Psi_p(u(t))$ and (12), it is clear that

$$\frac{d\Psi_p(u(t))}{dt} = \nabla \Psi_p(u(t))^T\, \frac{du(t)}{dt} = -\rho\, \big\|\nabla \Psi_p(u(t))\big\|^2 \le 0.$$

Therefore, $\Psi_p(u(t))$ is nonincreasing with respect to $t$. □

Proposition 4.1. Assume that $\nabla H(u)$ is nonsingular for any $u \in \mathbb{R}^n \times \mathbb{R}^m \times \mathbb{R}^l$ and $p = \frac{n}{2} \in (1,4)$ with $n \in \mathbb{N}$. Then,
(a) $(x^*, y^*, z^*)$ satisfies the KKT conditions (4) if and only if $(x^*, y^*, z^*)$ is an equilibrium point of the neural network (12);

(b) under Slater's condition, $x^*$ is a solution of problem (1) if and only if $(x^*, y^*, z^*)$ is an equilibrium point of the neural network (12).

Proof. (a) It is easy to prove that $(x^*, y^*, z^*)$ satisfies the KKT conditions (4) if and only if $H(u^*) = 0$, where $u^* = (x^*, y^*, z^*)^T$. According to the condition that $\nabla H(u)$ is nonsingular, we have that $H(u^*) = 0$ if and only if $\nabla \Psi_p(u^*) = \nabla H(u^*)\, H(u^*) = 0$. Then the desired result follows.

(b) Under Slater's condition, it is well known that $x^*$ is a solution of problem (1) if and only if there exist $y^*$ and $z^*$ such that $(x^*, y^*, z^*)$ satisfies the KKT conditions (4). Hence, by part (a), it follows that $(x^*, y^*, z^*)$ is an equilibrium point of the neural network (12). □
The next result addresses the existence and uniqueness of the solution trajectory of the neural network (12).

Theorem 4.1. For any fixed $p = \frac{n}{2} \in (1,4)$ with $n \in \mathbb{N}$, the following hold.

(a) For any initial point $u_0 = u(t_0)$, there exists a unique continuous maximal solution $u(t)$ with $t \in [t_0, \tau)$ for the neural network (12), where $[t_0, \tau)$ is the maximal interval of existence.

(b) If the level set $\mathcal{L}(u_0) := \{u \mid \Psi_p(u) \le \Psi_p(u_0)\}$ is bounded, then $\tau$ can be extended to $+\infty$.

Proof. This proof is exactly the same as the proof of [33, Proposition 3.4]. Hence, we omit it here. □
Theorem 4.2. Assume that $\nabla H(u)$ is nonsingular and $u^*$ is an isolated equilibrium point of the neural network (12). Then, the solution of the neural network (12) with any initial point $u_0$ is Lyapunov stable.

Proof. From Lemma 2.3, we only need to argue that there exists a Lyapunov function over some neighborhood $\Omega$ of $u^*$. To this end, we consider the smoothed merit function for $p = \frac{n}{2} \in (1,4)$ with $n \in \mathbb{N}$:

$$\Psi_p(u) = \frac{1}{2}\|H(u)\|^2.$$

Since $u^*$ is an isolated equilibrium point of (12), there is a neighborhood $\Omega$ of $u^*$ such that

$$\nabla \Psi_p(u^*) = 0 \quad \text{and} \quad \nabla \Psi_p(u(t)) \ne 0 \;\; \forall u(t) \in \Omega \setminus \{u^*\}.$$

By the nonsingularity of $\nabla H(u)$ and the definition of $\Psi_p$, it is easy to obtain that $\Psi_p(u^*) = 0$. From the definition of $\Psi_p$, we claim that $\Psi_p(u(t)) > 0$ for any $u(t) \in \Omega \setminus \{u^*\}$, where $\Omega$ is a neighborhood of $u^*$. If not, that is, if $\Psi_p(u(t)) = 0$, it follows that $H(u(t)) = 0$. Then we have $\nabla \Psi_p(u(t)) = 0$, which contradicts the assumption that $u^*$ is an isolated equilibrium point of (12). Thus, $\Psi_p(u(t)) > 0$ for any $u(t) \in \Omega \setminus \{u^*\}$. Moreover, by the proof of Lemma 4.1(c), we know that for any $u(t) \in \Omega$,

$$\frac{d\Psi_p(u(t))}{dt} = \nabla \Psi_p(u(t))^T\, \frac{du(t)}{dt} = -\rho\, \|\nabla \Psi_p(u(t))\|^2 \le 0. \tag{17}$$

Therefore, the function $\Psi_p$ is a Lyapunov function over $\Omega$. This implies that $u^*$ is Lyapunov stable for the neural network (12). □

Theorem 4.3. Assume that $\nabla H(u)$ is nonsingular and $u^*$ is an isolated equilibrium point of the neural network (12). Then, $u^*$ is asymptotically stable for the neural network (12).
Proof. As in the proof of Theorem 4.2, we consider again the Lyapunov function $\Psi_p$ for $p = \frac{n}{2} \in (1,4)$ with $n \in \mathbb{N}$. By Lemma 2.3 again, we only need to verify that the Lyapunov function $\Psi_p$ over some neighborhood $\Omega$ of $u^*$ satisfies

$$\frac{d\Psi_p(u(t))}{dt} < 0, \quad \forall u(t) \in \Omega \setminus \{u^*\}. \tag{18}$$

In fact, by using (17) and the definition of an isolated equilibrium point, it is not hard to check that Eq. (18) is true. Hence, $u^*$ is asymptotically stable. □
Theorem 4.4. Assume that $u^*$ is an isolated equilibrium point of the neural network (12). If $\nabla H(u)^T$ is nonsingular for any $u = (x, y, z) \in \mathbb{R}^n \times \mathbb{R}^m \times \mathbb{R}^l$, then $u^*$ is exponentially stable for the neural network (12).

Proof. From the definition of $H(u)$ and Lemma 2.6, we have

$$H(u) = H(u^*) + \nabla H(u(t))^T\, (u - u^*) + o(\|u - u^*\|), \quad \forall u \in \Omega \setminus \{u^*\}, \tag{19}$$

where $\nabla H(u(t))^T \in \partial H(u(t))$ and $\Omega$ is a neighborhood of $u^*$. Now, letting

$$g(u(t)) = \|u(t) - u^*\|^2, \quad t \in [t_0, \infty),$$

we have

$$\frac{dg(u(t))}{dt} = 2\big(u(t) - u^*\big)^T\, \frac{du(t)}{dt} = -2\rho\, \big(u(t) - u^*\big)^T \nabla \Psi_p(u(t)) = -2\rho\, \big(u(t) - u^*\big)^T \nabla H(u)\, H(u). \tag{20}$$

Substituting (19) into (20) and using $H(u^*) = 0$ yields

$$\frac{dg(u(t))}{dt} = -2\rho\, \big(u(t) - u^*\big)^T \nabla H(u(t)) \Big( H(u^*) + \nabla H(u(t))^T \big(u(t) - u^*\big) + o(\|u(t) - u^*\|) \Big)$$
$$= -2\rho\, \big(u(t) - u^*\big)^T \nabla H(u(t))\, \nabla H(u(t))^T \big(u(t) - u^*\big) + o\big(\|u(t) - u^*\|^2\big).$$

Since $\nabla H(u)$ and $\nabla H(u)^T$ are nonsingular, we claim that there exists a $\kappa > 0$ such that

$$\big(u(t) - u^*\big)^T \nabla H(u)\, \nabla H(u)^T \big(u(t) - u^*\big) \ge \kappa\, \|u(t) - u^*\|^2. \tag{21}$$

Otherwise, if $(u(t) - u^*)^T \nabla H(u(t))\, \nabla H(u(t))^T (u(t) - u^*) = 0$, this implies that $\nabla H(u(t))^T (u(t) - u^*) = 0$. Indeed, from the nonsingularity of $\nabla H(u)$, we would have $u(t) - u^* = 0$, i.e., $u(t) = u^*$, which contradicts the assumption that $u^*$ is an isolated equilibrium point. Therefore, there exists a $\kappa > 0$ such that (21) holds. Moreover, for the term $o(\|u(t) - u^*\|^2)$, there is an $\varepsilon > 0$ such that $o(\|u(t) - u^*\|^2) \le \varepsilon\, \|u(t) - u^*\|^2$. Hence,

$$\frac{dg(u(t))}{dt} \le (-2\rho\kappa + \varepsilon)\, \|u(t) - u^*\|^2 = (-2\rho\kappa + \varepsilon)\, g(u(t)).$$

This implies

$$g(u(t)) \le e^{(-2\rho\kappa + \varepsilon)t}\, g(u(t_0)),$$

which means

$$\|u(t) - u^*\| \le e^{\left(-\rho\kappa + \frac{\varepsilon}{2}\right)t}\, \|u(t_0) - u^*\|.$$

Thus, $u^*$ is exponentially stable for the neural network (12). □
5. Numerical examples
In order to demonstrate the effectiveness of the proposed neural network, we test several examples for our neural network (12) in this section. The numerical implementation is coded in Matlab 7.0, and the ordinary differential equation solver adopted here is ode23, which uses the Runge–Kutta (2,3) formula. As mentioned earlier, the parameter $\rho$ is set to be 1. How is $u_0$ chosen initially? From Theorem 4.2 in the last section, we know the solution will