• 沒有找到結果。

Recurrentneuralnetworksforsolvingsecond-orderconeprograms Neurocomputing

N/A
N/A
Protected

Academic year: 2022

Share "Recurrentneuralnetworksforsolvingsecond-orderconeprograms Neurocomputing"

Copied!
8
0
0

加載中.... (立即查看全文)

全文

(1)

Recurrent neural networks for solving second-order cone programs

Chun-Hsu Ko

a

, Jein-Shan Chen

b,1,

 , Ching-Yu Yang

b

aDepartment of Electrical Engineering, I-Shou University, Kaohsiung County 840, Taiwan

bDepartment of Mathematics, National Taiwan Normal University, Taipei 11677, Taiwan

a r t i c l e i n f o

Article history:

Received 6 January 2011 Received in revised form 1 June 2011

Accepted 14 July 2011 Communicated by Y. Liu Available online 10 August 2011 Keywords:

SOCP Neural network Merit function

Fischer–Burmeister function Cone projection function Lyapunov stable

a b s t r a c t

This paper proposes using the neural networks to efficiently solve the second-order cone programs (SOCP). To establish the neural networks, the SOCP is first reformulated as a second-order cone complementarity problem (SOCCP) with the Karush–Kuhn–Tucker conditions of the SOCP. The SOCCP functions, which transform the SOCCP into a set of nonlinear equations, are then utilized to design the neural networks. We propose two kinds of neural networks with the different SOCCP functions. The first neural network uses the Fischer–Burmeister function to achieve an unconstrained minimization with a merit function. We show that the merit function is a Lyapunov function and this neural network is asymptotically stable. The second neural network utilizes the natural residual function with the cone projection function to achieve low computation complexity. It is shown to be Lyapunov stable and converges globally to an optimal solution under some condition. The SOCP simulation results demonstrate the effectiveness of the proposed neural networks.

&2011 Elsevier B.V. All rights reserved.

1. Introduction

Second-order cone program (SOCP) has been widely applied in engineering optimization[1]. It requires solving the optimization problem subject to the linear equality and second-order cone inequality constraints [2]. Numerical approaches such as the interior-point method[1] or the merit function method[3] can effectively solve the SOCP. However, many engineering dynamic systems, such as force analysis in robot grasping[1,4] and control applications [5,6], require the real-time SOCP solutions. As a result, efficient approaches for solving the real-time SOCP are needed. Prior research[7–18] indicates that the neural networks can be used to solve various optimization problems. Furthermore, the neural networks based on circuit implementation exhibit the real-time processing ability. We consider that it is appropriate to utilize the neural networks for efficiently solving the SOCP problems.

The recurrent neural network was introduced by Hopfield and Tank[7]for solving linear programming problems. Kennedy and Chua[8]proposed an extended neural network for solving non- linear convex programming problems thereafter, while their

approach involves the penalty parameter which affects the neural network accuracy. To find the exact solutions, more neural networks for optimization have been further developed. Among them, the primal-dual neural network [9–11] with the global stability is proposed for providing the exact solutions of the linear and quadratic programming problems. The projection neural network, developed by Xia and Wang[12,14,15], was proposed to efficiently solve many optimization problems and variational inequalities. Since the SOCP is a nonlinear convex problem, both primal-dual neural network[16]and projection neural network [17]can be used to provide the SOCP solution. However, they require many state variables, leading to high model complexity. It thus motivates the development of more compact neural networks for SOCP.

The SOCP can be solved by analyzing its Karush–Kuhn–Tucker (KKT) optimality conditions which leads to the second-order cone complementarity problem (SOCCP) [3,19,20]. The approaches [3,20] based on the SOCCP functions, such as Fischer–Burmeister (FB) and natural residual functions, can be further utilized for solving the SOCCP. In the merit function approach [3], an unconstrained smooth minimization with the FB function is achieved in finding the SOCCP solution. On the other hand, the semi-smooth approach [20] uses the natural residual function with the cone projection (CP) function to reformulate the SOCCP as a set of nonlinear equations and then apply the non-smooth Newton method to obtain the solution. Previous studies have demonstrated the feasibility of these SOCCP functions in solving the SOCP problems. We also use them in our neural network Contents lists available atScienceDirect

journal homepage:www.elsevier.com/locate/neucom

Neurocomputing

0925-2312/$ - see front matter & 2011 Elsevier B.V. All rights reserved.

doi:10.1016/j.neucom.2011.07.009

Corresponding author. Member of Mathematics Division, National Center for Theoretical Sciences, Taipei Office.

E-mail addresses: [email protected] (C.-H. Ko), [email protected] (J.-S. Chen), [email protected] (C.-Y. Yang).

1The author’s work is partially supported by National Science Council of Taiwan.

(2)

design. In this paper, we propose two novel neural networks for efficiently solving the SOCP problems. One is based on the gradient of the smooth merit function derived from the FB function[18]. The other is an extended projection neural network by replacing the scalar projection function[12,14,15] with the CP function. These neural networks are with less state variables than those previously proposed[16,17] for solving the SOCP. Further- more, they are shown to be stable and globally convergent to the SOCP solutions.

This paper is organized as follows.Section 2introduces the second-order cone program and its SOCCP formulation. InSection 3, the neural network based on the Fischer–Burmeister function is proposed and analyzed. InSection 4, the second neural network based on the cone projection function is proposed. Its global stability is also verified. InSection 5, several SOCP examples are presented to demonstrate the effectiveness of the proposed neural networks. Finally, the conclusions are given inSection 6.

2. Problem formulation

In this section, we introduce the second-order cone program and reformulate it as a second-order cone complementarity problem. The second-order cone program is in the form of minimize f ðxÞ

subject to Ax ¼ b, x A K: ð1Þ

Here f :Rn-R is a nonlinear continuously differentiable func- tion, A ARmnis a full row rank matrix, bARmis a vector, and K is a Cartesian product of second-order cones (or Lorentz cones), expressed as

K ¼ Kn1Kn2    KnN, ð2Þ

where N,n1, . . . ,nNZ1,n1þ    þnN¼n, and Kni:¼ fðxi1,xi2, . . . ,xiniÞTARni9 Jðxi2, . . . ,xiniÞJrxi1g

with J  J denoting the Euclidean norm and K1 the set of non- negative realsRþ. A special case of Eq. (2) is K ¼Rnþ, namely the nonnegative orthant in Rn, which corresponds to N ¼n and n1¼    ¼nN¼1. When f is linear, i.e., f ¼ cTx with c ARn, SOCP (1) reduces to the following linear SOCP:

minimize cTx

subject to Ax ¼ b, x A K: ð3Þ

The KKT optimality conditions for (1) are given by

r

f ðxÞATyl¼0, xTl¼0, x A K, lAK, Ax ¼ b,

8>

<

>: ð4Þ

where y ARmandlARn. When f is convex, these conditions are sufficient for optimality. It also can be written as

xTð

r

f ðxÞATyÞ ¼ 0, x A K,

r

f ðxÞATy A K, Ax ¼ b:

(

ð5Þ

By solving the system (5), we may obtain a primal-dual optimal solution of SOCP (1). Note that system (5) involves the SOCCP. To efficiently solve it, we propose using the neural network approaches with the FB function and CP function, respectively, described below.

3. Neural network design with Fischer–Burmeister function

It is known that the merit function approach[3]can be used for solving system (5). Motivated by this approach, we propose a

neural network with the Fischer–Burmeister function to find the minimal of the merit function and study its global stability.

In [3], system (5) is shown to be equivalent to an uncon- strained smooth minimization problem via the merit function approach, described as

min Eðx,yÞ ¼CFBðx,

r

f ðxÞATyÞ þ12JAxbJ2, ð6Þ where Eðx,yÞ is a merit function, CFBðx,yÞ ¼12PN

i ¼ 1JfFBðxi,yiÞJ2, x ¼ ðx1, . . . ,xNÞT, y ¼ ðy1, . . . ,yNÞTARn1    RnN, and fFB is the Fischer–Burmeister function defined as

fFBðxi,yiÞ:¼ ðx2iþy2iÞ1=2xiyi: ð7Þ Based on the gradient of the objective Eðx,yÞ in minimization problem (6), we propose the first neural network for solving the SOCP, with the following dynamic equation:

d dt

x y !

¼

r



r

xEðx,yÞ



r

yEðx,yÞ

!

, ð8Þ

where

r

is a positive scaling factor and

r

xEðx,yÞ ¼

r

xCFBðx,

r

f ðxÞAT

þ

r

2f ðxÞ 

r

yCFBðx,

r

f ðxÞATyÞ þ ATðAxbÞ,

r

yEðx,yÞ ¼ A 

r

yCFBðx,

r

f ðxÞATyÞ:

8>

><

>>

:

ð9Þ

For linear SOCP (3), the above equations reduce to

r

xEðx,yÞ ¼

r

xCFBðx,cATyÞ þ ATðAxbÞ,

r

yEðx,yÞ ¼ A 

r

yCFBðx,cATyÞ:

(

ð10Þ

Note that the Jordan product[3]is required for calculating

r

xCFB

and

r

yCFBwhich are introduced in Appendix. And, the dynamic equation (8) can be realized by a recurrent neural network with FB function as shown inFig. 1. The circuit for the neural network realization requires n þ m integrators, n processors for

r

f ðxÞ, n2 processors for

r

2f ðxÞ, n processors for

r

xCFB, m processors for

r

yCFB, 4mn connection weights and some summers. Further- more, the neural network (8) is asymptotically stable, as proven in the following theorem.

Fig. 1. Block diagram of the proposed neural network with FB function.

(3)

Theorem 3.1. If un¼ ðxn,ynÞ is an isolated equilibrium point of neural network (8), then un¼ ðxn,ynÞis asymptotically stable for (8).

Proof. We assume that un¼ ðxn,ynÞ is an isolated equilibrium point of neural network (8) over a neighborhoodOnDRn of un such that

r

Eðxn,ynÞ ¼0 and

r

Eðx,yÞa0, 8ðx,yÞAOn\fðxn,ynÞg. First we show that Eðx,yÞ is a Lyapunov function for unatOn. Since

r

yEðxn,ynÞ ¼ A 

r

yCFBðxn,

r

f ðxnÞATynÞ ¼0, from Lemma 3 and Proposition 1 of[3], we have

r

xCFBðxn,

r

f ðxnÞATynÞ ¼

r

yCFBðxn,

r

f ðxnÞATynÞ ¼0:

Moreover, from Proposition 1 of[3], this says CFBðxn,

r

f ðxnÞATynÞ ¼0:

Then from Eq. (9),

r

xEðxn,ynÞ ¼

r

xCFBðxn,

r

f ðxnÞATynÞ

þ

r

2f ðxnÞ 

r

yCFBðxn,

r

f ðxnÞATynÞ þATðAxnbÞ ¼ 0,

which implies that ATðAxnbÞ ¼ 0. Because A ARmn is a full row rank matrix, we must have Axnb ¼ 0, which yields

Eðxn,ynÞ ¼CFBðxn,

r

f ðxnÞATynÞ þ12JAxnbJ2¼0:

Next, we claim that Eðx,yÞ 4 0, 8ðx,yÞ AOn\fðxn,ynÞg. If not, there is an ðx,yÞ AOn\fðxn,ynÞg such that Eðx,yÞ ¼ 0, this says that CFBðx,

r

f ðxÞATyÞ ¼ 0 and Ax ¼b, then

r

xEðx,yÞ ¼ 0 and

r

yEðx,yÞ ¼ 0. Hence, (x,y) is an equilibrium point of neural net- work (8), contradicting with that un¼ ðxn,ynÞis an isolate equili- brium point. Finally,

dEðxðtÞ,yðtÞÞ

dt ¼ ½

r

ðxðtÞ,yðtÞÞEðxðtÞ,yðtÞÞTð

rr

ðxðtÞ,yðtÞÞEðxðtÞ,yðtÞÞÞ

¼ 

r

J

r

ðxðtÞ,yðtÞÞEðxðtÞ,yðtÞÞJ2r0:

Therefore, the function Eðx,yÞ is a Lyapunov function for neural network (8) over the set On. Since un¼ ðxn,ynÞ is an isolated equilibrium point of neural network (8), we have

dEðxðtÞ,yðtÞÞ

dt o0, 8ðxðtÞ,yðtÞÞAOn\fðxn,ynÞg:

Thus, unis asymptotically stable for neural network (8). &

4. Neural network design with cone projection function In this section, we propose another neural network associated with the cone projection function to solve system (5) for obtain- ing the SOCP solution and study its stability. In fact, from [24, Proposition 3.3], we know that such cone projection onto K has a special formula given as

PKðzÞ ¼ ½l1ðzÞþuð1Þz þ ½l2ðzÞþuð2Þz ,

where ½þmeans the scalar projection,l1ðzÞ,l2ðzÞ and uð1Þz , uð2Þz are the spectral values and the associated spectral vectors of z ¼ ðz1,z2Þ ARRn1, respectively, given by

liðzÞ ¼ z1þ ð1ÞiJz2J, uðiÞz ¼1

2 1,ð1Þi z2

Jz2J

 

, 8>

<

>:

for i¼ 1,2. The CP function PK(z) has the following property, called projection theorem [21], which is useful in our subsequent analysis.

Property 4.1. Let K be a nonempty closed convex subset ofRn. Then, for each z ARn, PK(z) is the unique vector z A K satisfying ðyzÞTðzzÞr0, 8yAK.

Employing the natural residual function with the CP function [19,20], system (5) can be equivalently written as

xPKðx

r

f ðxÞ þ ATyÞ ¼ 0, Axb ¼ 0,

(

ð11Þ

where x ¼ ðx1, . . . ,xNÞTARn1    RnNwith xi¼ ðxi1,xi2, . . . ,xiniÞT, i ¼ 1, . . . ,N, and PKðxÞ ¼ ½PKðx1Þ, . . . ,PKðxNÞT.

Based on the equivalent formulation in (11) and employing the ideas for networks used in[12,13], we consider the second neural network for solving the SOCP, with the following dynamic equations:

d dt

x y !

¼

r

x þ PKðx

r

f ðxÞ þAT

Ax þ b

!

, ð12Þ

where

r

is a positive scaling factor. The dynamic can be realized by a recurrent neural network with the cone projection function as shown inFig. 2. The circuit for the neural network realization requires n þ m integrators, n processors for

r

f ðxÞ, N processors for cone projection mapping PK, 2mn connection weights and some summers. Compared with the first neural network in (8), the second neural network (12) dose not require to calculate

r

2f ðxÞ, resulting in lower model complexity.

To analyze the stability of the neural network in Eq. (12), we first give three lemmas and one proposition.

Lemma 4.1. Let F(u) be defined as

FðuÞ :¼ Fðx,yÞ :¼ x þ PKðx

r

f ðxÞ þAT

Ax þb

!

: ð13Þ

Then, F(u) is semi-smooth. Moreover, F(u) is strongly semi-smooth if

r

2f ðxÞ is locally Lipschitz continuous.

Proof. This is an immediate consequence of[20, Theorem 1]. &

Proposition 4.1. For any initial point u0¼ ðx0,y0Þ where x0:¼ xðt0Þ AK, there exists a unique solution uðtÞ ¼ ðxðtÞ,yðtÞÞ for neural network (12). Moreover, xðtÞ A K.

Proof. For simplicity, we assume K ¼ Kn. The analysis can be carried over to the general case where K is the Cartesian product of second-order cones. From Lemma 4.1, FðuÞ :¼ Fðx,yÞ is semi- smooth and Lipschitz continuous. Thus, there exists a unique solution uðtÞ ¼ ðxðtÞ,yðtÞÞ for neural network (12). Therefore, it remains to show that xðtÞ A Kn. For convenience, we denote xðtÞ :¼ ðx1ðtÞ,x2ðtÞÞ ARRn1. To complete the proof, we need to verify two things: (i) x1ðtÞ Z 0 and (ii) Jx2ðtÞJrx1ðtÞ. First, from (12), we have

dx

dtþ

r

xðtÞ ¼

r

PKðx

r

f ðxÞ þ ATyÞ:

The solution of the first-order ordinary differential equation above is

xðtÞ ¼ erðtt0Þxðt0Þ þ

r

ert Z t

t0

ersPKðx

r

f ðxÞ þ ATyÞds:

If we let xðt0Þ:¼ ðx1ðt0Þ,x2ðt0ÞÞ ARRn1 and denote zðtÞ :¼ ðz1ðt0Þ,z2ðt0ÞÞas the term PKðxð

r

f ðxÞATyÞÞ, which leads to x1ðtÞ ¼ erðtt0Þx1ðt0Þ þ

r

ert

Z t t0

ersz1ðsÞds,

x2ðtÞ ¼ erðtt0Þx2ðt0Þ þ

r

ert Z t

t0

ersz2ðsÞds:

Due to both x0ðtÞ and z(t) belong to Kn, there have x1ðt0Þ Z0, Jx2ðt0ÞJrx1ðt0Þ and z1ðtÞ Z0, Jz2ðtÞJrz1ðtÞ. Therefore, x1ðtÞ Z0 since both terms in the right-hand side are nonnegative.

(4)

In addition,

Jx2ðtÞJrerðtt0ÞJx2ðt0ÞJþ

r

ert Z t

t0

ersJz2ðsÞJds

rerðtt0Þx1ðt0Þ þ

r

ert Zt

t0

ersz1ðsÞds ¼ x1ðtÞ,

which implies that xðtÞ A Kn. &

Lemma 4.2. Let H(u) be defined as

HðuÞ :¼ Hðx,yÞ :¼

r

f ðxÞATy Axb

!

: ð14Þ

Then, H is a monotone function if f is a convex function. Moreover,

r

HðuÞ is positive semi-definite if and only if

r

2f ðxÞ is positive semi- definite.

Proof. Let u ¼ ðx,yÞ and ~u ¼ ð ~x, ~yÞ. Then, the monotonicity of H holds since

ðu ~uÞTðHðuÞHð ~uÞÞ ¼ ðx ~xÞTð

r

f ðxÞ

r

f ð ~xÞÞðx ~xÞTðATðy ~yÞÞ þ ðy ~yÞTðAðx ~xÞÞ ¼ ðx ~xÞTð

r

f ðxÞ

r

f ð ~xÞÞZ 0,

where the last inequality is due to the convexity of f(x), see[22, Theorem 3.4.5]. Furthermore, we observe that

r

HðuÞ ¼

r

2f ðxÞ AT

A 0

" #

:

Thus, we have

uT

r

HðuÞu ¼ ½xT yT

r

2f ðxÞ AT

A 0

" #

x y

" #

¼xT

r

2f ðxÞx,

which indicates that the positive semi-definiteness of

r

HðuÞ is equivalent to the positive semi-definiteness of

r

2f ðxÞ. &

Lemma 4.3. Let F(u) and H(u) be defined as in (13) and (14), respectively. Also, let un¼ ðxn,ynÞbe an equilibrium point of neural network (12) with xnbeing an optimal solution of SOCP. Then, the following inequalities hold:

ðFðuÞ þuunÞTðFðuÞHðuÞÞ Z0: ð15Þ

Proof. First, we denotel:¼

r

f ðxÞATy. Then, we obtain ðFðuÞ þuunÞTðFðuÞHðuÞÞ

Fig. 2. Block diagram of the proposed neural network with CP function.

(5)

¼ x þ PKðxlÞ þ ðxxnÞ ðAx þ bÞ þ ðyynÞ

" #T

xPKðxlÞl ðAxbÞðAxbÞ

" #

¼ xnþPKðxlÞ ðAx þ bÞ þ ðyynÞ

" #T

ðxlÞPKðxlÞ 0

 

¼ ðxnPKðxlÞÞTððxlÞPKðxlÞÞ:

Since xnAK, applyingProperty 4.1gives ðxnPKðxlÞÞTððxlÞPKðxlÞÞr0:

Thus, inequality (15) is proved. &

We now investigate the stability and convergence issues of neural network (12). First, we analyze the behavior of the solution trajectory of neural network (12) including existence and con- vergence. We then establish two kinds of stability for an isolated equilibrium point.

We know that every solution un to SOCP is an equilibrium point of neural network (12). If further unis an isolated equili- brium point of neural network (12), we show that unis Lyapunov stable.

Theorem 4.1. If f is convex and twice differentiable, then the solution of neural network (12), with initial point u0¼ ðx0,y0Þwhere x0AK, is Lyapunov stable. Moreover, the solution trajectory of neural network (12) is extendable to the global existence.

Proof. Again, for simplicity, we assume K ¼ Kn. FromProposition 4.1, there exists a unique solution uðtÞ ¼ ðxðtÞ,yðtÞÞ for neural network (12) and xðtÞ A Kn. Let un¼ ðxn,ynÞbe an equilibrium point of neural network (12) with xnbeing an optimal solution of SOCP.

We define a Lyapunov function as below:

EðuÞ :¼ Eðx,yÞ :¼ HðuÞTFðuÞ12JFðuÞJ2þ12JuunJ2, ð16Þ where F(u) and H(u) are given as in (13) and (14), respectively.

From[23, Theorem 3.2], we know that E is continuously differ- entiable with

r

EðuÞ ¼ HðuÞ½

r

HðuÞIFðuÞ þ ðuunÞ:

It is also trivial that EðunÞ ¼0. Then, we have dEðuðtÞÞ

dt ¼

r

EðuðtÞÞTdu

dt ¼ fHðuÞ½

r

HðuÞIFðuÞ þ ðuunÞgT

r

FðuÞ

¼

r

f½HðuÞ þðuunÞTFðuÞ þ JFðuÞJ2FðuÞT

r

HðuÞFðuÞg:

Hence, inequality (15) inLemma 4.3implies ðHðuÞ þ uunÞTFðuÞrHðuÞTðuunÞJFðuÞJ2, which yields

dEðuðtÞÞ

dt r

r

fHðuÞTðuunÞFðuÞT

r

HðuÞFðuÞg

¼

r

fHðunÞTðuunÞðHðuÞHðunÞÞTðuunÞ

FðuÞT

r

HðuÞFðuÞg: ð17Þ

On the other hand, we know that

ðFðunÞ þunuÞTðFðunÞHðunÞÞ ¼ ðxPKðxnlnÞÞTððxnlnÞPKðxnlnÞÞ:

Since x A Kn, applyingProperty 4.1gives ðxPKðxnlnÞÞTððxnlnÞPKðxnlnÞÞr0:

Thus, we have ðFðunÞ þunuÞTðFðunÞHðunÞÞ Z0. Note that FðunÞ ¼0, we therefore obtain HðunÞTðuunÞTr0. Also the mono- tonicity of H implies ðHðuÞHðunÞÞTðuunÞr0. In addition, f is convex and twice differentiable if and only if

r

2f ðxÞ is positive semidefinite and hence

r

H is positive semidefinite byLemma 4.2, i.e., the second term FðuÞT

r

HðuÞFðuÞr0. The above discussions lead to dEðuðtÞÞ=dtr0.

In order to obtain E(u) is a Lyapunov function and un is Lyapunov stable, we will show the following inequality:

HðuÞTFðuÞ Z JFðuÞJ2: ð18Þ

To see this, we first observe that

JFðuÞJ2þHðuÞTFðuÞ ¼ ðxPKðxlÞÞTððxlÞPKðxlÞÞ:

Since x A K, applyingProperty 4.1again, there holds ðxPKðxlÞÞTððxlÞPKðxlÞÞr0,

which yields the desired inequality (18). By combining Eq. (16) and inequality (18), we have

EðuÞ Z12JFðuÞJ2þ12JuunJ2,

which says EðuÞ 4 0 if uaun. Hence E(u) is indeed a Lyapunov function and unis Lyapunov stable. Moreover, it holds that Eðu0Þ ZEðuÞ Z12JuunJ2 for t Z t0, ð19Þ which means the solution trajectory u(t) is bounded. Hence, it can be extended to global existence. &

Theorem 4.2. Let un¼ ðxn,ynÞbe an equilibrium point of (12) with xnbeing an optimal solution of SOCP. If f is twice differentiable and

r

2f ðxÞ is positive definite, the solution of neural network (12), with initial point u0¼ ðx0,y0Þwhere x0AK, is globally convergent to un and has finite convergence time.

Proof. From (19), the level set Lðu0Þ:¼ fu 9 EðuÞrEðu0Þg

is bounded. Then, the Invariant Set Theorem [25] implies the solution trajectory u(t) converges to yas t-1 where yis the largest invariant set in

P¼ u A Lðu0Þ dEðuðtÞÞ dt ¼0









:

We will show that du=dt ¼ 0 if and only if dEðuðtÞÞ=dt ¼ 0 which yields that u(t) converges globally to the equilibrium point un¼ ðxn,ynÞ. Suppose du=dt ¼ 0, then it is clear that dEðuðtÞÞ=dt ¼

r

EðuÞTðdu=dtÞ ¼ 0. Let ^u ¼ ð ^x, ^yÞ AP which says dEð ^uðtÞÞ=dt ¼ 0.

From (17), we know that dEð ^uðtÞÞ

dt r

r

fðHð ^uÞHðunÞÞTð ^uunÞFð ^uÞT

r

Hð ^uÞFð ^uÞg:

Both terms inside the big parenthesis are nonpositive as shown in Lemma 4.2, so ðHð ^uÞHðunÞÞTð ^uunÞ ¼0, Fð ^uÞT

r

Hð ^uÞFð ^uÞ ¼ 0, and Fð ^uÞT

r

Hð ^uÞFð ^uÞ ¼ f ^x þ PKð ^x

r

f ð ^xÞ þ ATyÞg^ T

r

2f ð ^xÞf ^x

þPKð ^x

r

f ð ^xÞ þ ATyÞg ¼ 0:^

The condition of

r

2f ð ^xÞ being positive definite leads to

 ^x þPKð ^x

r

f ð ^xÞ þATyÞ ¼ 0,^

which is equivalent to d ^x=dt ¼ 0. On the other hand, similar to the arguments inLemma 4.2, we have

ð ^uunÞTðHð ^uÞHðunÞÞ ¼ ð ^xxnÞTð

r

f ð ^xÞ

r

f ðxnÞÞ

¼ ð ^xxnÞT

r

2f ðxsÞð ^xxnÞ ¼0,

where xsA½xn, ^x. Again, the condition of

r

2f ðxsÞ being positive definite yields ^x ¼ xn. Hence d ^y=dt ¼ 0 and therefore d ^uðtÞ=dt ¼ 0.

From above, u(t) converges globally to the equilibrium point un¼ ðxn,ynÞ. Moreover, with Theorem 4.1 and following the same arguments as in[12, Theorem 2], the neural network (12) has finite convergence time. &

(6)

5. Simulations

To demonstrate the effectiveness of the proposed neural net- works, three illustrative SOCP problems are tested, described as below.

Example 5.1. Consider the nonlinear convex SOCP[20]given by minimize exp ðx1x3Þ þ3ð2x1x2Þ4þ

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 þð3x2þ5x3Þ2 q

subject to Ax ¼ b, x A K3K2, where

A ¼ 4 6 3 1 0

1 7 5 0 1

 

and b ¼ 1

2

 

:

This problem has an approximate solution xn¼ ½0:2324,

0:07309,0:2206,0:153,0:153T. We use the proposed neural net- works with the FB and CP functions, respectively, to solve the problem with the trajectories obtained by them shown in Figs. 3 and 4. From the simulation results, we found that both trajectories are globally convergent to xn and the neural network with the CP function converged to xnquicker than that with the FB

function. On the other hand, the neural network with the CP function also has lower model complexity than that with the FB function as mentioned in Section 4. Hence, the neural network with the CP function is preferable to the neural network with the FB function when both can globally converge to the optimal solution.

Example 5.2. Consider the following linear SOCP given by minimize x1þx2þx3þx4þx5þx6

subject to Ax ¼ b, x A K3K3, where

A ¼

1 2 0 0 0 1

1 0 0 1 4 0

0 1 1 0 1 0

1 1 0 0 0 0

0 0 1 0 2 0

2 66 66 66 4

3 77 77 77 5

and b ¼ 9 20

6 4 8 2 66 66 66 4

3 77 77 77 5

This problem has an optimal solution xn¼ ½3,1,2,5,3,4T. Note that, its objective function is convex and the Hessian matrix

r

2f ðxÞ is a zero matrix. Hence, the neural network with the FB function is asymptotically stable from Theorem 3.1 while the neural network with the CP function is Lyapunov stable from Theorem 4.1.Figs. 5 and 6display the trajectories obtained using the neural networks with the FB and CP functions, respectively.

The simulation results show that both trajectories are convergent to xn. Coinciding with above results ofTheorems 3.1and4.1, the neural network with the CP function yields the oscillating trajectory and has longer convergence time than the neural network with the FB function.

Example 5.3. Consider the grasping force optimization problem for the multi-fingered robotic hand[1,4,17]. Its goal is to find the minimum grasping force for moving an object. For the robotic hand with m fingers, the optimization problem can be formulated as minimize 12fTf

subject to Gf ¼ fext

Jðfi1,fi2ÞJr

m

fi3 ði ¼ 1, . . . ,mÞ,

where f ¼ ½f11,f12, . . . ,fm3T is the grasping force, G the grasping transformation matrix, fextthe time-varying external wrench, and

m

the friction coefficient.

0 0.05 0.1 0.15 0.2

−0.2 0 0.2 0.4 0.6 0.8 1

Time (sec)

Trajectories of x (t)

x1

x2 x3

x4, x5

Fig. 3. Transient behavior of the neural network with FB function inExample 5.1.

0 0.005 0.01 0.015 0.02 0.025 0.03

−1

−0.5 0 0.5 1 1.5 2 2.5 3

Time (sec)

Trajectories of x (t)

x1

x2 x3 x4, x5

Fig. 4. Transient behavior of the neural network with CP function inExample 5.1.

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4

0 1 2 3 4 5 6

Time (sec)

Trajectories of x (t)

x1, x5

x2 x3 x4

x6

Fig. 5. Transient behavior of the neural network with FB function inExample 5.2.

(7)

Letting ½xi1,xi2,xi3 ¼ ½

m

fi3,fi1,fi2,i ¼ 1, . . . ,m, and x ¼ ½x11,x12, . . . , xm3T, the problem can be reformulated as a nonlinear convex SOCP. For the three-finger grasp example in[17], the robot hand grasps a polyhedral with the grasp points ½0,1,0T, ½1,0:5,0T, and

½0,1,0T, and the robot hand moves along a vertical circular trajectory of radius r with a constant velocity

n

. We reformulate the example as

minimize 12xTQx

subject to Ax ¼ b, x A K3K3K3, ð20Þ where Q ¼ diagð1=

m

2,1,1,1=

m

2,1,1,1=

m

2,1,1Þ

A ¼

0 0 1 1=

m

0 0 0 1 0

1=

m

0 0 0 0 1 1=

m

0 0

0 1 0 0 1 0 0 0 1

0 1 0 0 0:5 0 0 0 1

0 0 0 0 1 0 0 0 0

0 0 1 0:5=

m

0 1 0 1 0

2 66 66 66 66 64

3 77 77 77 77 75

and

b ¼

0

fcsinyðtÞ MgfccosyðtÞ

0 0 0 2 66 66 66 66 4

3 77 77 77 77 5 ,

where M is the mass of the polyhedral, g ¼9.8 m/s2, fc¼M

n

2=r the centripetal force, t the time, and y¼

n

t=r A ½0,2

p

. Note that problem (20) is a nonlinear convex SOCP and the matrix Q is positive definite. We know fromTheorems 3.1and4.2that both the proposed neural networks are globally convergent to the optimal solution. Under the conditions M¼0.1 kg, r ¼0.2 m,

n

¼0:4

p

m=s, and

m

¼0:6, the time-varying grasping force obtained from the proposed neural networks is shown inFig. 7.

We found that the maximum grasping force occurs at the position y¼

p

(t ¼0.5 s) which corresponds to the maximum downward wrench. The simulation results demonstrate that the neural networks are effective in the SOCP applications.

6. Conclusion

In this paper, we have proposed two neural networks for efficiently solving the SOCP. The first neural network is based on gradient of the merit function derived from the FB function and was shown to be asymptotically stable. The second neural network with the CP function has low model complexity, and has been shown to be Lyapunov stable and converge globally to the SOCP solution under the positive definite condition of Hessian matrix of the objective function. The convergence of the neural networks has been validated with the simulation results of the SOCP examples.

When the second neural network with the CP functions yields oscillating trajectory, we can employ the neural network based on FB function instead, though it has higher model complexity. The proposed neural networks are thus ready for the SOCP applications.

During the reviewing process of this paper, we published another paper[26] which focuses on second-order cone constrained varia- tional inequality problem. Since the KKT conditions of second-order cone programs can be recast as variational inequality problem, the paper [26] indeed deals with a broader class of optimization problems. However, the two neural networks considered therein are different from the two neural networks studied in this paper.

More specifically, the FB method used in [26] is based on the smoothed FB function while the one studied here is based on regular FB function; the CP method in[26]is based on a Lagrangian model which is, even when it reduces to SOCP, not the same as the one investigated here. Due to the essential difference, the assumptions used to establish stability are also different. In view of this, it will be an interesting topic to do numerical comparison among these neural networks for SOCP.

Acknowledgement

The work was supported by National Science Council of Taiwan under the Grant NSC 97-2221-E-214-034.

Appendix

In this appendix, we introduce the Jordan product and its properties used in the neural network with the FB function, which are needed when we write codes for simulations.

0 0.2 0.4 0.6 0.8 1

0 1 2 3 4 5 6

Time (sec)

Trajectories of x (t)

x1, x5

x2 x3 x4

x6

Fig. 6. Transient behavior of the neural network with CP function inExample 5.2.

0 0.2 0.4 0.6 0.8 1

−1

−0.5 0 0.5 1 1.5 2

Time (sec)

Grasping force (N)

f11,f32

f12 f13

f21

f22 f23

f31 f33

Fig. 7. Grasping force obtained by using proposed neural networks inExample 5.3.

(8)

For any x ¼ ðx1,x2Þ ARRn1, their Jordan product is defined as

xJy ¼ ðxTy,y1x2þx1y2Þ:

Their sum of square is calculated by x2þy2¼ ðJxJ2þ JyJ2,2x1x2þ2y1y2Þ:

The square root of x is

x1=2¼ s,x2

2s

, s ¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1

2ðx1þ

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi x21Jx2J2 q

Þ r

if x ¼ 0, x1=2¼0 and the determinant of x is det ðxÞ ¼ x21Jx2J2. Furthermore, a matrix Lxis defined as

Lx¼ x1 xT2 x2 x1I

" # ,

and when detðxÞa0, Lxis invertible with L1x ¼ 1

det ðxÞ

x1 xT2

x2 detðxÞ x1 I þx1

1x2xT2 2

4

3 5:

Based on the properties of the Jordan product described above, the formulae of

r

xCFBðx,yÞ and

r

yCFBðx,yÞ in neural network (8) are calculated (see[3]) as

r

xCFBðx,yÞ ¼ ðLxL1ðx2þy2Þ1=2IÞfFBðx,yÞ and

r

yCFBðx,yÞ ¼ ðLyL1ðx2þy2Þ1=2IÞfFBðx,yÞ:

References

[1] M.S. Lobo, L. Vandenberghe, S. Boyd, H. Lebret, Applications of second-order cone programming, Linear Algebra and its Applications 284 (1) (1998) 193–228.

[2] F. Alizadeh, D. Goldfarb, Second-order cone programming, Mathematical Programming 95 (1) (2003) 3–51.

[3] J.-S. Chen, P. Tseng, An unconstrained smooth minimization reformulation of the second-order cone complementarity problem, Mathematical Program- ming 104 (2005) 293–327.

[4] S.P. Boyd, B. Wegbreit, A Fast computation of optimal contact forces, IEEE Transactions on Robotics 23 (6) (2007) 1117–1132.

[5] S. Boyd, C. Crusius, A. Hansson, Control applications of nonlinear convex programming, Journal of Control Process 8 (5) (1998) 313–324.

[6] D. Bertsimas, D.B. Brown, Constrained stochastic LQC: a tractable approach, IEEE Transactions on Automatic Control 52 (10) (2007) 1826–1841.

[7] D.W. Tank, J.J. Hopfield, Simple neural optimization networks: an A/D converter, signal decision circuit, and a linear programming circuit, IEEE Transactions on Circuits and Systems 33 (5) (1986) 533–541.

[8] M.P. Kennedy, L.O. Chua, A Neural network for nonlinear programming, IEEE Transaction on Circuits and Systems 35 (5) (1988) 554–562.

[9] Y.S. Xia, A new neural network for solving linear and quadratic programming problems, IEEE Transactions on Neural Networks 7 (6) (1996) 1544–1547.

[10] Q. Tao, J.D. Cao, M.S. Xue, H. Qiao, A high performance neural network for solving nonlinear programming problems with hybrid constraints, Physics Letters A 288 (2) (2001) 88–94.

[11] J. Wang, Q. Hu, D. Jiang, A Lagrangian neural network for kinematic control of redundant robot manipulators, IEEE Transactions on Neural Networks 10 (5) (1999) 1123–1132.

[12] Y. Xia, J. Wang, A recurrent neural network for solving nonlinear convex programs subject to linear constraints, IEEE Transactions on Neural Networks 16 (3) (2005) 379–386.

[13] Y. Xia, H. Leung, J. Wang, A projection neural network and its application to constrained optimization problems, IEEE Transactions on Circuits and Sys- tems – Part I 49 (2002) 447–458.

[14] Y. Xia, J. Wang, A recurrent neural network for nonlinear convex optimization subject to nonlinear inequality constraints, IEEE Transactions on Circuits and Systems I: Regular Papers 51 (7) (2004) 1385–1394.

[15] Y. Xia, H. Leung, J. Wang, A general projection neural network for solving monotone variational inequalities and related optimization problems, IEEE Transactions on Neural Networks 15 (2) (2004) 318–328.

[16] X. Mu, S. Liu, Y. Zhang, A neural network algorithm for second-order conic programming, in: Proceedings of the Second International Symposium on Neural Networks, Chongqing, China, Part II, 2005, pp. 718–724.

[17] Y. Xia, J. Wang, L.M. Fok, Grasping-force optimization for multifingered robotic hands using a recurrent neural network, IEEE Transactions on Robotics and Automation 20 (3) (2004) 549–554.

[18] L.Z. Liao, H.D. Qi, A neural network for the linear complementarity problem, Mathematical and Computer Modeling 29 (3) (1999) 9–18.

[19] J.S. Chen, X. Chen, P. Tseng, Analysis of nonsmooth vector-valued function associated with second-order cone, Mathematical Programming 101 (1) (2004) 95–117.

[20] C. Kanzow, I. Ferenczi, M. Fukushima, On the local convergence of semi- smooth Newton methods for linear and nonlinear second-order cone pro- grams without strict complementarity, SIAM Journal on Optimization 20 (2009) 297–320.

[21] D.P. Bertsekas, Nonlinear Programming, Athena Scientific, Belmont, MA, 1995.

[22] J.M. Ortega, W.C. Rheinboldt, Iterative Solution of Nonlinear Equations in Several Variables, SIAM, Philadelphia, 2000.

[23] M. Fukushima, Equivalent differentiable optimization problems and descent methods for asymmetric variational inequality problems, Mathematical Programming 53 (1) (1992) 99–110.

[24] M. Fukushima, Z.-Q. Luo, P. Tseng, Smoothing functions for second- order-cone complementarity problems, SIAM Journal on Optimization 12 (2002) 436–460.

[25] R. Golden, Mathematical Methods for Neural Network Analysis and Design, The MIT Press, Cambridge, MA, 1996.

[26] J. Sun, J.-S. Chen, C.-H. Ko, Neural networks for solving second-order cone constrained variational inequality problem, Computational Optimization and Applications, in press, doi:10.1007/s10589-010-9359-x.

Chun-Hsu Ko received the Ph.D. degree in Electrical and Control Engineering from National Chiao Tung University, Taiwan, ROC, in 2003. He worked at ITRI in Taiwan in 1994–1998. He is currently an Associate Professor in the Department of Electrical Engineering, I-Shou University, Taiwan. His research interests include neural networks, control, and robotics.

Jein-Shan Chen, an associate professor at National Taiwan Normal University, obtained his Ph.D. degree in mathematics under Prof. Paul Tseng from University of Washington in 2004. His research interest is mainly on continuous optimization. He has published over 50 papers including a few in top journals like Mathema- tical Programming, SIAM Journal on Optimization, etc.

Ching-Yu Yang obtained his Ph.D. degree in mathe- matics from National Taiwan Normal University, Tai- wan, ROC, in 2010. He is currently a Lecturer in the Department of Mathematics, National Taiwan Normal University. His research interest is optimization.

參考文獻

相關文件

Keywords Second-order cone · Variational inequality · Fischer-Burmeister function · Neural network · Lyapunov stable · Projection function.. Sun is also affiliated with Department

In this paper, we have studied a neural network approach for solving general nonlinear convex programs with second-order cone constraints.. The proposed neural network is based on

Wang, A recurrent neural network for solving nonlinear convex programs subject to linear constraints, IEEE Transactions on Neural Networks, vol..

In this paper, we have studied a neural network approach for solving general nonlinear convex programs with second-order cone constraints.. The neural network is based on the

Wang, Solving pseudomonotone variational inequalities and pseudo- convex optimization problems using the projection neural network, IEEE Transactions on Neural Network,

W ANG , A general projection neural network for solving monotone variational inequalities and related optimization problems, IEEE Transactions on Neural Networks, 15 (2004)

Abstract We establish that the Fischer–Burmeister (FB) complementarity function and the natural residual (NR) complementarity function associated with the symmetric cone have the

Abstract Based on a class of smoothing approximations to projection function onto second-order cone, an approximate lower order penalty approach for solving second-order cone