• 沒有找到結果。

AnovelgeneralizationofthenaturalresidualfunctionandaneuralnetworkapproachfortheNCP Neurocomputing

N/A
N/A
Protected

Academic year: 2022

Share "AnovelgeneralizationofthenaturalresidualfunctionandaneuralnetworkapproachfortheNCP Neurocomputing"

Copied!
15
0
0

加載中.... (立即查看全文)

全文

(1)

A novel generalization of the natural residual function and a neural network approach for the NCP

Jan Harold Alcantara, Jein-Shan Chen

,1

Department of Mathematics, National Taiwan Normal University, Taiwan

a r t i c l e i n f o

Article history:

Received 30 January 2020 Revised 20 April 2020 Accepted 11 June 2020 Available online 23 June 2020 Communicated by Q. Wei

MSC[2010]:

37-N40 65-K10 65-K15 Keywords:

Complementarity functions Natural residual function

Nonlinear complementarity problem

a b s t r a c t

The natural residual (NR) function is a mapping often used to solve nonlinear complementarity problems (NCPs). Recently, three discrete-type families of complementarity functions with parameter pP 3 (where p is odd) based on the NR function were proposed. Using a neural network approach based on these families, it was observed from some preliminary numerical experiments that lower values of p pro- vide better convergence rates. Moreover, higher values of p require larger computational time for the test problems considered. Hence, the value p¼ 3 is recommended for numerical simulations, which is rather unfortunate since we cannot exploit the wide range of values for the parameter p of the family of NCP functions. This paper is a follow-up study on the aforementioned results. Motivated by previously reported numerical results, we formulate a continuous-type generalization of the NR function and two corresponding symmetrizations. The new families admit a continuous parameter p> 0, giving us a wider range of choices for p and smooth NCP functions when p> 1. Moreover, the generalization subsumes the discrete-type generalization initially proposed. The numerical simulations show that in general, increased stability and better numerical performance can be achieved by taking values of p in the interval

1; 3

ð Þ. This is indeed a significant improvement of preceding studies.

Ó 2020 Elsevier B.V. All rights reserved.

1. Motivation

The nonlinear complementarity problem (NCP) is very impor- tant in engineering and economic applications[11], as well as in operations research [8]. In particular, given a mapping F: Rn! Rn, the problem consists of finding a vector x2 Rnsatis- fying the conditions

xP 0; F xð Þ P 0 and hx; F xð Þi ¼ 0:

This problem will be denoted by NCP(F). The solution set of this problem is denoted by SOL Fð Þ and the feasible region is denoted by XF:¼ x 2 Rf njx P 0; F xð Þ P 0g. Some solution methods for NCP(F) can be found in[1,6,9,10,13,14,21,17,18,25,27]. A natural reformu- lation of NCP(F) is to consider the fixed-point problem

x¼ PKðx F xð ÞÞ;

where PKdenotes the projection onto K with K¼ Rnþ. Consequently, NCP(F) is equivalent to solving the equation

/NRðx1; F1ð ÞxÞ ...

/NRðxn; Fnð ÞxÞ 0

BB

@

1 CC A ¼ 0;

where

/NRða; bÞ ¼ a  a  bð Þþ; ð1Þ

and tþ:¼ max t; 0f g. The function /NRis called the natural residual (NR) function. In fact, /NRcan be replaced by any other function / : R2! R with the property that

/ a; bð Þ ¼ 0 () aP 0; b P 0; ab ¼ 0; ð2Þ that is, NCP(F) and the system

UFð Þ :¼x

/ xð 1; F1ð ÞxÞ ...

/ xð n; Fnð ÞxÞ 0

BB

@

1 CC

A ¼ 0 ð3Þ

https://doi.org/10.1016/j.neucom.2020.06.059 0925-2312/Ó 2020 Elsevier B.V. All rights reserved.

Corresponding author.

E-mail addresses:[email protected](J.H. Alcantara),[email protected].

edu.tw(J.-S. Chen).

1 The author’s work is supported by Ministry of Science and Technology, Taiwan.

Contents lists available atScienceDirect

Neurocomputing

j o u r n a l h o m e p a g e : w w w . e l s e v i e r . c o m/ l o c a t e / n e u c o m

(2)

are equivalent. A function satisfying(2)is known in the literature as an NCP-function. Other than the NR function, the generalized Fischer-Burmeister (GFB) function

/pFBða; bÞ ¼ k a; bð Þkp a þ bð Þ; p > 1 ð4Þ is another popular NCP function used in dealing with the comple- mentarity problem. The GFB function is known as a ‘‘continuous”

extension of the famous Fischer-Burmeister (FB) function given by

/FBða; bÞ ¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi a2þ b2 q

 a þ bð Þ;

which can be obtained by taking p¼ 2 in expression(4). The gener- alization is considered continuous since p can take on any value in the interval 1ð ; 1Þ. Motivated by this extension, a generalization of the NR function(1)was formulated in[5]which is given by /pNRða; bÞ ¼ ap a  bð Þþp

; ð5Þ

where p is an odd integer. Indeed, taking p¼ 1 yields the NR func- tion(1). This generalization is considered to be of ‘‘discrete’” type since p can only take odd integral values. Note that /pNRis a twice continuously differentiable function for pP 3 but its surface is not symmetric. To resolve this, two symmetrizations were proposed in[3], which are given by

/pSNRða; bÞ ¼ ap a  bð Þp if aP b;

bp b  að Þp if a< b;

(

ð6Þ

and

wpSNRða; bÞ ¼ apbp a  bð Þpbp if aP b;

apbp b  að Þpap if a< b;

(

ð7Þ

where pP 3 is an odd integer. Properties of these three discrete- type families are elaborated in[2,15].

The first attempt to use the above three discrete-type functions in designing solution methods for NCP was a neural network approach, which was presented in our previous work[2]. To con- struct the neural network, note that by takingUF as defined in (3), the unconstrained minimization problem minx2RnWFð Þ, wherex WFð Þ ¼x 12kUFð Þkx 2¼12Xn

j¼1

/ x j; Fjð Þx2

; ð8Þ

is equivalent to NCP(F). Then the gradient dynamical system dx

dt¼ 

q

rWFðx tð ÞÞ; x 0ð Þ ¼ x0 ð9Þ is a natural neural network to be considered to deal with NCP(F).

In[2], the discrete-type functions(5), (6), and(7)were used to form the merit functionWF. Preliminary numerical experiments conducted in[2] showed that lower values of the parameter p result to faster convergence, although theoretical evidence for this phenomenon is yet to be verified. Moreover, longer compu- tation time is usually required when a higher value of p is used.

There are also test instances when larger values of p lead to ill- conditioning problems. In turn, the choice p¼ 3 may seem opti- mal in practice. In other words, the results suggest that choosing higher values of p need not be done. Consequently, this seems to suggest that the discrete-type generalization appears to be not very useful in the sense that only one member of each of the fam- ilies is useful for numerical purposes. This motivates us to explore if there exists a continuous generalization of the NR function, i.e. a generalization parametrized by p which assumes values on some interval. This will provide us more values to con- sider for the tunable parameter, instead of just the odd integers with value at least 3.

We provide an affirmative answer to this problem. More pre- cisely, the main contributions of this paper are as follows:

(i) We propose a continuous-type generalization of the NR function. The proposed function does not have a symmetric surface, but we provide two symmetrizations which also admit a continuous parameter p. This generalizes the results in[3,5].

(ii) We establish several properties of these newly formulated NCP functions which are prerequisite to designing solution methods for the complementarity problem, which are not lim- ited to the neural network approach. These properties extend the results in[15].

(iii) Stability properties of the neural network(9)will be estab- lished as important extensions of the results in[2].

More importantly,

(iv) We illustrate that the proposed continuous generalization is meaningful. In particular, it provides a wider range of values of p which offer better convergence rates than the ones based on the discrete-type generalization and their symmetrizations illustrated in[2].

(v) We provide theoretical evidence for the performance dependence on p of the gradient dynamical systems based on the three new families of NCP functions. This was not accomplished in[2].

(vi) This work is a significant improvement of the numerical results that were initially presented in [2], since the pro- posed families not only provide faster convergence rates but also higher stability. That is, the proposed generaliza- tions yield neural networks which are less sensitive to initial conditions, which is one of the main issues encountered in [2].

In summary, this paper can be viewed as an important exten- sion of the works presented in[2,3,5,15]where the discrete-type generalization and two discrete-type symmetrizations of the NR function were studied.

This paper is organized as follows: In Section2, we present our proposed continuous generalization of the NR function.

We also prove important properties of the obtained general- ization, which are extensions of the results given in [2,3,5,15]. The theoretical properties proved in this section will later be used in the analysis of the neural network, which will be presented in Section 3. Results of several numerical experiments are presented in Section 4 and elaborately dis- cussed in Section 5. Concluding remarks are presented in Section6.

2. Continuous generalization

Our proposed generalization of the NR function is defined as

/pNRða; bÞ ¼ sgn að Þjajp a  bð Þþp

: ð10Þ

Here, we assume that p is any number in 0ð ; 1Þ and sgn tð Þ :¼

1 if t> 0 0 if t¼ 0

1 if t < 0 8>

<

>: :

Observe that /pNR is an NCP function. Indeed, note that /pNRða; bÞ ¼ f að Þ  f a  bð Þþ

, where f tð Þ ¼ sgn tð Þjtjp which is a bijective function. It follows that

(3)

/pNRða; bÞ ¼ 0 () f að Þ ¼ f a  bð Þþ () a ¼ a  bð Þþ() /NRða; bÞ ¼ 0:

Note that if p is odd, then /pNR¼ /pNRand so the above general- ization subsumes the discrete-type extension given by(5). We note herein that the transformation employed on /NR via the monotonic function f can always be applied to any NCP func- tion of the form / ¼ /1 /2. This fact has also been noted in [12].

It is easy to see that the function(10)does not have a symmet- ric surface. Employing the same strategy as in[3], we propose two symmetrizations of /pNRas

/

p

SNRða; bÞ ¼ sgn að Þjajp a  bð Þp if aP b;

sgn bð Þjbjp b  að Þp if a< b;

(

ð11Þ

and

wpSNRða; bÞ ¼ sgn að Þsgn bð Þjajpjbjp sgn bð Þ a  bð Þpjbjp if aP b;

sgn að Þsgn bð Þjajpjbjp sgn að Þ b  að Þpjajp if a< b;

(

ð12Þ where p> 0. Notice that /pSNR¼ /pSNRand wpSNR¼ wpSNRwhenever p is odd.

Proposition 1. For any p> 0, the functions /pNR; /pSNR, and wpSNR are NCP functions. Moreover, /pNRða; bÞ > 0 (/pSNRða; bÞ > 0) if and only if a> 0 and b > 0, while wpSNRða; bÞ P 0 for all a; bð Þ 2 R2.

Proof. That /pNRis an NCP function follows from the above discus- sion. Moreover, note that a> 0 and b > 0 if and only if a > a  bð Þþ. Since f tð Þ ¼ sgn tð Þjtjp is strictly increasing, we see that a> 0 and b> 0 if and only if sgn að Þjajp> sgn a  bð Þþ

j a  bð Þþjp, i.e.

/

p

NRða; bÞ > 0. On the other hand, observe that

/pSNRða; bÞ ¼ /pNRða; bÞ if a P b;

/pNRðb; aÞ if a < b;

8<

: ð13Þ

and

wpSNRða; bÞ ¼ sgn bð Þjbjp/

p

NRða; bÞ if a P b;

sgn að Þjajp/pNRðb; aÞ if a < b:

8<

: ð14Þ

Using above identities and the fact that /pNRis an NCP function, then /pSNRand wpSNRare also NCP functions with algebraic signs as specified in the proposition. h

In view of the above proposition, we may then view the func- tions /pNR, /pSNRand wpSNRas continuous generalizations of the functions /pNR; /pSNRand wpSNR. Now, we establish some proper- ties of the above functions which will later be used in the neu- ral network approach. We begin with smoothness properties.

By C1ð Þ and CX 2ð Þ, we mean the class of continuously differen-X tiable and twice continuously differentiable functions defined onX Rn, respectively.

Proposition 2. The following result holds:

(a) If p> 1, the function /pNR2 C1 R2

whose gradient is given by

r/pNRða; bÞ ¼ p jajp1 a  bð Þp1sgn að  bÞþ a b

ð Þp1sgn að  bÞþ

" #

:

If p> 2, then /pNR2 C2 R2

whose Hessian is given by r2/

p

NRða;b¼p p1ð Þ

sgn að Þjajp2 abð Þp2sgn abð Þþ ab

ð Þp2sgn abð Þþ ab

ð Þp2sgn abð Þþ

 abð Þp2sgn abð Þþ

" #

:

(b) If p> 1, the function /

p

SNR2 C1ð ÞX where X:¼ a; bfð Þ j a – bg. In this case, the gradient of /pSNRis given by

r/

p

SNRða;bÞ¼ phjajp1 abð Þp1; abð Þp1iT

if a>b;

p bahð Þp1;jbjp1 bað Þp1iT

if a<b:

8>

<

>:

Further, /pSNRis differentiable at 0ð ; 0Þ withr/pSNRð0; 0Þ ¼ 0; 0½ T. If p> 2, then /pSNR2 C2ð Þ with Hessian given byX

r2/

p

SNRða;bÞ¼

p p1ð Þ sgn að Þjajp2 abð Þp2 ðabÞp2 ab

ð Þp2  abð Þp2

" #

if a>b;

p p1ð Þ  bað Þp2 ðbaÞp2 ba

ð Þp2 sgn bð Þjbjp2 bað Þp2

" #

if a<b:

8>

>>

>>

<

>>

>>

>:

(c) If p> 1, then wpSNR2 C1 R2

whose gradient is given by rwpSNRða;bÞ

¼

p sgn bð Þjbjpjajp1 abð Þp1

sgn að Þjajpjbjp1 abð Þpjbjp1þsgn bð Þ abð Þp1jbjp 2

4

3 5if a>b;

pjaj2p1 1 1

if a¼b;

p sgn bð Þjajp1jbjp bað Þpjajp1þsgn að Þ bað Þp1jajp sgn að Þjajpjbjp1 bað Þp1

2 4

3 5 if a<b;

8>

>>

>>

>>

>>

>>

<

>>

>>

>>

>>

>>

>:

If p> 2, then wpSNR2 C2 R2

whose Hessian is given by

(4)

r2wpSNRða; bÞ ¼ p

p 1

ð Þ sgn ah ð Þsgn bð Þjajp2jbjpi

 p  1ð Þ a  bð Þp2sgn bð Þjbjp

p 1

ð Þ a  bð Þp2sgn bð Þjbjp þp jajh p1 a  bð Þp1i

jbjp1

p 1

ð Þ a  bð Þp2sgn bð Þjbjp þp jajh p1 a  bð Þp1i

jbjp1

p 1

ð Þ sgn ah ð Þsgn bð Þjajpjbjp2i

 p  1ð Þ a  bð Þpsgn bð Þjbjp2 þ2p a  bð Þp1jbjp1

 p  1ð Þ a  bð Þp2sgn bð Þjbjp 2

6666 6666 64

3 7777 7777 75 ifa> b;

p ðp 1Þsgn að Þsgn bð Þjajp2jbjp pjajp1jbjp1 pjajp1jbjp1 ðp 1Þsgn að Þsgn bð Þjajpjbjp2

ifa¼ b;

p p 1

ð Þ sgn ah ð Þsgn bð Þjajp2jbjpi

 p  1ð Þ b  að Þpsgn að Þjajp2 þ2p b  að Þp1jajp1

 p  1ð Þ b  að Þp2sgn að Þjajp

p 1

ð Þ b  að Þp2sgn að Þjajp þp jbjh p1 b  að Þp1i

jajp1

p 1

ð Þ b  að Þp2sgn að Þjajp þp jbjh p1 b  að Þp1i

jajp1

p 1

ð Þ sgn ah ð Þsgn bð Þjajpjbjp2i

 p  1ð Þ b  að Þp2sgn að Þjajp 2

6666 6666 64

3 7777 7777 75 ifa< b:

8>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

<

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>>

>:

Proof. Note that f tð Þ ¼ sgn tð Þjtjp is continuously differentiable when p> 1 with f 0 tð Þ ¼ pjtjp1. Moreover, f is twice continuously differentiable when p> 2 with f tð Þ ¼ p p  1ð Þsgn tð Þjtjp2. Using these and the alternative formulas given in(13) and (14), the gra- dients and Hessians can be easily obtained. The calculations are omitted. h

The above proposition is a generalization of[5, Proposition 2.2], and[15, Proposition 4.3]. On the other hand, the following result is an extension of [15, Proposition 3.4, Proposition 4.5, and Proposition 5.4].

Proposition 3. Let p> 1. Then, the following hold:

(a)ra/

p

NRða; bÞ rb/

p NR

a; b

ð Þ > 0 on a; bfð Þja > b > 0 or a > b > 2ag;

¼ 0 onfða; bÞja 6 b or a > b ¼ 2a or a > b ¼ 0g;

< 0 otherwise:

8<

: (b)ra/

p

SNRða;bÞ rb/pSNR

ða;bÞ

> 0 on a;bfð Þja > b > 0 or a > b > 2ag and onfða;bÞjb > a > 0 or b > a > 2bg;

¼ 0 onnða;bÞj/pSNRða;bÞ ¼ 0 or a > b ¼ 2a or b > a ¼ 2bo

;

< 0 otherwise:

8>

>>

<

>>

>:

(c)raw

p

SNRða; bÞ rbw

p

SNRða; bÞ > 0 on the first quadrant R2þþ, and wpSNRða; bÞ ¼ 0 ()rwpSNRða; bÞ ¼ 0.

Proof. UsingProposition 2(a), ra/pNRða; bÞ rb/pNRða; bÞ

¼ p2 jajp1 a  bð Þp1sgn að  bÞþ

 a  bð Þp1sgn að  bÞþ h

p2hjajp1 a  bð Þp1i a b

ð Þp1 ifa> b

0 ifa6 b

(

:

Suppose now that a> b. Since g tð Þ :¼ tp1is a strictly increasing function on 0½ ; 1Þ; jajp1 a  bð Þp1> 0 if and only if jaj > a  b, which happens if and only if b> 0 or b > 2a. This establishes

Proposition 3(a). Statement (b) easily follows from (a), while (c) can be easily verified using the result ofProposition 2(c). h

We now establish the growth behavior of the proposed families of functions. We first establish the following simple lemma.

Lemma 1. For any x2 0; 1½  and any p > 0, we have 1 x

ð Þp6 1

1þ px :

Proof. Define f : 0; 1½  ! R by f xð Þ ¼ 1  xð Þpð1þ pxÞ. A simple cal- culation yields f0 xð Þ ¼ p p þ 1ð Þx 1  xð Þp1. Then, f monotonically decreases on ½0; 1 from f 0ð Þ ¼ 1 to f 1ð Þ ¼ 0. Consequently, 06 f xð Þ 6 1. This completes the proof. h

Proposition 4. Let / 2 /pNR; /pSNR; wpSNR

n o

. Then j/ a k; bk j ! 1 for any sequence nak; bko1

k¼1 in R2 such that jakj ! 1 and jbkj ! 1.

Proof. The proposition follows from the preceding lemma and analogous arguments in the proof of[2, Lemma 5.1]. h

3. Stability analysis

In this section, we consider the neural network given by (9) using the functions /pNR; /pSNRand wpSNR. The corresponding merit functions defined by (8) will be denoted, respectively, by WpNR;WpS1NRandWpS2NR. The case when p is an odd integer greater than 1 is the neural network studied in[2].

We note that since the results presented in Section2generalize the results for the discrete families originally formulated in [3,5,15], then the discussion presented in[2]can be extended to establish the properties of the induced merit functions WpNR;WpS1NRandWpS2NR corresponding to the continuous general- ization. In the following proposition, we summarize the properties of these merit functions. For conciseness and clarity, we present a shortened proof of the following result pointing out the arguments

(5)

that needed to be modified in the proofs of the results in[2]. We refer the reader to the monograph[8]for definitions and properties of nonlinear mappings (P0-functions, monotone functions, etc.) and the book[22]for standard results in the theory of ordinary differ- ential equations.

Proposition 5. Let p> 1. Then the following hold:

(a) IfðrF IÞ is a P-matrix, then every stationary point ofWpNRis a global minimizer.

(b) If F xð Þ P 0; ðrF xð Þ  I Þ is a P0-matrix and xis a stationary point ofWpNR, then xis a global minimizer ofWpNR.

(c) Suppose that x2XFandrF xð Þ is a P 0-matrix. If xis a sta- tionary point ofWpS1NRorWpS2NR, then xis a global minimizer.

Proof. To prove (a) and (b), we define two diagonal matrices A xð Þ and B xð Þ where

Aiið Þ ¼ jxx ijp1 and Biið Þ ¼ xx  i Fið Þx

sgn x i Fið Þx 

þ; where xis an equilibrium point of(9)withWF¼WpNR. Then, analo- gous arguments as in the proof of[2, Proposition 4.4 and Remark 4.1]lead to the desired conclusion. To prove (c), we proceed as in the proof of [2, Proposition 4.5]. That is, we verify the following properties:

(P1)8ða; bÞ 2 R2þ, we haveraw a; bð Þ rbw a; bð Þ P 0; and

(P2) 8ða; bÞ 2 R2þ, we have

raw a; bð Þ ¼ 0 ()rbw a; bð Þ ¼ 0 () / a; bð Þ ¼ 0,

where w :¼12/2and / 2 /pSNR; wpSNR

n o

. Property (P1) can be easily verified. To show (P2), we only need to show that given a; b P 0, the following holds:

(i)ra/pSNRða; bÞ ¼ 0 implies /pSNRða; bÞ ¼ 0; and (ii)rawpSNRða; bÞ ¼ 0 implies wpSNRða; bÞ ¼ 0.

We first prove (i). Ifra/

p

SNRða; bÞ ¼ 0, then we see fromPropo- sition 2 (b) that we must have a> b or a ¼ b ¼ 0. Otherwise, ra/

p

SNRða; bÞ ¼ p b  að Þp1 would be positive. If a¼ b ¼ 0, then /pSNRða; bÞ ¼ 0 as desired. If a> b, then 0¼1pra/pSNRða; bÞ ¼ ap1 a  bð Þp1. Since t# tp1 is strictly increasing on ½0; 1Þ, then a¼ a  b, i.e. b¼ 0. Then /

p

SNRða; bÞ ¼ 0 since a > b ¼ 0 and /pSNR is an NCP function. To prove (ii), assume that raw

p

SNRða; bÞ ¼ 0. From Proposition 2(c), we must have

0¼1

prawpSNRða;bÞ ¼ ap1bp a bð Þp1bp if aP b;

ap1bp b að Þpap1þ b að Þp1ap if a< b:

(

If aP b, then we can proceed as in[2, Prop 4.5]. If a< b, then

0¼ ap1bp b  að Þpap1þ b  að Þp1ap

¼ ap1bp b  að Þpþ b  að Þp1a

: ð15Þ

From here, we conclude that a¼ 0. Otherwise, we must have bp> b  að Þp and so bp b  að Þpþ b  að Þp1a> b  að Þp1a> 0.

This contradicts(15). Hence, a¼ 0 and since b > a ¼ 0, we obtain that wpSNRða; bÞ ¼ 0 by definition of an NCP function.

In view of the above proposition and the stability analysis pre- sented in[2], we present herein analogous stability results. The proofs are similar to corresponding propositions for the discrete generalization established in[2], and are thus omitted. In particu- lar, Proposition 3.2 (a) follows from[2, Theorem 5.1], Proposition 3.2 (b) and (c) follow from[2, Theorem 5.2], and Proposition 3.2 (d) is a consequence of[2, Theorem 5.2].

Proposition 6. Let xbe an equilibrium point of(9).

(a) If WF2 WpNR;WpS1NR;WpS2NR

and F is a uniformly P- function, then the solution to(9)through any x02 Rnconverges to x.

(b) IfWF¼WpNR, then x2 SOL Fð Þ provided thatðrF IÞ is a P- matrix. If xis isolated, then it is asymptotically stable.

(c) If x2XF andWF¼WpS1NRorWF¼WpS2NR, then x2 SOL Fð Þ provided that F is a P0-function. If x is isolated, then it is asymptotically stable.

(d) If rUFð Þ is nonsingular, where / 2 /x pNR; /pSNR; wpSNR

n o

, and x is isolated, then x2 SOL Fð Þ and x is exponentially stable.

The parameter p has a very significant influence in the rate of convergence of the neural network. For the discrete type families, a few set of test problems was considered in[2], where the numer- ical experiments revealed that a lower value of p2 3; 5; 7; . . .f g often provides faster convergence. However, there is no theoretical evidence yet for this phenomenon.

In fact, as we shall see in Section 4, different convergence behaviors can be observed when we vary the values of p. In partic- ular, a lower value of p does not always lead to faster convergence.

There are test instances when a higher value of p offers faster con- vergence rate. The numerical experiments presented in the next section suggest that there is no simple relation that can be obtained regarding the performance dependence on p of the neural network(9)withWF2 WpNR;WpS1NR;WpS2NR

. Moreover, the simu- lations suggest that initial conditions have a significant influ- ence on the performance of the neural network and its dependence on p. To make sense of these phenomenon, we establish the following theorem. The first part of the proof is a derivation of an error bound for the NCP(F) (see equation(18)) where F is a locally Lipschitz uniformly P-function. The tech- nique employed in the derivation is similar to the idea used in [8, Proposition 6.3.1].

Theorem 1. Consider the neural network(9)withWF¼WpS1NRfor a given p> 1. Suppose that x2 SOL Fð Þ is exponentially stable and F is a uniformly P function that is locally Lipschitz continuous. Then there exist positive constants K;

x

and d such that for all t P 0, we have

kx tð Þ  xk 6 K pþ 1 p

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 2WpS1NRð Þx0 q

1p

ext 8x02XF\ Ndð Þ;x

where Ndð Þ ¼ y : ky  xx f k < dg.

(6)

Proof. Suppose F is uniformly P with modulus

j

> 0. Given x 2 Rn, let j2 1; . . . ; nf g such that

xj xj

 

Fjð Þ  Fx jð Þx

 

P x i xi

Fið Þ  Fx ið Þx

ð Þ 8i¼ 1; . . . ; n:

Then

j

kx  xk26 x j xj

Fjð Þ  Fx jð Þx

 

¼ xjFjð Þ  xx j  xj

 

Fjð Þ:x ð16Þ

Meanwhile, note that sð  tþÞ tðþ tÞ P 0 for any s P 0 and t2 R. Since min x j; Fjð Þx

¼ xj x j Fjð Þx

þ, then taking s¼ xj P 0 and t ¼ xj Fjð Þ, we havex

xj  xjþ min x j; Fjð Þx

 

Fjð Þ  min xx  j; Fjð Þx

 

P 0 which implies that

xj xj

 

Fjð Þ P xx j  xj

 

min x j; Fjð Þx

 Fjð Þx

 min x j; Fjð Þx

: ð17Þ

Since xjP min x j; Fjð Þx

and Fjð Þ P 0, we have from inequal-x ities(16) and (17)that

j

kx  xk26 F jð Þ  Fx jð Þx 

 xj  xj

 

h i

min x j; Fjð Þx 6 kF xð ð Þ  F xð Þk þ kx  x kÞj min x j; Fjð Þx

j

Since F is locally Lipschitz, we conclude that given any x2 Rnin some neighborhood of x, there exists an index j¼ j xð Þ and L > 0 such that

j

kx  xk26 1 þ Lð Þ  j min x j; Fjð Þx

j  kx  xk: ð18Þ Now, let x02XF. We have from part (a) of the proof of [2, Lemma 5.1] and using Lemma 1 that /

p

SNRða; bÞ P

p

pþ1ðmin af ; bgÞp for any a; b P 0. By (18), there exists j¼ j x 0

2 1; . . . ; nf g such that

j

kx0 xk 6 1 þ Lð Þ  pþ 1

p /pSNRx0j; Fj x0 

1p

: ð19Þ

Since x is exponentially stable, there exist positive constants d; c and

x

such that for any tP 0, kx tð Þ  xk 6 cextkx0 xk for all x02 Ndð Þ. This, together with inequalityx (19), gives the desired result with K:¼jcð1þ LÞ. h

Similarly, we get the following error bound result for the other two merit functionsWpNRandWpS2NR.

Theorem 2. Consider the neural network(9)for a given p> 1, and let x2 SOL Fð Þ be exponentially stable. Suppose that F is a uniformly P-function and locally Lipschitz continuous. Then

(a) IfWF¼WpNR, there exist positive constants K,

x

and d such that for all tP 0, we have

kx tð Þ  xk 6 K pþ 1 p

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 2WpNRð Þx0 q

1p

ext 8x02XF\ Ndð Þ:x

(b) IfWF¼WpS2NR, there exist positive constants K,

x

and d such that for all tP 0, we have

kx tð Þ  xk 6 K pþ 1 p

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 2WpS2NRð Þx0 q

2p1

ext 8x02XF\ Ndð Þ:x

Proof. For aP b P 0, then /pNRða; bÞ ¼ /pSNRða; bÞ Ppþ1p bp as in part (a) of the proof of[2, Lemma 5.1]. When 06 a < b, we have /pNRða; bÞ ¼ apPpþ1p ap. It follows that /pNRða; bÞ Ppþ1p ðmin af ; bgÞp. On the other hand, using the identity (14) and the fact that /pSNRða; bÞ Ppþ1p ðmin af ; bgÞp for any a; b P 0, we derive that w

p

SNRða; bÞ Ppþ1p ðmin af ; bgÞ2p. Using these identities and the same arguments as inTheorem 1, we get the desired inequalities. h

0 5 10 15 20 25

p 0

1 2 3 4 5 6 7

ga,b(p) (2.0748,4.2431)

a=4, b=0.5 a=4,b=3 a=4,b=2

Fig. 1. Graph of upper bound for the error termkx tð Þ  xk for some values of a and b with a; b P 0 and a > b.

30 32 34 36 38 40

p 4.0019

4.00195 4.002 4.00205 4.0021 4.00215 4.0022

ga,b(p)

(34.4458,4.0022)

Fig. 2. Graph of g4;0:5ð Þ on the interval 30; 40p ½ .

(7)

Table 1

Numerical results for NCP1 and NCP2 using the neural networks based on /pNR; /pSNRand wpSNRfor different values of p.

p NCP1 NCP2

CT1 Gap1 CT2 Gap2 CT3 Gap3 CT1 Gap1 CT2 Gap2 CT3 Gap3

1.0 3.0E+0 1.4E7 3.0E+0 7.6E8 1.4E+3 6.1E3 1.9E+1 6.7E7 1.7E+1 3.4E7 6.7E+3 3.1E2

1.1 3.2E+0 1.5E7 2.7E+0 9.5E8 1.8E+3 8.4E3 5.1E+1 1.6E5 2.0E+1 3.1E7 7.9E+3 3.7E2

1.5 5.1E+0 1.9E7 3.1E+0 1.8E7 3.9E+3 2.5E2 1.1E+3 2.4E3 4.5E+1 6.0E7 1.4E+4 7.5E2

1.9 1.0E+1 4.1E7 6.5E+0 3.6E7 6.9E+3 5.4E2 4.7E+3 1.9E2 1.2E+2 1.9E6 2.0E+4 1.3E1

2.0 1.2E+1 4.2E7 8.0E+0 4.3E7 7.8E+3 6.4E2 6.0E+3 2.7E2 1.6E+2 2.6E6 2.1E+4 1.5E1

2.1 1.5E+1 6.1E7 1.0E+1 5.4E7 8.7E+3 7.5E2 7.3E+3 3.6E2 2.1E+2 3.7E6 2.2E+4 1.7E1

2.5 3.5E+1 1.5E6 2.5E+1 1.5E6 1.3E+4 1.3E1 1.3E+4 9.1E2 6.1E+2 1.5E5 2.6E+4 2.5E1

2.9 8.9E+1 4.0E6 6.7E+1 4.0E6 1.7E+4 2.0E1 1.9E+4 1.7E1 1.6E+3 6.2E5 2.9E+4 3.4E1

3.0 1.1E+2 5.2E6 8.6E+1 5.2E6 1.7E+4 2.2E1 2.0E+4 1.9E1 1.9E+3 8.8E5 3.0E+4 3.7E1

3.5 3.9E+2 2.0E5 3.1E+2 2.0E5 2.1E+4 3.4E1 2.7E+4 3.3E1 4.2E+3 4.2E4 3.3E+4 5.0E1

4.0 1.4E+3 8.0E5 1.1E+3 8.0E5 2.3E+4 4.8E1 3.2E+4 4.8E1 6.1E+3 1.3E3 3.4E+4 6.3E1

4.5 5.1E+3 3.4E4 4.2E+3 3.4E4 2.3E+4 6.3E1 3.5E+4 6.5E1 7.4E+3 2.9E3 3.5E+4 7.7E1

5.0 1.8E+4 1.7E3 1.5E+4 1.7E3 2.4E+4 7.8E1 3.8E+4 8.3E1 8.3E+3 5.1E3 3.6E+4 9.1E1

5.5 2.8E+4 2.8E2 1.7E+4 2.8E2 2.4E+4 9.2E1 4.0E+4 1.0E+0 9.0E+3 7.9E3 3.6E+4 1.0E+0

6.0 2.8E+4 6.6E2 4.6E+3 5.5E2 2.3E+4 1.1E+0 4.1E+4 1.2E+0 9.4E+3 1.1E2 3.6E+4 1.2E+0

6.5 2.8E+4 1.1E1 1.2E+4 5.7E2 2.3E+4 1.2E+0 4.2E+4 1.4E+0 9.8E+3 1.5E2 3.5E+4 1.3E+0

7.0 2.9E+4 1.6E1 2.9E+4 6.1E2 2.2E+4 1.3E+0 4.3E+4 1.5E+0 1.0E+4 1.9E2 3.5E+4 1.4E+0

20.0 3.7E+4 3.3E+0 1.5E+4 1.4E+0 1.2E+4 3.1E+0 3.1E+4 4.4E+0 * * 1.5E+4 2.9E+0

50.0 1.9E+4 6.9E+0 5.6E+3 2.2E+0 5.5E+3 4.1E+0 1.9E+4 6.3E+0 * * 6.3E+3 3.5E+0

Table 2

Numerical results for NCP3 and NCP4 using the neural networks based on /pNR; /pSNRand wpSNRfor different values of p.

p NCP3 NCP4

CT1 Gap1 CT2 Gap2 CT3 Gap3 CT1 Gap1 CT2 Gap2 CT3 Gap3

1.01 1.5E+1 5.2E6 1.4E+1 1.2E6 5.1E+3 4.5E2 1.6E+1 2.3E6 1.4E+1 3.3E7 5.1E+3 1.9E3

1.1 4.3E+1 3.8E5 2.7E+1 1.9E10 5.8E+3 4.9E2 4.3E+1 1.8E5 1.1E+1 5.2E8 5.6E+3 2.5E3

1.5 9.7E+2 3.7E3 2.4E+2 3.3E7 8.7E+3 7.6E2 8.3E+2 1.6E3 3.5E+0 2.0E7 7.6E+3 7.1E3

1.9 4.0E+3 2.7E2 7.2E+2 9.1E6 1.1E+4 1.1E1 3.2E+3 1.2E2 1.4E+0 1.2E7 9.3E+3 1.4E2

2 5.0E+3 3.7E2 8.6E+2 1.6E5 1.1E+4 1.2E1 4.0E+3 1.6E2 1.1E+0 9.5E8 9.7E+3 1.7E2

2.1 6.1E+3 5.0E2 1.3E+1 6.0E1 1.1E+4 1.3E1 4.9E+3 2.1E2 9.0E1 7.0E8 1.0E+4 1.9E2

2.5 1.1E+4 1.1E1 8.2E+0 6.9E1 1.2E+4 1.7E1 8.5E+3 5.1E2 4.0E1 2.5E8 1.1E+4 1.8E2

2.9 1.6E+4 1.9E1 6.9E+0 7.4E1 1.3E+4 2.1E1 1.2E+4 9.3E2 1.9E1 6.2E9 1.2E+4 1.7E2

3 1.7E+4 2.1E1 6.8E+0 7.5E1 1.3E+4 2.2E1 1.3E+4 1.0E1 1.6E1 3.6E9 1.3E+4 2.4E2

3.5 2.2E+4 3.1E1 6.6E+0 8.0E1 1.3E+4 2.7E1 1.6E+4 1.7E1 7.0E2 1.6E10 1.4E+4 5.3E2

4 2.6E+4 3.8E1 7.4E+0 8.2E1 1.3E+4 3.2E1 1.9E+4 2.4E1 4.0E2 1.1E10 1.4E+4 8.0E2

4.5 2.8E+4 4.3E1 8.9E+0 8.4E1 1.3E+4 3.6E1 2.1E+4 3.2E1 3.0E2 3.7E10 1.5E+4 1.1E1

5 3.0E+4 4.6E1 1.1E+1 8.6E1 1.3E+4 4.1E1 2.3E+4 4.0E1 3.0E2 2.5E13 1.5E+4 1.3E1

5.5 3.1E+4 4.8E1 1.4E+1 8.7E1 1.3E+4 4.5E1 2.4E+4 4.8E1 2.0E2 5.3E10 1.5E+4 1.6E1

6 3.1E+4 4.8E1 1.8E+1 8.8E1 1.2E+4 4.9E1 2.5E+4 5.6E1 2.0E2 4.5E13 1.5E+4 1.9E1

6.5 2.9E+4 4.7E1 2.4E+1 8.9E1 1.0E+4 5.4E1 2.5E+4 6.3E1 2.0E2 1.3E15 1.5E+4 2.1E1

7 2.6E+4 4.8E1 3.1E+1 9.0E1 8.2E+3 6.0E1 2.5E+4 7.1E1 2.0E2 1.5E15 1.5E+4 2.4E1

20 * * * * * * 4.6E+4 1.9E+0 9.1E+4 4.5E16 1.1E+4 6.8E1

50 * * * * * * * * * * 5.2E+3 9.7E1

Table 3

Numerical results for NCP5 and NCP6 using the neural networks based on /pNR; /pSNRand wpSNRfor different values of p.

p NCP5 NCP6

CT1 Gap1 CT2 Gap2 CT3 Gap3 CT1 Gap1 CT2 Gap2 CT3 Gap3

1.01 2.4E+2 7.9E6 2.4E+2 7.9E6 3.6E+4 1.6E2 2.8E+1 4.0E6 2.4E+1 3.0E6 6.3E+3 5.9E2

1.1 1.6E+2 5.6E6 1.6E+2 5.6E6 3.7E+4 1.6E2 7.6E+1 2.6E5 1.4E+1 1.5E6 6.5E+3 6.9E2

1.5 3.7E+1 1.1E6 3.7E+1 1.1E6 4.1E+4 1.9E2 1.1E+3 5.6E3 5.9E+0 4.6E8 7.5E+3 8.5E2

1.9 1.1E+1 1.3E7 1.1E+1 1.3E7 4.3E+4 3.8E2 4.4E+3 4.3E2 4.6E+0 1.6E7 9.7E+3 7.1E2

2 8.7E+0 4.7E8 8.7E+0 4.7E8 4.4E+4 5.3E2 5.6E+3 6.0E2 4.7E+0 1.7E7 1.0E+4 6.2E2

2.1 7.0E+0 1.7E9 7.0E+0 1.7E9 4.4E+4 6.7E2 6.8E+3 8.0E2 * * 1.1E+4 5.4E2

2.5 3.4E+0 7.6E8 3.4E+0 7.6E8 4.7E+4 1.1E1 1.2E+4 1.9E1 * * 1.2E+4 2.8E2

2.9 2.1E+0 1.1E7 2.1E+0 1.1E7 4.9E+4 1.4E1 1.8E+4 3.5E1 * * 1.4E+4 2.1E2

3 2.0E+0 5.5E8 2.0E+0 5.5E8 4.9E+4 1.5E1 1.9E+4 4.0E1 * * 1.4E+4 2.2E2

3.5 1.4E+0 3.7E8 1.4E+0 3.7E8 5.1E+4 2.0E1 2.5E+4 6.5E1 * * 1.6E+4 3.7E2

4 1.0E+0 6.0E8 1.0E+0 6.0E8 5.2E+4 2.4E1 3.0E+4 9.2E1 * * 1.7E+4 6.2E2

4.5 8.0E1 3.1E8 8.0E1 3.1E8 5.2E+4 2.9E1 3.4E+4 1.2E+0 * * 1.8E+4 8.8E2

5 9.0E1 4.1E11 9.0E1 4.1E11 5.2E+4 3.3E1 3.8E+4 1.5E+0 * * 1.8E+4 1.1E1

5.5 7.0E1 1.2E10 7.0E1 1.2E10 5.2E+4 3.7E1 4.0E+4 1.7E+0 * * 1.9E+4 1.3E1

6 5.0E+2 8.9E16 1.0E+3 8.9E16 5.1E+4 4.0E1 3.9E+4 2.0E+0 * * 1.9E+4 1.4E1

6.5 5.0E1 8.5E11 5.0E1 8.5E11 5.1E+4 4.3E1 3.3E+4 2.2E+0 * * 1.9E+4 1.6E1

7 4.0E1 3.3E10 4.0E1 3.3E10 5.0E+4 4.7E1 4.6E+4 2.4E+0 * * 1.9E+4 1.7E1

20 1.0E1 1.5E12 1.0E1 1.5E12 3.1E+4 8.4E1 3.6E+4 6.8E+0 * * 1.4E+4 7.6E1

50 * * * * * * 1.7E+4 1.5E+1 * * 7.2E+3 1.5E+0

參考文獻

相關文件

In particular, combining the numerical results of the two papers, we may obtain the conclusion that the merit function method based on φ p has a better a global convergence and a

In this paper, we have studied a neural network approach for solving general nonlinear convex programs with second-order cone constraints.. The proposed neural network is based on

In this paper, we have studied a neural network approach for solving general nonlinear convex programs with second-order cone constraints.. The neural network is based on the

The proposed neural network model was developed based on a smoothed natural residual merit function involving an uncon- strained minimization reformulation of the

Wang, Solving pseudomonotone variational inequalities and pseudo- convex optimization problems using the projection neural network, IEEE Transactions on Neural Network,

In this paper, we build a new class of neural networks based on the smoothing method for NCP introduced by Haddou and Maheux [18] using some family F of smoothing functions.

W ANG , A general projection neural network for solving monotone variational inequalities and related optimization problems, IEEE Transactions on Neural Networks, 15 (2004)

Particularly, combining the numerical results of the two papers, we may obtain such a conclusion that the merit function method based on ϕ p has a better a global convergence and