• 沒有找到結果。

Properties of a Family of Generalized NCP-Functions and a Derivative Free Algorithm for Complementarity Problems

N/A
N/A
Protected

Academic year: 2022

Share "Properties of a Family of Generalized NCP-Functions and a Derivative Free Algorithm for Complementarity Problems"

Copied!
25
0
0

加載中.... (立即查看全文)

全文

(1)

Properties of a Family of Generalized NCP-Functions and a Derivative Free Algorithm for Complementarity

Problems

Sheng-Long Hu

Department of Mathematics School of Science, Tianjin University

Tianjin 300072, P.R. China

Zheng-Hai Huang

Department of Mathematics School of Science, Tianjin University

Tianjin 300072, P.R. China Email: huangzhenghai@tju.edu.cn

Jein-Shan Chen

Department of Mathematics National Taiwan Normal University

Taipei, Taiwan 11677 Email: jschen@math.ntnu.edu.tw

September 7, 2008; Revised: October 24, 2008

Corresponding Author. The author’s work is partially supported by the National Natural Science Foundation of China (Grant No. 10571134 and No. 10871144) and the Natural Science Foundation of Tianjin (Grant No. 07JCYBJC05200).

Member of Mathematics Division, National Center for Theoretical Sciences, Taipei Office. The author’s work is partially supported by National Science Council of Taiwan.

(2)

Abstract

In this paper, we propose a new family of NCP-functions and the corresponding merit functions, which are the generalization of some popular NCP-functions and the related merit functions. We show that the new NCP-functions and the corresponding merit functions possess a system of favorite properties. Specially, we show that the new NCP-functions are strongly semismooth, Lipschitz continuous, and continuously dif- ferentiable; and that the corresponding merit functions have SC1 property (i.e., they are continuously differentiable and their gradients are semismooth) and LC1 property (i.e., they are continuously differentiable and their gradients are Lipschitz continuous) under suitable assumptions. Based on the new NCP-functions and the corresponding merit functions, we investigate a derivative free algorithm for the nonlinear comple- mentarity problem and discuss its global convergence. Some preliminary numerical results are reported.

Key words: Complementarity problem, NCP-function, merit function, derivative free algorithm.

AMS subject classifications (2000): 90C33, 90C56, 65K10.

(3)

1 Introduction

In the last decades, people have put a lot of their energy and attention on the com- plementarity problem due to its various applications in operation research, economics, and engineering (see, for examples, [10, 13, 23]). The nonlinear complementarity problem (NCP) is to find a point x ∈ <n such that

x ≥ 0, F (x) ≥ 0, xTF (x) = 0, (1.1)

where F : <n → <n is a continuously differentiable mapping with F := (F1, F2, . . . , Fn)T. Many solution methods have been developed to solve NCP (1.1), for examples, [3, 5, 13, 14, 15, 16, 17, 19, 23, 27, 28]. For more details, please refers to the excellent monograph [9]. One of the most popular methods is to reformulate the NCP (1.1) as a unconstrained optimization problem and then to solve the reformulated problem by the unconstrained optimization technique. This kind of methods is called the merit function method, where the merit function is generally constructed by some NCP-function.

Definition 1.1 A function φ : <2 → < is called an NCP-function [2, 18, 25, 26], if it satisfies

φ(a, b) = 0 ⇐⇒ a ≥ 0, b ≥ 0, ab = 0.

Furthermore, if φ(a, b) ≥ 0 for all (a, b) ∈ <2 then the NCP-function φ is called a nonnegative NCP-function. In addition, if a function Ψ : <n → < is nonnegative and Ψ(x) = 0 if and only if x solves the NCP, then Ψ is called a merit function for the NCP.

If φ is an NCP-function, then it is easy to see that the function Ψ : <n → < defined by Ψ(x) := Pn

i=1φ2(xi, Fi(x)) is a merit function for the NCP. Thus, finding a solution of the NCP is equivalent to finding a global minimum of the unconstrained minimization minx∈<nΨ(x) with optimal value 0.

Many NCP-functions have been proposed in the literature. Among them, the FB function is one of the most popular NCP-functions, which is defined by

φ(a, b) :=√

a2+ b2 − a − b, ∀(a, b) ∈ <2.

One of the main generalization of FB function was given by Kanzow and Kleinmichel [18]:

φθ(a, b) :=p

(a − b)2+ θab − a − b, θ ∈ (0, 4), ∀(a, b) ∈ <2. (1.2) Another main generalization was given by Chen and Pan [5]:

φp(a, b) := pp

|a|p+ |b|p− a − b, p ∈ (1, ∞), ∀(a, b) ∈ <2. (1.3)

(4)

It has been proved in [3, 4, 5, 6, 18, 22] that the functions φθ given in (1.2) and φp given in (1.3) possess a system of favorite properties, such as, strong semismoothness, Lipschitz continuity, and continuous differentiability. It has also been proved that the corresponding merit functions of φθand φp have SC1property (i.e., they are continuously differentiable and their gradients are semismooth) and LC1 property (i.e., they are continuously differentiable and their gradients are Lipschitz continuous) under suitable assumptions.

Motivated by Kanzow and Kleinmichel [18] and Chen and Pan [5], we introduce in this paper the following functions:

φθp(a, b) := pp

θ(|a|p+ |b|p) + (1 − θ)|a − b|p− a − b, p > 1, θ ∈ (0, 1], (a, b) ∈ <2. (1.4) and

Ψθp(x) := 1 2

Xn i=1

φ2θp(xi, Fi(x)). (1.5)

Is the function φθp an NCP-function? If it is, do the functions given by (1.4) and (1.5) have the same properties as those known functions mentioned above? Furthermore, how is the numerical behavior of the merit function methods based on the functions defined by (1.4) and (1.5)?

In this paper, we will answer the questions mentioned above partly. Firstly, we show that the function φθpdefined by (1.4) is an NCP-function; and discuss some favorite properties of the NCP-function (1.4) and its nonnegative NCP-function, including strong semismoothness, Lipschitz continuity, and continuous differentiability. Since the function φθp defined by (1.4) is an NCP-function, it follows that the function Ψθp defined by (1.5) is a merit function associated to the NCP-function φθp. We also show that the merit function Ψθp has SC1 property and LC1 property. Secondly, we investigate a derivative free method based on the functions defined by (1.4) and (1.5) and show its global convergence. (Note: usually the nonsmooth Newton method is faster than the derivative free method for solving NCPs.

However, the derivative free algorithm may overcome the case where strong conditions are sometimes needed to guarantee that the Jacobian of the merit function is nonsingular or very expensive to compute.) Thirdly, we report the preliminary numerical results for test problems from MCPLIB. The preliminary numerical results show, on the average, that the algorithm works better when θ = 1 (according to the FB-type function), θ = 0.9 and θ = 0.25, and when p = 1.1 or p = 2 or p = 20 generally.

The rest of this paper are organized as follows. Various properties of the new NCP- function (1.4) and the nonnegative NCP-function associated to (1.4) are established in the next section. In Section 3, some properties of the merit function defined by (1.5) are an- alyzed. In Section 4, we investigate a derivative free algorithm for the NCP and show its global convergence. Some preliminary numerical results are reported in Section 5 and final conclusions are given in the last section.

(5)

Throughout this paper, unless stated otherwise, all vectors are column vectors, the sub- script T denotes transpose, <n denotes the space of n-dimensional real column vectors, and

<n+ (respectively, <n++) denotes the nonnegative (respectively, positive) orthant in <n. For any vectors u, v ∈ <n, we write (uT, vT)T as (u, v) for simplicity. For x ∈ <n, we use x ≥ 0 (respectively, x > 0) to mean x ∈ <n+ (respectively, x ∈ <n++). We use “:=” to mean ”be defined as”. We denote by kuk the 2-norm of u and kukp the p-norm with p > 1. We use

∇F to denote the gradient of F (while ∂F (x)∂xi denotes to the i-th component of the gradient of F ) and ∇2F to denote the second order derivative of F . We use α = o(β) (respectively, α = O(β)) to mean αβ tends to zero (respectively, bounded uniformly) as β → 0.

2 Properties of the New NCP-Function

In this section, we show that the function φθp defined by (1.4) is an NCP-function, and discuss its properties which are similar to those obtained in [3, 5] for the function φp defined by (1.3). We also study a nonnegative NCP-function associated with φθp, and discuss its properties. In addition, we discuss the semismooth-related properties due to its importance in semismooth and smooth analysis [8, 10, 15, 16, 20, 24].

For convenience, we define ηθp(a, b) := pp

θ(|a|p+ |b|p) + (1 − θ)|a − b|p, p > 1, θ ∈ (0, 1], (a, b) ∈ <2. (2.1) The proofs of the following propositions are trivial, we omit their proofs here.

Proposition 2.1 The function φθp defined by (1.4) is an NCP-function.

Proposition 2.2 The function ηθp defined by (2.1) is a norm on <2 for all p > 1, θ ∈ (0, 1].

Now, we briefly introduce the concept of semismoothness, which was originally introduced by Mifflin [20] for functionals and was extended to vector valued functions by Qi and Sun [24]. A locally Lipschitz function F : <n → <m, which has the generalized Jacobian ∂F (x) in the sense of Clarke [8], is said to be semismooth (or strongly semismooth) at x ∈ <n, if F is directionally differentiable at x and

F (x + h) − F (x) − V h = o(khk) (or = O(khk2) holds for any V ∈ ∂F (x + h).

Proposition 2.3 Let φθp be defined by (1.4), then for all θ ∈ (0, 1] and p > 1,

(i) φθpis sub-additive, i.e., φθp((a, b)+(c, d)) ≤ φθp(a, b)+φθp(c, d) for all (a, b), (c, d) ∈ <2;

(6)

(ii) φθp is positive homogenous, i.e., φθp(α(a, b)) = αφθp(a, b) for all (a, b) ∈ <2 and α > 0;

(iii) φθp is a convex function on <2, i.e., φθp(α(a, b) + (1 − α)(c, d)) ≤ αφθp(a, b) + (1 − α)φθp(c, d) for all (a, b), (c, d) ∈ <2 and α ∈ [0, 1];

(iv) φθp is Lipschitz continuous on <2;

(v) φθp is continuously differentiable on <2\{(0, 0)};

(vi) φθp is strongly semismooth on <2.

Proof. By using φθp((a, b)) = ηθp(a, b) − (a + b) and Proposition 2.2, we can obtain that the results (i), (ii), and (iii) hold.

Consider the result (iv). Since ηθp is a norm on <2 from Proposition 2.2 and any two norms in finite dimensional space are equivalent, it follows that there exists a positive con- stant κ such that

ηθp(a, b) ≤ κk(a, b)k, ∀(a, b) ∈ <2,

where k · k represents the Euclidean norm on <2. Hence, for all (a, b), (c, d) ∈ <2,

θp(a, b) − φθp(c, d)| = |ηθp(a, b) − (a + b) − ηθp(c, d) + (c + d)|

≤ |ηθp(a, b) − ηθp(c, d)| + |a − c| + |b − d|

≤ ηθp(a − c, b − d) +√

2k(a − c, b − d)k

≤ κk(a − c, b − d)k +√

2k(a − c, b − d)k

= (κ +√

2)k(a − c, b − d)k.

Hence, φθp is Lipschitz continuous with Lipschitz constant κ +√

2, i.e., the result (iv) holds.

Consider the result (v). If (a, b) 6= (0, 0), then ηθp(a, b) 6= 0 by Proposition 2.2. By a direct calculation, we get

∂φθp(a, b)

∂a = θsgn(a)|a|p−1+ (1 − θ)sgn(a − b)|a − b|p−1

ηθp(a, b)p−1 − 1; (2.2)

∂φθp(a, b)

∂b = θsgn(b)|b|p−1− (1 − θ)sgn(a − b)|a − b|p−1

ηθp(a, b)p−1 − 1, (2.3)

where sgn(·) is the symbol function. It is easy to see from (2.2) and (2.3) that the result (v) holds.

Consider the result (vi). Since φθp is a convex function by the result (iii), we get that it is a semismooth function. Noticing that φθp is continuously differentiable except (0, 0), it is

(7)

sufficient to prove that it is strongly semismooth at (0, 0). For any (h, k) ∈ <2\{(0, 0)}, φθp

is differentiable at (h, k), and hence, ∇φθp(h, k) =

³∂φθp(h,k)

∂a ,∂φθp∂b(h,k)

´T . So,

φθp((0, 0) + (h, k)) − φθp(0, 0) −

µ∂φθp(h, k)

∂a ,∂φθp(h, k)

∂b

¶ µ h k

= pp

θ(|h|p+ |k|p) + (1 − θ)|h − k|p− (h + k)

−(sgn(h)|h|p−1+ sgn(h − k)|h − k|p−1 ηθp(h, k)p−1 − 1)h

−(sgn(k)|k|p−1− sgn(h − k)|h − k|p−1 ηθp(h, k)p−1 − 1)k

= pp

θ(|h|p+ |k|p) + (1 − θ)|h − k|p

−sgn(h)|h|p−1h + sgn(k)|k|p−1k + sgn(h − k)|h − k|p−1(h − k) ηθp(h, k)p−1

= pp

θ(|h|p+ |k|p) + (1 − θ)|h − k|p |h|p+ |k|p + |h − k|p ηθp(h, k)p−1

= ηθp(h, k) − |h|p + |k|p+ |h − k|p ηθp(h, k)p−1

= ηθp(h, k)p − (|h|p + |k|p+ |h − k|p) ηθp(h, k)p−1

= 0

= O(k(h, k)k2).

Thus, we obtain that φθp is strongly semismooth.

We complete the proof. 2

Proposition 2.4 Let φθp be defined by (1.4) and {(ak, bk)} ⊆ <2. Then, |φθp(ak, bk)| → ∞ if one of the following conditions is satisfied.

(i). ak→ −∞; (ii). bk→ −∞; (iii). ak → ∞ and bk → ∞.

Proof. (i) Suppose that ak → −∞. If {bk} is bounded from above, then the result holds trivially. When bk → ∞, we have −ak > 0 and bk > 0 for all k sufficiently large, and hence,

pp

θ(|ak|p+ |bk|p) + (1 − θ)|ak− bk|p− bk pp

θ|bk|p+ (1 − θ)|bk|p− bk= 0.

This, together with −ak → ∞ and the definition of φθp, implies that the result holds.

(ii) For the case of bk → −∞, a similar analysis yields the result of the proposition.

(iii) Suppose that ak → ∞ and bk → ∞. Since p > 1 and θ ∈ (0, 1], we have (1 − θ)|ak bk|p ≤ (1 − θ)(|ak|p+ |bk|p) for all sufficiently large k. Thus, for all sufficiently large k,

pp

θ(|ak|p+ |bk|p) + (1 − θ)|ak− bk|p pp

|ak|p+ |bk|p,

(8)

and hence,

(ak+ bk) − pp

θ(|ak|p+ |bk|p) + (1 − θ)|ak− bk|p ≥ (ak+ bk) −pp

|ak|p+ |bk|p. By [5, Lemma 3.1] we know that (ak + bk) − pp

|ak|p + |bk|p → ∞ as k → ∞ when the condition (iii) is satisfied. Thus, we obtain that

θp(ak, bk)| = (ak+ bk) − pp

θ(|ak|p+ |bk|p) + (1 − θ)|ak− bk|p → ∞

as k → ∞, which completes the proof. 2

Now, we define a nonnegative function, associated with the function φθp, as follows.

ψθp(a, b) := 1

2φ2θp(a, b), p > 1, θ ∈ (0, 1], (a, b) ∈ <2. (2.4) Proposition 2.5 Let ψθp be defined by (2.4), then for all θ ∈ (0, 1] and p > 1,

(i) ψθp is an NCP-function;

(ii) ψθp(a, b) ≥ 0 for all (a, b) ∈ <2;

(iii) ψθp is continuously differentiable on <2; (vi) ψθp is strongly semismooth on <2;

(v) ∂ψθp∂a(a,b) · ∂ψθp∂b(a,b) ≥ 0 for all (a, b) ∈ <2, where the equality holds if and only if φθp(a, b) = 0;

(vi) ∂ψθp∂a(a,b) = 0 ⇐⇒ ∂ψθp∂b(a,b) = 0 ⇐⇒ φθp(a, b) = 0.

Proof. By the definition of ψθp, it is easy to see that the results (i) and (ii) hold.

Consider the result (iii). By using Proposition 2.3 and the definition of ψθp, it is sufficient to prove that ψθp is differentiable at (0, 0) and the gradient is continuous at (0, 0). In fact, for all (a, b) ∈ <2\{(0, 0)}, we have,

θp(a, b)| =

¯¯

¯pp

θ(|a|p + |b|p) + (1 − θ)|a − b|p− a − b

¯¯

¯

¯¯

¯pp

θ|a|p+pp

θ|b|p+pp

(1 − θ)|a − b|p

¯¯

¯ + |a| + |b|

≤ |a| + |b| + |a − b| + |a| + |b|

≤ 3(|a| + |b|),

where the second inequality follows from p > 1 and the third inequality follows from θ ∈ (0, 1]. Hence,

ψθp(a, b) − ψθp(0, 0) = 1

2φ2θp(a, b) ≤ 1

2(3(|a| + |b|))2 ≤ O(|a|2+ |b|2).

(9)

Thus, similar to that of [7, Proposition 1], we can get that ψθpis differentiable at (0, 0) with

∇ψθp(0, 0) = (0, 0)T. Now, we prove that for all (a, b) ∈ <2\{(0, 0)},

¯¯

¯¯θsgn(a)|a|p−1+ (1 − θ)sgn(a − b)|a − b|p−1 ηθp(a, b)p−1

¯¯

¯¯ ≤ 1, (2.5)

¯¯

¯¯θsgn(b)|b|p−1− (1 − θ)sgn(a − b)|a − b|p−1 ηθp(a, b)p−1

¯¯

¯¯ ≤ 1. (2.6)

In fact,

¯¯

¯¯θsgn(a)|a|p−1+ (1 − θ)sgn(a − b)|a − b|p−1 ηθp(a, b)p−1

¯¯

¯¯

θ|a|p−1+ (1 − θ)|a − b|p−1 ηθp(a, b)p−1

= θ1p1pa|p−1+ (1 − θ)1p|(1 − θ)1p(a − b)|p−1 ηθp(a, b)p−1

((θ1p)p+ ((1 − θ)1p)p)1p((|θ1pa|p−1)p−1p + (|(1 − θ)1p(a − b)|p−1)p−1p )p−1p ηθp(a, b)p−1

= (θ + (1 − θ))(xp+ zp)p−1p ηθp(a, b)p−1

= (xp+ zp)p−1p (xp+ yp+ zp)p−1p

= ( xp+ zp

xp+ yp+ zp)p−1p

≤ 1,

where x := |θ1pa|p, y := |θ1pb|p, z := |(1 − θ)1p(a − b)|p; the first inequality follows from the triangle inequality; the second inequality follows from the well-known H¨older inequality; the second equality follows from the definitions of x and z; the third equality follows from the definitions of ηθp(a, b), x, y and z; and the third inequality follows from the fact that x, y and z are all nonnegative. So, (2.5) holds. Similar analysis will derive that (2.6) holds.

Thus, it follows from (2.5) and (2.6) that both ∂φθp∂a(a,b) and ∂φθp∂b(a,b) are uniformly bounded. Since φθp(a, b) → 0 as (a, b) → (0, 0), we get the desired result.

Consider the result (iv). Since the composition of strongly semismooth function is also strongly semismooth (see [11, Theorem 19]), by Proposition 2.3(vi) and the definition of ψθp

we obtain that the desired result holds.

Consider the result (v). It is obvious that ∂ψθp∂a(a,b) = 0 when (a, b) = (0, 0). Now, suppose that (a, b) 6= (0, 0). Since

∂ψθp(a, b)

∂a ·∂ψθp(a, b)

∂b = ∂φθp(a, b)

∂a · ∂φθp(a, b)

∂b · φθp(a, b)2, (2.7)

(10)

by (2.2), (2.3), (2.5), and (2.6), we obtain that ∂φθp∂a(a,b) ≤ 0 and ∂φθp∂b(a,b) ≤ 0 for all (a, b) ∈ <2, that is, the first result of (v) holds. In addition, from (2.7) it is obvious that the sufficient condition of the second result of (v) holds. Now, we suppose that ∂ψθp∂a(a,b) · ∂ψθp∂b(a,b) = 0.

Then, it is sufficient to prove that φθp(a, b) = 0 when ∂φθp∂a(a,b) · ∂φθp∂b(a,b) = 0. Suppose that

∂φθp(a,b)

∂a = 0 without loss of generality. From the proof of (iii) in this proposition, it is easy to see that it must be y = 0, and hence, b = 0. After a simple symbol discussion for (2.2), we may get a ≥ 0. Hence φθp(a, b) = 0 by Proposition 2.1. So, the result (v) holds.

Consider the result (vi). Since

∂ψθp(a, b)

∂a = ∂φθp(a, b)

∂a φθp(a, b), ∂ψθp(a, b)

∂b = ∂φθp(a, b)

∂b φθp(a, b), the result (vi) is immediately satisfied from the above analysis.

We complete the proof. 2

Lemma 2.1 [21, Theorem 3.3.5] If f : D ⊆ <n→ <m has a second derivative at each point of a convex set D0 ⊆ D, then k∇f (y) − ∇f (x)k ≤ sup0≤t≤1k∇2f (x + t(y − x))k · ky − xk.

Theorem 2.1 The gradient function of the function ψθp defined by (2.4) with p ≥ 2, θ ∈ (0, 1] is Lipschitz continuous, that is, there exists a positive constant L such that

k∇ψθp(a, b) − ∇ψθp(c, d)k ≤ Lk(a, b) − (c, d)k (2.8) holds for all (a, b), (c, d) ∈ <2.

Proof. It follows from the definition of ψθp and the proof of Proposition 2.5(iii) that

∇ψθp(a, b) = ∇φθp(a, b)φθp(a, b) when (a, b) 6= (0, 0), and ∇ψθp(0, 0) = (0, 0)T. From Propo- sition 2.5(iii) we know that ψθp is continuous differentiable. The proof is divided into the following three cases.

Case 1. If (a, b) = (c, d) = (0, 0), it follows from Proposition 2.5 that ∇ψθp(0, 0) = (0, 0), and hence, (2.8) holds for all positive number L.

Case 2. Consider the case that one of (a, b) and (c, d) is (0, 0), but not all. We assume that (a, b) 6= (0, 0) and (c, d) = (0, 0) without loss of generality. Then,

k∇ψθp(a, b) − ∇ψθp(c, d)k = k∇ψθp(a, b) − (0, 0)k

= k∇φθp(a, b)φθp(a, b) − (0, 0)k

= k∇φθp(a, b)kφθp(a, b)

= k∇φθp(a, b)k|φθp(a, b) − φθp(0, 0)|

≤ Lk(a, b) − (0, 0)k,

(11)

where the inequality follows from the fact that {k∇φθp(a, b)k} is uniformly bounded on <2 (which can be obtained from the proof of Proposition 2.5(iii)) and φθpis Lipschitz continuous on <2 given in Proposition 2.3 (iv). Hence, (2.8) holds for some positive constant L.

Case 3. If both (a, b) and (c, d) are not (0, 0), we will use Lemma 2.1 to prove (2.8) holds for this case. For simplicity, we denote

ˆh1 := θsgn(a)|a|p−1+ (1 − θ)sgn(a − b)|a − b|p−1 ηp−1θp (a, b) ; ˆh2 := θsgn(b)|b|p−1− (1 − θ)sgn(a − b)|a − b|p−1

ηθpp−1(a, b) ; ˆa1 := (θ|a|p−2+ (1 − θ)|a − b|p−2pθp(a, b);

ˆa2 := −ˆh21ηθp2p−2(a, b);

ˆb1 := −(1 − θ)|a − b|p−2ηθpp (a, b);

ˆb2 := −ˆh1ˆh2η2p−2θp (a, b);

ˆc1 := (θ|b|p−2+ (1 − θ)|a − b|p−2θpp (a, b);

ˆc2 := −ˆh22ηθp2p−2(a, b).

When (a, b) 6= (0, 0), by a direct calculation, we have

2ψθp(a, b)

∂a2 = (ˆh1− 1)2+ (p − 1) ˆa1+ ˆa2

ηθp2p−1(a, b)(ηθp(a, b) − (a + b));

2ψθp(a, b)

∂a∂b = (ˆh1− 1)(ˆh2− 1) + (p − 1) ˆb1+ ˆb2

η2p−1θp (a, b)(ηθp(a, b) − (a + b));

2ψθp(a, b)

∂b2 = (ˆh2− 1)2+ (p − 1) ˆc1+ ˆc2

ηθp2p−1(a, b)(ηθp(a, b) − (a + b));

2ψθp(a, b)

∂b∂a = 2ψθp(a, b)

∂a∂b ,

where the last equality follows from the fact that 2ψ∂a∂bθp(a,b) and 2ψ∂b∂aθp(a,b) are continuous when (a, b) 6= (0, 0). Since p ≥ 2 and ηθp(·, ·) is a norm on <2 by Proposition 2.2, it is easy to verify that

|a + b| ≤ |a| + |b| ≤ pp

|a|p + |b|p+pp

|a|p+ |b|p = 2k(a, b)kp ≤ 2κηθp(a, b), where κ > 0 is a constant depending on θ and p.

ˆa1

ηθp2p−2(a, b) = θ|a|p−2+ (1 − θ)|a − b|p−2 ηp−2θp (a, b)

= θ|a|p−2

ηθpp−2(a, b) + (1 − θ)|a − b|p−2 ηθpp−2(a, b)

≤ θ2p + (1 − θ)2p

≤ 2.

(12)

Similarly, we have

|ˆb1|

η2p−2θp (a, b) ≤ 1; ˆc1

ηθp2p−2(a, b) ≤ 2.

These, together with the results |ˆh1| ≤ 1 and |ˆh2| ≤ 1 given in Proposition 2.5, yield

|ˆa2|

ηθp2p−2(a, b) ≤ 1; |ˆb2|

ηθp2p−2(a, b) ≤ 1; |ˆc2|

ηθp2p−2(a, b) ≤ 1.

Thus,

¯¯

¯¯2ψθp(a, b)

∂a2

¯¯

¯¯ =

¯¯

¯¯

¯(ˆh1− 1)2 + (p − 1) ˆa1+ ˆa2

η2p−1θp (a, b)(ηθp(a, b) − (a + b))

¯¯

¯¯

¯

≤ |(ˆh1− 1)2| + (p − 1) ï¯

¯¯

¯

ˆa1+ ˆa2

ηθp2p−1(a, b)ηθp(a, b)

¯¯

¯¯

¯+

¯¯

¯¯

¯

ˆa1+ ˆa2

η2p−1θp (a, b)(a + b)

¯¯

¯¯

¯

!

≤ 4 + (1 + 2κ)(p − 1)

à ˆa1

ηθp2p−2(a, b) + |ˆa2| η2p−2θp (a, b)

!

≤ 4 + 3(1 + 2κ)(p − 1);

¯¯

¯¯2ψθp(a, b)

∂a∂b

¯¯

¯¯ =

¯¯

¯¯

¯(ˆh1− 1)(ˆh2 − 1) + (p − 1) ˆa1+ ˆa2

ηθp2p−1(a, b)(ηθp(a, b) − (a + b))

¯¯

¯¯

¯

≤ |(ˆh1− 1)(ˆh2 − 1)|

+(p − 1) ï¯

¯¯

¯

ˆb1 + ˆb2

ηθp2p−1(a, b)ηθp(a, b)

¯¯

¯¯

¯+

¯¯

¯¯

¯

ˆb1+ ˆb2

ηθp2p−1(a, b)(a + b)

¯¯

¯¯

¯

!

≤ 4 + (1 + 2κ)(p − 1)

à |ˆb1|

ηθp2p−2(a, b) + |ˆb2| η2p−2θp (a, b)

!

≤ 4 + 2(1 + 2κ)(p − 1);

¯¯

¯¯2ψθp(a, b)

∂b2

¯¯

¯¯ =

¯¯

¯¯

¯(ˆh2− 1)2 + (p − 1) ˆc1+ ˆc2

η2p−1θp (a, b)(ηθp(a, b) − (a + b))

¯¯

¯¯

¯

≤ |(ˆh2− 1)2| + (p − 1) ï¯

¯¯

¯

ˆc1+ ˆc2

ηθp2p−1(a, b)ηθp(a, b)

¯¯

¯¯

¯+

¯¯

¯¯

¯

ˆc1+ ˆc2

η2p−1θp (a, b)(a + b)

¯¯

¯¯

¯

!

≤ 4 + (1 + 2κ)(p − 1) Ã

ˆc1

ηθp2p−2(a, b) + |ˆc2| η2p−2θp (a, b)

!

≤ 4 + 3(1 + 2κ)(p − 1).

Hence, there exists a positive constant L such that (2.8) holds by Lemma 2.1.

Combining Cases 1–3, we complete the proof. 2

(13)

Remark 2.1 It should be noted that ∇ψθp is not Lipschitz continuous for all θ ∈ (0, 1] when p ∈ (1, 2). In fact, if we fixed θ = 1. For (a, b) 6= (0, 0) and (c, d) 6= (0, 0), we have

k∇ψ1p(a, b) − ∇ψ1p(c, d)k

= k∇φ1p(a, b)φ1p(a, b) − ∇φ1p(c, d)φ1p(c, d)k

¯¯

¯¯sgn(a)|a|p−1 k(a, b)kp−1p

φ1p(a, b) − sgn(c)|c|p−1 k(c, d)kp−1p

φ1p(c, d) + φ1p(c, d) − φ1p(a, b)

¯¯

¯¯

¯¯

¯¯sgn(a)|a|p−1 k(a, b)kp−1p

φ1p(a, b) − sgn(c)|c|p−1 k(c, d)kp−1p

φ1p(c, d)

¯¯

¯¯ − |φ1p(c, d) − φ1p(a, b)|

¯¯

¯¯sgn(a)|a|p−1 k(a, b)kp−1p

φ1p(a, b) − sgn(c)|c|p−1 k(c, d)kp−1p

φ1p(c, d)

¯¯

¯¯ − (κ +

√2)k(c, d) − (a, b)k,

where κ +√

2 is given in Proposition 2.3(iv). If we let (a, b) = (1, −n), (c, d) = (−1, −n) with n ∈ (1, ∞), we have

¯¯

¯¯sgn(a)|a|p−1 k(a, b)kp−1p

φ1p(a, b) − sgn(c)|c|p−1 k(c, d)kp−1p

φ1p(c, d)

¯¯

¯¯

=

p

1 + np+ (n − 1) (1 + np)(p−1)/p +

p

1 + np+ (n + 1) (1 + np)(p−1)/p

= 2

p

1 + np+ n (1 + np)(p−1)/p

4n

(1 + np)(p−1)/p

= 4n2−pnp−1 (1 + np)(p−1)/p

= 4n2−p

(1 + (1/n)p)(p−1)/p

≥ n2−p,

where the first and the second inequalities follow from 2 > p > 1 and n > 1. Since k(a, b) − (c, d)k = 2 and n ∈ (1, ∞), form the above inequalities it is easy to verify that ∇ψ1p is not Lipschitz continuous.

3 Properties of Merit Function

In this section, we consider the merit function for the NCP defined by (1.5), and then discuss its several important properties. These properties provide the theoretical basis for the algorithm we discussed in the next section. In addition, we also discuss the semismooth- related properties of the merit function.

(14)

Define

Φθp(x) :=

φθp(x1, F1(x)) . . . φθp(xn, Fn(x))

 . (3.1)

Then, the merit function defined by (1.5) can be written as

Ψθp(x) = 1

2θp(x)k2 = Xn

i=1

ψθp(xi, Fi(x)). (3.2)

Proposition 3.1 (i) The function ψθp defined by (2.4) with p ≥ 2, θ ∈ (0, 1] is an SC1 function. Hence, if every Fi is an SC1 function, then the function Ψθp defined by (3.2) with p ≥ 2, θ ∈ (0, 1] is also an SC1 function.

(ii) If every Fi is an LC1 function, then the function Φθp defined by (3.1) with p > 1, θ ∈ (0, 1] is strongly semismooth.

(iii) The function ψθp defined by (2.4) with p ≥ 2, θ ∈ (0, 1] is an LC1 function. Hence, if every Fi is an LC1function, then the function Ψθpdefined by (3.2) with p ≥ 2, θ ∈ (0, 1]

is also an LC1 function.

Proof. (i) By Proposition 2.5, it is sufficient to prove that ∇ψθp is semismooth. It is obvious from the proof of Theorem 2.1 that ∇ψθp(a, b) is continuously differentiable when (a, b) 6= (0, 0), so we only need to show the semismoothness of ∇ψθp(a, b) at (0, 0). For any (h1, h2) ∈ <2\{(0, 0)}, we know that ∇ψθp is differentiable at (h1, h2), and hence, we only need to show that

∇ψθp(h1, h2) − ∇ψθp(0, 0) − ∇2ψθp(h1, h2) · (h1, h2)T = o(k(h1, h2)k). (3.3) In fact, let ˆa1, ˆa2, ˆb1, ˆb2, ˆc1, ˆc2 be similarly defined as those in Theorem 2.1 with (a, b) being replaced by (h1, h2). Denote

ˆh3 := (p − 1) ˆa1+ ˆa2

ηθp2p−1(h1, h2)φθp(h1, h2);

ˆh4 := (p − 1) ˆb1+ ˆb2

ηθp2p−1(h1, h2)φθp(h1, h2);

ˆh5 := (p − 1) ˆc1+ ˆc2

ηθp2p−1(h1, h2)φθp(h1, h2), and

m1 : = (θ|h1|p−2+ (1 − θ)|h1− h2|p−2pθp(h1, h2)h1− ˆh21ηθp2p−2(h1, h2)h1;

(15)

m2 : = (1 − θ)|h1− h2|p−2ηθpp (h1, h2)h2+ ˆh1ˆh2ηθp2p−2(h1, h2)h2; m3 : = (θ|h1|p−2+ (1 − θ)|h1− h2|p−2pθp(h1, h2)h1

−(1 − θ)|h1− h2|p−2ηθpp (h1, h2)h2; m4 : = ˆh1ˆh2ηθp2p−2(h1, h2)h2+ ˆh21ηθp2p−2(h1, h2)h1;

m5 : = (θsgn(h1)|h1|p−1+ (1 − θ)sgn(h1− h2)|h1− h2|p−1θpp (h1, h2);

m6 : = ˆh1ˆh2ηθp2p−2(h1, h2)h2+ ˆh21ηθp2p−2(h1, h2)h1. Then,

µ H1 H2

¶ :=

à ˆh1− 1 ˆh2− 1

!

· φθp(h1, h2) − µ 0

0

Ã

(ˆh1− 1)2+ ˆh3 (ˆh1− 1)(ˆh2− 1) + ˆh4

(ˆh1− 1)(ˆh2− 1) + ˆh4 (ˆh2− 1)2+ ˆh5

!

· µ h1

h2

.

and hence,

H1 = (ˆh1− 1)φθp(h1, h2) − ((ˆh1− 1)2+ ˆh3)h1− ((ˆh1− 1)(ˆh2− 1) + ˆh4)h2

= (ˆh1− 1)φθp(h1, h2) − ˆh3h1− ˆh4h2− (ˆh1− 1)((ˆh1− 1)h1+ (ˆh2− 1)h2)

= (ˆh1− 1)φθp(h1, h2) − ˆh3h1− ˆh4h2− (ˆh1− 1)φθp(h1, h2)

= −(p − 1) Ã

ˆa1+ ˆa2

ηθp2p−1(h1, h2)h1+ ˆb1+ ˆb2 ηθp2p−1(h1, h2)h2

!

φθp(h1, h2)

= −(p − 1)φθp(h1, h2) Ã

m1 − m2

η2p−1θp (h1, h2)

!

= −(p − 1)φθp(h1, h2)

à m3 − m4 η2p−1θp (h1, h2)

!

= −(p − 1)φθp(h1, h2)

à m5 − m6 η2p−1θp (h1, h2)

!

= −(p − 1)φθp(h1, h2) Ã

ˆh1− ˆh1ˆh1h1+ ˆh2h2 ηθp(h1, h2)

!

= −(p − 1)φθp(h1, h2)(ˆh1− ˆh1)

= 0,

where the third equality follows from ˆh1h1+ ˆh2h2 = ηθp given in the proof of Proposition 2.3 and the definition of φθp, the fourth equality follows from the definitions of ˆh3, ˆh4, the fifth equality follows from the definitions of ˆa1, ˆa2, ˆb1, ˆb2, and the eighth equality follows from ˆh1h1+ ˆh2h2 = ηθp given in the proof of Proposition 2.3.

Similar analysis yields H2 = 0. Thus, ∇ψθp is semismooth. Furthermore, ψθp is an SC1 function.

(16)

(ii) Since the LC1 function is strongly semismooth and the composition of strongly semismooth function is also strongly semismooth, it follows from Proposition 2.3(vi) that the desired results holds.

(iii) By using the above results, it is easy that the result (iii) holds.

We complete the proof. 2

Remark 3.1 The results of Proposition 3.1(i)(iii) do not hold when p ∈ (1, 2) for all θ ∈ (0, 1] since ∇ψθp is not locally Lipschitz continuous in general. For example, let (a, b) = (1n, −1) and (c, d) = (−1n, −1), similar to Remark 2.1, we can obtain that ∇ψθp is not Lipschitz continuous in any neighborhood of (0, −1).

Definition 3.1 Let F : <n→ <n.

• F is said to be monotone if (x − y)T(F (x) − F (y)) ≥ 0 for all x, y ∈ <n.

• F is said to be strongly monotone with modulus µ > 0 if (x − y)T(F (x) − F (y)) ≥ µkx − yk2 for all x, y ∈ <n.

• F is said to be a P0-function if max1≤j≤n,xi6=yi(xi − yi)(Fi(x) − Fi(y)) ≥ 0 for all x, y ∈ <n and x 6= y.

• F is said to be a uniform P -function with modulus µ > 0 if max1≤j≤n(xi− yi)(Fi(x) − Fi(y)) ≥ µkx − yk2 for all x, y ∈ <n.

Proposition 3.2 Let Ψθp : <n → < be defined by (3.2) with p > 1, θ ∈ (0, 1]. Then Ψθp(x) ≥ 0 for all x ∈ <n and Ψθp(x) = 0 if and only if x solves the NCP (1.1). Moreover, suppose that the solution set of the NCP (1.1) is nonempty, then x is a global minimizer of Ψθp if and only if x solves the NCP (1.1).

Proof. The result follows from Proposition 2.5 immediately. 2

Proposition 3.3 Let Ψθp: <n→ < be defined by (3.2) with p > 1, θ ∈ (0, 1]. Suppose that F is either a monotone function or a P0-function, then every stationary point of Ψθp is a global minima of minx∈<nΨθp(x); and therefore solves the NCP (1.1).

Proof. By using Proposition 2.5 and [5, Lemma 2.1], the proof of the proposition is similar to the one given in [5, Proposition 3.4]. We omit it here. 2

參考文獻

相關文件

Chen, The semismooth-related properties of a merit function and a descent method for the nonlinear complementarity problem, Journal of Global Optimization, vol.. Soares, A new

Particularly, combining the numerical results of the two papers, we may obtain such a conclusion that the merit function method based on ϕ p has a better a global convergence and

Then, it is easy to see that there are 9 problems for which the iterative numbers of the algorithm using ψ α,θ,p in the case of θ = 1 and p = 3 are less than the one of the

Numerical results are reported for some convex second-order cone programs (SOCPs) by solving the unconstrained minimization reformulation of the KKT optimality conditions,

For different types of optimization problems, there arise various complementarity problems, for example, linear complemen- tarity problem, nonlinear complementarity problem

11 (1998) 227–251] for the nonnegative orthant complementarity problem to the general symmet- ric cone complementarity problem (SCCP). We show that the class of merit functions

In this paper, we have shown that how to construct complementarity functions for the circular cone complementarity problem, and have proposed four classes of merit func- tions for

Huang, A nonmonotone smoothing-type algorithm for solv- ing a system of equalities and inequalities, Journal of Computational and Applied Mathematics, vol. Hao, A new