3 Properties of the New NCP-Function

(1)

to appear in Computational Optimization and Applications, 2010

A new class of penalized NCP-functions and its properties

Jein-Shan Chen ¹ Department of Mathematics National Taiwan Normal University

Taipei, Taiwan 11677 E-mail: jschen@math.ntnu.edu.tw

Zheng-Hai Huang ² Department of Mathematics

Tianjin University Tianjin 300072, China E-mail: huangzhenghai@tju.edu.cn

Chin-Yu She

Department of Mathematics National Taiwan Normal University

Taipei, Taiwan 11677 E-mail: bbgmmshe@hotmail.com

April 17, 2009

(ﬁrst revised on September 15, 2009) (second revised on December 18, 2009)

Abstract. In this paper, we consider a class of penalized NCP-functions, which includes several existing well-known NCP-functions as special cases. The merit function induced by this class of NCP-functions is shown to have bounded level sets and provide error bounds under mild conditions. A derivative free algorithm is also proposed, its global

1Member of Mathematics Division, National Center for Theoretical Sciences, Taipei Oﬃce. The author’s work is partially supported by National Science Council of Taiwan.

2This author’s work is partly supported by by the National Natural Science Foundation of China (Grant No. 10871144).

(2)

convergence is proved and numerical performance compared with those based on some existing NCP-functions is reported.

Key Words. NCP-function, penalized, bounded level sets, error bounds.

1 Introduction

The nonlinear complementarity problem (NCP) is to ﬁnd a point x∈ IRⁿ such that

x≥ 0, F (x) ≥ 0, ⟨x, F (x)⟩ = 0, (1)

where ⟨·, ·⟩ is the Euclidean inner product and F = (F1, . . . , F_n)^T is a map from IRⁿ to IRⁿ. We assume that F is continuously diﬀerentiable throughout this paper. The NCP has attracted much attention because of its wide applications in the ﬁelds of economics, engineering, and operations research [6, 14].

Many methods have been proposed to solve the NCP; see [3, 5, 14, 16, 17, 18, 20, 23, 26, 30, 31]. For more details, please refers to the excellent monograph [9]. One of the most powerful and popular methods is to reformulate the NCP as a system of nonlinear equations [24, 25, 31], or as an unconstrained minimization problem [7, 10, 11, 12, 19, 21, 27, 30]. The objective function that can constitute an equivalent unconstrained minimization problem is called a merit function, whose global minima are coincident with the solutions of the original NCP (1). To construct a merit function, a class of functions called NCP-functions and deﬁned below, plays a signiﬁcant role.

Definition 1.1 A function ϕ : IR² → IR is called an NCP-function if it satisﬁes

ϕ(a, b) = 0 ⇐⇒ a ≥ 0, b ≥ 0, ab = 0. (2)

Many NCP-functions have been proposed in the literature. Among them, the Fischer- Burmeister (FB) function is one of the most popular NCP-functions, which is deﬁned by

ϕ_FB(a, b) =√

a²+ b² − (a + b), ∀(a, b) ∈ IR². (3) Through this NCP-function ϕ_FB, the NCP (1) can be reformulated as a system of nons- mooth equations:

Φ_FB(x) :=







ϕ_FB(x1 , F1(x))

··

·

ϕ_FB(xn , Fn(x))





= 0. (4)

(3)

Thus, the function Ψ_FB : IRⁿ → IR+ deﬁned as below is a merit function for the NCP:

Ψ_FB(x) := 1

2∥ΦFB(x)∥² =

∑n i=1

ψ_FB(x_i, F_i(x)), (5)

where ψ_FB : IR² → IR+ is the square of ϕ_FB, i.e., ψ_FB(a, b) = 1

2 √

a²+ b²− (a + b)². (6) Consequently, the NCP is equivalent to the unconstrained minimization problem:

xmin∈IRⁿΨ_FB(x). (7)

There are several generalizations of the FB function in the literature. For example, Kanzow and Kleinmichel [22] extend ϕ_FB function to

ϕ_θ(a, b) :=√

(a− b)²+ θab− (a + b), θ ∈ (0, 4).

Chen, Chen, and Kanzow [2] study a penalized FB function

ϕ_λ(a, b) := λϕ_FB(a, b) + (1− λ)a+b₊, λ ∈ (0, 1).

Some other types of penalized FB functions are also investigated by Sun and Qi in [28].

Recently, Chen and Pan [3, 5] consider the following generalization of the FB function:

ϕ_p(a, b) :=∥(a, b)∥p− (a + b), (8) where p > 1 and ∥(a, b)∥p denotes the p-norm of (a, b), i.e., ∥(a, b)∥p = √^p

|a|^p+|b|^p. Another further generalization is proposed by Hu, Huang and Chen in [15]:

ϕθ,p(a, b) :=√^p

θ(|a|^p+|b|^p) + (1− θ)(|a − b|^p)− (a + b), (9) where p > 1, θ∈ (0, 1].

All the aforementioned functions are NCP-functions. The corresponding function ψ_θ, ψ_λ, ψ_p, and ψ_θ,p is square of ϕ_θ, ϕ_λ, ϕ_p, and ϕ_θ,p, respectively, and naturally induces a merit function Ψθ, Ψλ, Ψp, and Ψθ,p like what ψ_FB function does. Along this track, in this paper, we study the following merit function Ψ_α,θ,p : IRⁿ→ IR+ for the NCP:

Ψ_α,θ,p(x) :=

∑n i=1

ψ_α,θ,p(x_i , F_i(x)), (10)

where ψ_α,θ,p : IR² → IR+ is an NCP-function deﬁned by ψ_α,θ,p(a, b) := α

2(max{0, ab})² + ψ_θ,p(a, b) (11)

(4)

with α≥ 0 being a real parameter. Note that ψα,θ,p includes all the above functions ψ_FB, ψ_p, ψ_θ, ψ_θ,p (and ψ₇ in [28]) as special cases. Although ψ_α,θ,p is obtained by penalizing the function ψ_θ,p considered in [15], more favorable properties of ψ_α,θ,p are explored in this work. In particular, Ψα,θ,p has property of bounded level sets and provides a global error bound for the NCP under mild condition which were not studied in [15]. Thus, this paper can be viewed as a follow-up of [15]. On the other hand, as remarked in [2], penalized Fischer-Burmeister (FB) function not only possesses stronger properties than FB function but also gives extremely promising numerical performance, which is another motivation of our considering this generalization of several NCP-functions.

This paper is organized as follows. In Section 2, we review some deﬁnitions and preliminary results to be used in the subsequent analysis. In Section 3, we show some properties of the proposed merit function. In Section 4, we propose a derivative free algorithm based on this merit function Ψ_α,θ,p, show its global convergence, and report some numerical results. In Section 5, we make concluding remarks.

Throughout this paper, IRⁿ denotes the space of n-dimensional real column vectors and ^T denotes transpose. For every diﬀerentiable function f : IRⁿ → IR, ∇f(x) denotes the gradient of f at x. For every diﬀerentiable mapping F = (F₁, . . . , F_n)^T : IRⁿ → IRⁿ,

∇F (x) = (∇F1(x) . . . ∇Fn(x)) denotes the transpose Jacobian of F at x. We use ∥x∥p

to denote the p-norm of x and denote ∥x∥ the Euclidean norm of x. The level set of a function Ψ : IRⁿ → IR is denoted by L(Ψ, c) := {x ∈ IRⁿ | Ψ(x) ≤ c}. In addition, we will frequently mention two merit functions. One is the natural residual merit function Ψ_NR : IRⁿ → IR+ deﬁned by

Ψ_NR(x) := 1 2

∑n i=1

ϕ²

NR(x_i , F_i(x)), (12)

where ϕ_NR : IR² → IR denotes the minimum NCP-function min{a, b}. Another one is Ψ_θ,p : IRⁿ→ IR+ induced by ψ_θ,p:

Ψθ,p(x) := 1 2

∑n i=1

ϕ²_θ,p(xi , Fi(x)). (13)

Unless otherwise stated, in the sequel, we always suppose that p is a ﬁxed real number in (1,∞).

(5)

2 Preliminaries

This section brieﬂy recalls some concepts about the mapping F that will be used later.

A matrix is said to be P -matrix if each of its principal minors is positive, and is called P₀-matrix if each of its principal minors is nonnegative. Obviously, P -matrix is a gen- eralization of positive deﬁnite matrix, while P₀-matrix is a generalization of positive semideﬁnite matrix. Such concepts of P -matrix and P₀-function can be further extended to nonlinear mapping, which we call them P -function and P₀-function.

Definition 2.1 Let F = (F₁, . . . , F_n)^T with F_i : IRⁿ→ IR for i = 1, . . . , n. We say that

(a) F is monotone if ⟨x − y, F (x) − F (y)⟩ ≥ 0 for all x, y ∈ IRⁿ.

(b) F is strongly monotone if ⟨x − y, F (x) − F (y)⟩ ≥ µ∥x − y∥² for some µ > 0 and for all x, y ∈ IRⁿ.

(c) F is a P0-function if max

1≤i≤n xi̸=yi

(xi− yi)(Fi(x)− Fi(y))≥ 0 for all x, y ∈ IRⁿ and x̸= y.

(d) F is a uniform P -function with modulus µ > 0 if max

1≤i≤n(x_i − yi)(F_i(x)− Fi(y)) ≥ µ∥x − y∥² for all x, y∈ IRⁿ.

(e) F is Lipschitz continuous if there exists a constant L > 0 such that ∥F (x)−F (y)∥ ≤ L∥x − y∥ for all x, y ∈ IRⁿ.

It is well-known that every monotone function is an P₀ function and every strongly monotone function is a uniform P -function. For a continuously diﬀerentiable function F , if its (transpose) Jacobian ∇F (x) is an P -matrix then F is an P -function (the converse may not be true), whereas the (transpose) Jacobian ∇F (x) is an P0-matrix if and only if F is an P₀-function. For more detailed properties of various monotone and P (P₀)- function, please refer to [9].

3 Properties of the New NCP-Function

In this section, we study some favorable properties of the merit function ψ_α,θ,p, and then present some mild conditions under which the merit function Ψ_α,θ,phas bounded level sets and provides a global error bound, respectively. To this end, we present some technical lemmas which are needed for subsequent analysis.

(6)

Lemma 3.1 For p > 1, a > 0, b > 0, we have a^p+ b^p ≤ (a + b)^p

Proof. We present two diﬀerent ways to prove this lemma.

(1) For any p > 1, p = n + m, where n = [p] (the greatest integer less than or equal to p) and m = p− n, applying binomial theorem gives

(a + b)^p = (a + b)ⁿ(a + b)^m

≥ (aⁿ+ bⁿ)(a + b)^m

= aⁿ(a + b)^m+ bⁿ(a + b)^m

≥ aⁿa^m+ bⁿb^m

= a^p+ b^p.

(2) Let f (t) = (t + 1)^p− (t^p+ 1). It is easy to verify that f is increasing on [0,∞) when p > 1. Hence, f (a/b)≥ f(0) = 0 which yields (a + b)^p ≥ a^p+ b^p. 2

Lemma 3.2 The function ψ_α,θ,p deﬁned by (11) has the following favorable properties:

(a) ψ_α,θ,p is an NCP-function and ψ_α,θ,p≥ 0 for all (a, b) ∈ IR².

(b) ψα,θ,p is continuously diﬀerentiable everywhere. Moreover, if (a, b)̸= (0, 0),

∇aψ_α,θ,p(a, b)

= αb(ab)₊+

(θsgn(a)· |a|^p⁻¹+ (1− θ)sgn(a − b)|a − b|^p⁻¹ [θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 1

)

ϕ_θ,p(a, b),

∇bψ_α,θ,p(a, b)

= αa(ab)₊+

(θsgn(b)· |b|^p⁻¹− (1 − θ)sgn(a − b)|a − b|^p⁻¹ [θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 1

)

ϕ_θ,p(a, b), (14) and otherwise, ∇aψ_α,θ,p(0, 0) =∇bψ_α,θ,p(0, 0) = 0.

(c) For p ≥ 2, the gradient of ψα,θ,p is Lipschitz continuous on any nonempty bounded set S, i.e., there exists L > 0 such that for any (a, b), (c, d)∈ S,

∥∇ψα,θ,p(a, b)− ∇ψα,θ,p(c, d)∥ ≤ L∥(a, b) − (c, d)∥.

(d) ∇aψ_α,θ,p(a, b)· ∇bψ_α,θ,p(a, b) ≥ 0 for any (a, b) ∈ IR², and the equality holds if and only if ψ_α,θ,p(a, b) = 0.

(e) ∇aψ_α,θ,p(a, b) = 0⇐⇒ ∇bψ_α,θ,p(a, b) = 0⇐⇒ ψα,θ,p(a, b) = 0.

(f ) Suppose that α > 0. If a→ −∞ or b → −∞ or ab → ∞, then ψα,θ,p(a, b)→ ∞.

(7)

Proof. (a) It is clear that ψ_α,θ,p(a, b) ≥ 0 for all (a, b) ∈ IR² from the deﬁnition of ψ_α,θ,p. Then by [15, Prop. 2.1], we have

ψ_α,θ,p(a, b) = 0⇐⇒ α

2(max{0, ab})² = 0 and ψ_θ,p(a, b) = 0⇐⇒ a ≥ 0, b ≥ 0, ab = 0.

Hence, ψ_α,θ,p is an NCP-function.

(b) First, direct calculations give the partial derivatives of ψ_α,θ,p. Then, using αb(ab)₊ → (0, 0) and αa(ab)+ → (0, 0) as (a, b) → (0, 0), we have ^α₂(max{0, ab})² is continuously differentiable everywhere. By [15, Prop. 2.5], it is known that ψ_θ,p is continuously differentiable everywhere. In view of the expression of ∇ψα,θ,p(a, b), ψ_α,θ,p is also continuously differentiable everywhere.

(c) First, we claim that a(ab)₊ for any a, b∈ IR is Lipschitz continuous on any nonempty bounded set S. For any (a, b) ∈ S and (c, d) ∈ S, without loss of generality, we may assume that a²+ b² ≤ k and c²+ d² ≤ k which imply |a| ≤ k + 1, |b| ≤ k + 1, |c| ≤ k + 1 and |d| ≤ k + 1. Then,

a(ab)⁺− c(cd)+

= 1 2

a²b + a|ab| − c²d− c|cd|

= 1 2

a²b− a²d + a²d− c²d + a|ab| − c|ab| + c|ab| − c|cd|

≤ 1 2

(

|a²b− a²d| + |a²d− c²d| +a|ab| − c|ab|+c|ab| − c|cd|)

= 1 2

(

a²|b − d| + |a + c||d||a − c| + |ab||a − c| + |c||ab − cd|

)

≤ 1 2

[

k|b − d| + (|a| + |c|)|d||a − c| + k|a − c| + (k + 1)|ab − ad + ad − cd|

]

≤ 1 2

[

k|b − d| + 2(k + 1)²|a − c| + k|a − c| + (k + 1)²(|b − d| + |a − c|) ]

= 1 2

{ [2(k + 1)²+ k + (k + 1)²]

|a − c| +[

k + (k + 1)²]

|b − d|

}

≤ l(

|a − c| + |b − d|)

≤ √

2l∥(a, b) − (c, d)∥,

where l = 2(k + 1)² + k + (k + 1)². Hence, the mapping a(ab)₊ is Lipschitz continuous on any nonempty bounded set S and so is αa(ab)₊. Similarly, αb(ab)₊ is Lipschitz continuous on any nonempty bounded set S. All of these imply the gradient function of the function ^α₂(max{0, ab})² is Lipschitz continuous on any bounded set S. On the

(8)

other hand, by [15, Theorem 2.1], the gradient function of the function ψ_θ,p with p≥ 2, θ ∈ (0, 1] is Lipschitz continuous. Thus, the gradient of ψα,θ,p is Lipschitz continuous on any nonempty bounded set S.

(d) If (a, b) = (0, 0), part (d) clearly holds. Now we assume that (a, b)̸= (0, 0). Then,

∇aψα,θ,p(a, b)· ∇bψα,θ,p(a, b) (15)

= cdϕ²_θ,p(a, b) + α²ab(ab)₊²+ αa(ab)₊cϕ_θ,p(a, b) + αb(ab)₊dϕ_θ,p(a, b), where

c =

) , d =

(θsgn(b)· |b|^p⁻¹− (1 − θ)sgn(a − b)|a − b|^p⁻¹ [θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 1

) . From the proof of [15, Prop. 2.5 ], we know ab(ab)²₊ ≥ 0 and

)

≤ 0, (θsgn(b)· |b|^p⁻¹− (1 − θ)sgn(a − b)|a − b|^p⁻¹

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 1 )

≤ 0, (16)

it suﬃces to show that the last two terms of (15) are nonnegative. For this purpose, we claim that

αa(ab)₊

(θsgn(a)· |a|^p⁻¹+ (1− θ)sgn(a − b)|a − b|^p⁻¹ [θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p−1)/p − 1

)

ϕ_θ,p(a, b)≥ 0 (17) for all (a, b) ̸= (0, 0). If a ≤ 0 and b ≤ 0, then ϕθ,p(a, b) ≥ 0, which together with the second inequality in (16) implies that (17) holds. If a ≤ 0 and b ≥ 0, then (ab)+ = 0, which says that (17) holds. If a > 0 and b > 0, then |a|^p +|b|^p ≥ |a − b|^p. Thus, ϕ_θ,p(a, b) ≤ ϕp(a, b)≤ 0, which together with the second inequality in (16) yields (17). If a > 0 and b≤ 0, then (ab)+= 0, and hence (17) holds. Similarly, we also have

αb(ab)₊

(θsgn(b)· |b|^p−1− (1 − θ)sgn(a − b)|a − b|^p−1 [θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 1

)

ϕ_θ,p(a, b)≥ 0

for all (a, b) ̸= (0, 0). Consequently, ∇aψ_α,θ,p(a, b) · ∇bψ_α,θ,p(a, b) ≥ 0. Besides, by the proof of [15, Prop. 2.5], we know c = 0 if and only if b = 0 and a > 0; d = 0 if and only if a = 0 and b > 0. This together with (15) says ∇aψ_α,θ,p(a, b)· ∇bψ_α,θ,p(a, b) = 0 if and only if {ψθ,p(a, b) = 0 and α²ab(ab)₊² = 0} or {c = 0} or {d = 0} if and only if {ψθ,p(a, b) = 0 and ab ≤ 0} or {c = 0} or {d = 0} if and only if ψθ,p(a, b) = 0 and

α

2(max{0, ab})² = 0 if and only if ψ_α,θ,p(a, b) = 0.

(9)

(e) If ψ_α,θ,p(a, b) = 0, then ^α₂(max{0, ab})² = 0 and ψ_θ,p(a, b) = 0, which imply ab ≤ 0 and ϕ_θ,p(a, b) = 0. Hence, ∇aψ_α,θ,p(a, b) = 0 and ∇bψ_α,θ,p(a, b) = 0. Now, it remains to show that ∇aψ_α,θ,p(a, b) = 0 implying ψ_α,θ,p(a, b) = 0. Suppose that ∇aψ_α,θ,p(a, b) = 0.

Then,

αb(ab)+ =−

)

ϕθ,p(a, b). (18) We will argue that the equality (18) implies(

b = 0, a≥ 0) or(

b > 0, a = 0)

. To see this, we let

c = αb(ab)₊,

d = −

(θsgn(a)· |a|^p⁻¹+ (1− θ)sgn(a − b)|a − b|^p⁻¹ [θ(|a|^p +|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 1

)

ϕ_θ,p(a, b), e =

(θsgn(a)· |a|^p⁻¹+ (1− θ)sgn(a − b)|a − b|^p⁻¹ [θ(|a|^p +|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 1

) .

It is not hard to observe that (

e ≤ 0) and (

e = 0 implies b = 0)

which are helpful for the following discussions.

Case 1: b = 0 and a < 0. Then, c = 0 but d̸= 0 which violates (18).

Case 2: b < 0 and a ≥ 0. Then, we have e < 0, and hence c = 0 but d ̸= 0, which violates (18).

Case 3: b < 0 and a < 0. Then, we have e < 0 and ϕθ,p(a, b) > 0, which lead to c ≤ 0 but d > 0. This contradicts to (18) too.

Case 4: b > 0 and a > 0. Then, we have e < 0 and ϕ_θ,p(a, b) < 0, which imply c≥ 0 but d < 0. This contradicts to (18) too.

Case 5: b > 0 and a < 0. Similar arguments as in Case 2 cause a contradiction.

Thus, (18) implies (

b = 0, a ≥ 0) or (

b > 0, a = 0)

, and each of which always yields ψα,θ,p(a, b) = 0. By symmetry, ∇bψα,θ,p(a, b) = 0 also implies ψα,θ,p(a, b) = 0.

(f) If a→ −∞ or b → −∞, from [15, Prop. 2.4], we know |ϕθ,p(a, b)| → ∞. In addition, the fact ^α₂(max{0, ab})² ≥ 0 gives ψα,θ,p(a, b) → ∞. If ab → ∞, since α > 0, we have

α

2(max{0, ab})² → ∞. This together with ψθ,p(a, b)≥ 0 says ψα,θ,p(a, b)→ ∞. 2

By Lemma 3.2(a), we immediately have the following theorem.

Theorem 3.1 Let Ψ_α,θ,p be deﬁned as in (10). Then Ψ_α,θ,p(x)≥ 0 for all x ∈ IRⁿ and Ψ_α,θ,p(x) = 0 if and only if x solves the NCP. Moreover, if the NCP has at least one solution, then x is a global minimizer of Ψ_α,θ,p if and only if x solves the NCP.

(10)

Proof. Since ψ_θ,p is an NCP-function, from [15, Prop. 2.5], we have that x solving the NCP ⇐⇒ x ≥ 0, F (x) ≥ 0, ⟨x, F (x)⟩ = 0 ⇐⇒ x ≥ 0, F (x) ≥ 0, xiF_i(x) = 0 for all i ∈ {1, 2, · · · , n} ⇐⇒ Ψα,θ,p(x) = 0. Besides, Ψ_α,θ,p(x) is nonnegative. Thus, if x solves the NCP, then x is a global minimizer of Ψα,θ,p. Next, we claim that if the NCP has at least one solution, then x is a global minimizer of Ψ_α,θ,p =⇒ x solves the NCP.

Suppose x does not solve the NCP. From x solves the NCP ⇐⇒ Ψα,θ,p(x) = 0 and Ψ_α,θ,p(x) is nonnegative, it is clear Ψ_α,θ,p(x) > 0. However, by assumption, the NCP has a solution, say y, which makes that Ψα,θ,p(y) = 0. Then, we get a contradiction that Ψ_α,θ,p(x) > 0 = Ψ_α,θ,p(y) and x is a global minimizer of Ψ_α,θ,p. Thus, we complete the proof. 2

Theorem 3.1 indicates that the NCP can be recast as the unconstrained minimization:

xmin∈IRⁿΨ_α,θ,p(x). (19)

In general, it is hard to ﬁnd a global minimum of Ψ_α,θ,p. Therefore, it is important to know under what conditions a stationary point of Ψ_α,θ,p is a global minimum. Using Lemma 3.2(d) and the same proof techniques as in [21, Theorem 3.5], we can establish that each stationary point of Ψα,θ,p is a global minimum only if F is a P0-function.

Theorem 3.2 Let F be a P₀-function. Then x^∗ ∈ IRⁿ is a global minimum of the unconstrained optimization problem (19) if and only if x^∗ is a stationary point of Ψα,θ,p.

Theorem 3.3 The function Ψ_α,θ,p has bounded level sets L(Ψα,θ,p, c) for all c∈ IR, if F is monotone and the NCP is strictly feasible (i.e., there exists ˆx > 0 such that F (ˆx) > 0) when α > 0, or F is a uniform P -function when α≥ 0.

Proof. From [2], if F is a monotone function with a strictly feasible point, then the following condition holds: for every sequence {x^k} such that ∥x^k∥ → ∞, (−x^k)₊ <

∞, and (−F (x^k))₊ < ∞, we have max

1≤i≤n

{(x^k_i)₊F_i(x^k)₊}

→ ∞. Suppose that there exists an unbounded sequence x^k ⊆ L(Ψα,θ,p, c) for some c ∈ IR. Since Ψα,θ,p(x^k) ≤ c, there is no index i such that x^k_i → −∞ or Fi(x^k) → −∞ by Lemma 3.2(f). Hence,

1≤i≤nmax

{(x^k_i)₊F_i(x^k)₊}

→ ∞. Also, there is an index j, and at least a subsequence x^kj

such that {

(x^k_j)₊F_j(x^k)₊}

→ ∞. However, this implies that Ψα,θ,p(x^k) is unbounded by Lemma 3.2(f), contracting to the assumption on level sets. Another part of the proof is similar to the proof of [5, Prop. 3.5]. 2

(11)

In what follows, we show that the merit functions Ψ_θ,p, Ψ_NR and Ψ_α,θ,phave the same order of growth behavior on every bounded set. For this purpose, we need the following crucial technical lemma.

Lemma 3.3 Let ϕ_θ,p : IR² → IR be deﬁned as in (9). Then for any p > 1 and all θ ∈ (0, 1] we have

(2− 2¹^p)| min{a, b}| ≤ |ϕθ,p(a, b)| ≤ (2 + 2¹^p)| min{a, b}|. (20)

Proof. Without loss of generality, we assume a≥ b. We will prove the desired results by considering the following two cases: (1) a + b≤ 0 and (2) a + b > 0.

Case(1): a + b≤ 0. In this case, we need to discuss two subcases:

(i)|a|^p+|b|^p ≥ |a − b|^p. In this subcase, we have

|ϕθ,p(a, b)| ≥ |√^p

θ(|a − b|^p) + (1− θ)(|a − b|^p)− (a + b)|

= |√^p

(|a − b|^p)− (a + b)|

= |(|a − b| − (a + b)|

= |a − b − (a + b)|

= |2b|

= 2| min{a, b}|

≥ (2 − 2¹^p)| min{a, b}| (21)

On the other hand, since |a|^p+|b|^p ≥ |a − b|^p and by [4, Lemma 3.2], we have

|ϕθ,p(a, b)| ≤ |ϕp(a, b)| ≤ (2 + 2¹^p)| min{a, b}|. (22)

(ii) |a|^p +|b|^p <|a − b|^p. Since |a|^p+|b|^p <|a − b|^p and by [4, Lemma 3.2], we have

|ϕθ,p(a, b)| > |ϕp(a, b)| ≥ (2 − 2¹^p)| min{a, b}|. (23) On the other hand, by the discussion of Case(1),

|ϕθ,p(a, b)| < 2|b| ≤ (2 + 2¹^p)| min{a, b}|. (24) Case(2): a + b > 0. If ab=0, then (20) clearly holds. Thus, we proceed the arguments by discussing two subcases:

(i) ab < 0. In this subcases, we have a > 0, b < 0,|a| > |b|. By Lemma 3.1, |a|^p+|b|^p ≤

|a − b|^p. Then,

ϕ_θ,p(a, b)≥ ϕp(a, b)≥ |a| − a − b ≥ −b = | min{a, b}| ≥ (2 − 2¹^p)| min{a, b}|. (25)

(12)

On the other hand,

ϕ_θ,p(a, b)≤ |a − b| − (a + b) = −2b = 2| min{a, b}| ≤ (2 + 2¹^p)| min{a, b}|. (26)

(ii) ab > 0. In this subcases, we have a ≥ b > 0, |a|^p+|b|^p ≥ |a − b|^p. By Lemma 3.1, ϕ_θ,p(a, b) ≤ ϕp(a, b)≤ 0 . Notice that ϕθ,p(a, b)≥ |a − b| − (a + b) = −2b = −2 min{a, b}, and hence we obtain that

|ϕθ,p(a, b)| ≤ 2| min{a, b}| ≤ (2 + 2^p¹)| min{a, b}|. (27) On the other hand, since ϕ_θ,p(a, b) ≤ ϕp(a, b)≤ 0 , and by [4, Lemma 3.2], and hence we obtain that

|ϕθ,p(a, b)| ≥ |ϕp(a, b)| ≥ (2 − 2¹^p)| min{a, b}|. (28) All the aforementioned inequalities (21)-(28) imply that (20) holds. 2

Proposition 3.1 Let Ψ_θ,p, Ψ_NR and Ψ_α,θ,p be deﬁned as in (13), (12) and (10), respec- tively. Let S be an arbitrary bounded set. Then, for any p > 1, we have

(2− 2¹^p)²Ψ_NR(x)≤ Ψθ,p(x)≤ (2 + 2¹^p)²Ψ_NR(x) for all x∈ IRⁿ (29) and

(2− 2¹^p)²Ψ_NR(x)≤ Ψα,θ,p(x)≤ (αB²+ (2 + 2¹^p)²)Ψ_NR(x) for all x∈ S, (30) where B is a constant deﬁned by B = max

1≤i≤n

{ sup

x∈S{max {|xi|, |Fi(x)|}}

}

<∞.

Proof. The inequality in (29) is direct by Lemma 3.3 and the deﬁnitions of Ψ_θ,p and Ψ_NR. In addition, from Lemma 3.3 and the deﬁnition of Ψ_α,θ,p, it follows that

Ψ_α,θ,p(x)≥(

2− 2¹^p)2

Ψ_NR(x) for all x∈ IRⁿ.

It remains to prove the inequality on the right hand side of (30). From the proof of [4, Prop. 3.1], we know for each i,

(xiFi(x))+≤ B| min{xi, Fi(x)}| for all x ∈ S. (31) By Lemma 3.3 and (31), for all i = 1, . . . , n and x∈ S,

ψ_α,θ,p(x_i, F_i(x))≤ 1 2

{

αB²+ (2 + 2¹^p)² }

min{xi, F_i(x)}²

holds for any p > 1. The proof is then complete by the deﬁnitions of Ψ_α,θ,p and Ψ_NR. 2

From Proposition 3.1, we immediately obtain the following result.

(13)

Corollary 3.1 Let Ψ_θ,p and Ψ_α,θ,p be deﬁned by (13) and (10), respectively; and S be any bounded set. Then, for any p > 1 and all x∈ S, we have the following inequalities:

(2− 2¹^p)² (

αB²+ (2 + 2¹^p)²

)Ψα,θ,p(x)≤ Ψθ,p(x)≤ (2 + 2¹^p)² (2− 2¹^p)²

Ψ_α,θ,p(x)

where B is the constant deﬁned as in Proposition 3.1.

Since Ψ_θ,p, Ψ_NR and Ψ_α,θ,p have the same order on a bounded set, one will provide a global error bound for the NCP as long as the other one does. As below, we show that Ψ_α,θ,p provides a global error bound without the Lipschitz continuity of F when α > 0.

Theorem 3.4 Let Ψ_α,θ,p be deﬁned as in (10). Suppose that F is a uniform P -function with modulus µ > 0. If α > 0, then there exists a constant κ₁ > 0 such that

∥x − x^∗∥ ≤ κ1Ψ_α,θ,p(x)¹⁴ for all x∈ IRⁿ;

if α = 0 and S is any bounded set, there exists a constant κ₂ > 0 such that

∥x − x^∗∥ ≤ κ2

( max

{

Ψ_α,θ,p(x),

√

Ψ_α,θ,p(x) })¹

2

for all x∈ S;

where x^∗ = (x^∗₁,· · · , x^∗_n) is the unique solution for the NCP.

Proof. By the proof of [4, Theorem 3.4], we have µ∥x − x^∗∥² ≤ max

1≤i≤nτ_i{(xiF_i(x))₊+ (−Fi(x))₊+ (−xi)₊}, (32) where τ_i := max{1, x^∗i, F_i(x^∗)}. We next prove that for all (a, b) ∈ IR²,

(−a)+2

+ (−b)+2 ≤ [ϕθ,p(a, b)]². (33) To see this, without loss of generality, we assume a≥ b and discuss three cases:

(i) If a≥ b ≥ 0, then (33) holds obviously.

(ii) If a ≥ 0 ≥ b, then |a|^p +|b|^p ≤ |a − b|^p by Lemma 3.1, which implies ϕ_θ,p(a, b) ≥

∥(a, b)∥p− (a + b) ≥ −b ≥ 0. Hence, (−a)+

2+ (−b)+

2 = b² ≤ [ϕθ,p(a, b)]². (iii) If 0≥ a ≥ b, then (−a)+2

+ (−b)+2

= a²+ b² ≤ [ϕθ,p(a, b)]². Hence, (33) follows.

(14)

Suppose that α > 0. Using the inequality (33), we then obtain that

[(ab)++ (−a)++ (−b)+]² = (ab)²₊+ (−b)²++ (−a)²++ 2(ab)+(−a)+

+2(−a)+(−b)++ 2(ab)₊(−b)+

≤ (ab)²₊+ (−b)²₊+ (−a)²₊+ (ab)²₊+ (−a)²₊ +(−a)²++ (−b)²++ (ab)²₊+ (−b)²+

≤ 3[

(ab)²₊+ [ϕ_θ,p(a, b)]²]

≤ τ [α

2(ab)²₊+1

2[ϕ_θ,p(a, b)]² ]

= τ ψ_α,θ,p(a, b), (34)

where τ := max {6

α, 6 }

> 0. Combining (34) with (32) and letting ˆτ = max

1≤i≤nτ_i, we get µ∥x − x^∗∥² ≤ max

1≤i≤nτ_i{τψα,θ,p(x_i, F_i(x))}^1/2

≤ ˆττ^1/2max

1≤i≤nψ_α,θ,p(x_i, F_i(x))^1/2

≤ ˆττ^1/2 { _n

∑

i=1

{ψα,θ,p(xi, Fi(x)) }1/2

= τ τˆ ^1/2Ψ_α,θ,p(x)^1/2.

From this, the ﬁrst desired result follows immediately by setting κ₁ :=[ ˆ

τ τ^1/2/µ]1/2

. Suppose that α = 0. From the proof of Proposition 3.1, the inequality (31) holds.

Combining with equations (32)–(33), it then follows that for all x∈ S, µ∥x − x^∗∥² ≤ max

1≤i≤nτ_i[

B| min{xi, F_i(x)}| + 2(ψθ,p(x_i, F_i(x)))^1/2]

≤ ˆτ max

1≤i≤n

[√2 ˆB(ψ_θ,p(x_i, F_i(x)))^1/2+ 2(ψ_θ,p(x_i, F_i(x)))^1/2 ]

≤ (√

2 ˆB + 2)ˆτ (Ψ_θ,p(x))^1/2

= (√

2 ˆB + 2)ˆτ (Ψα,θ,p(x))^1/2

≤ (√

2 ˆB + 2)ˆτ (max {

Ψα,θ,p(x),

√

Ψα,θ,p(x) }

) where ˆB = B/(2− 2¹^p), ˆτ = max

1≤i≤nτ_iand the second inequality is from Lemma 3.3. Letting κ2 :=

[ (√

2 ˆB + 2)ˆτ /µ ]1/2

, we obtain the desired result from the above inequality. 2

The following lemma is needed for the proof of Proposition 3.2, which we suspect is useful in analysis of convergence rate.

(15)

Lemma 3.4 For all (a, b)̸= (0, 0) and p > 1, we have the following inequality:

( θ[sgn(a)· |a|^p⁻¹+ sgn(b)· |b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 2 )2

≥(

2− 2^p¹)2

∀θ ∈ (0, 1].

Proof. If a = 0 or b = 0, the inequality holds obviously. Then we complete the proof by considering three cases: (i) a > 0 and b > 0, (ii) a < 0 and b < 0, and (iii) ab < 0.

Case (i): Since θ ∈ (0, 1] and p > 1, it follows that θ^1/p ≤ 1. Now, by the proof of [4, Lemma 3.3], we have

θ[sgn(a)· |a|^p⁻¹+ sgn(b)· |b|^p⁻¹] [θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p

= θ[|a|^p⁻¹+|b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p

≤ θ[|a|^p⁻¹+|b|^p⁻¹] [θ(|a|^p+|b|^p)]^(p^−1)/p

= θ^1/p[|a|^p⁻¹+|b|^p⁻¹] [(|a|^p+|b|^p)]^(p^−1)/p

≤ 2^1/p for p > 1.

Therefore, 2− θ[|a|^p⁻¹+|b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p ≥ 2 − 2¹^p for p > 1. Squaring both sides then leads to the desired inequality.

Case (ii): By similar arguments as in case (i), we obtain 2− 2¹^p ≤ 2 − θ[|a|^p⁻¹+|b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p

≤ 2 + θ[|a|^p⁻¹+|b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p for p > 1, from which the result follows immediately.

Case (iii): Again, we suppose |a| ≥ |b| and therefore have 2¹^p ≥ θ[|a|^p⁻¹+|b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p

≥ θ[|a|^p⁻¹− |b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p for p > 1.

Thus, 2−2¹^p ≤ 2− θ[|a|^p⁻¹− |b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p for p > 1 and the desired result is also satisﬁed. 2

(16)

Proposition 3.2 Let ψ_α,θ,p be given as in (11). Then, for all x∈ IRⁿ and p > 1, ∇aψ_α,θ,p(x, F (x)) +∇bψ_α,θ,p(x, F (x)) ² ≥ 2(

2− 2¹^p)2

Ψ_θ,p(x) ∀θ ∈ (0, 1].

In particular, for all x belonging to any bounded set S and p > 1, ∇aψ_α,θ,p(x, F (x)) +∇bψ_α,θ,p(x, F (x)) ² ≥ 2(2− 2¹^p)⁴

(

αB²+ (2 + 2¹^p)²

)Ψα,θ,p(x) ∀θ ∈ (0, 1],

where B is deﬁned as in Proposition 3.1 and

∇aψ_α,θ,p(x, F (x)) :=

(

∇aψ_α,θ,p(x₁, F₁(x)), · · · , ∇aψ_α,θ,p(x_n, F_n(x)) )T

,

∇bψ_α,θ,p(x, F (x)) :=

(

∇bψ_α,θ,p(x₁, F₁(x)), · · · , ∇bψ_α,θ,p(x_n, F_n(x)) )T

. (35)

Proof. The second part of the conclusions is direct by Corollary 3.1 and the first part. Thus, it remains to show the first part. From the definitions of ∇aψ_α,θ,p(x, F (x)),

∇bψ_α,θ,p(x, F (x)) and Ψ_θ,p(x), showing the ﬁrst part is equivalent to proving that the following inequality

(∇aψα,θ,p(a, b) +∇bψα,θ,p(a, b))² ≥ 2(

2− 2¹^p)2

ψθ,p(a, b) (36) holds for all (a, b)∈ IR². When (a, b) = (0, 0), the inequality (36) clearly holds. Suppose (a, b)̸= (0, 0). Then, it follows from equation (14) that

(∇aψ_α,θ,p(a, b) +∇bψ_α,θ,p(a, b))²

= {

α(a + b)(ab)++ (ϕθ,p(a, b))

( θ[sgn(a)· |a|^p⁻¹+ sgn(b)· |b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 2 )}2

= α²(a + b)²(ab)²₊+ (ϕ_θ,p(a, b))²

( θ[sgn(a)· |a|^p⁻¹+ sgn(b)· |b|^p⁻¹]

[θ(|a|^p +|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 2 )2

+2α(a + b)(ab)₊(ϕ_θ,p(a, b))

( θ[sgn(a)· |a|^p⁻¹+ sgn(b)· |b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 2 )

. (37) Now, we claim that for all (a, b)̸= (0, 0) ∈ IR²,

2α(a + b)(ab)₊(ϕ_θ,p(a, b))

( θ[sgn(a)· |a|^p⁻¹+ sgn(b)· |b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 2 )

≥ 0. (38) If ab≤ 0, then (ab)+ = 0 and the inequality (38) is clear. If a, b > 0, then by the proof of Lemma 3.4, we have

( θ[sgn(a)· |a|^p⁻¹+ sgn(b)· |b|^p⁻¹]

[θ(|a|^p+|b|^p) + (1− θ)|a − b|^p)]^(p^−1)/p − 2 )

≤ 0, ∀(a, b) ̸= (0, 0) ∈ IR² (39)

(17)

and ϕ_θ,p(a, b)≤ 0, which imply the inequality (38) also holds. If a, b < 0, then ϕθ,p(a, b)≥ 0, which together with (39) yields the inequality (38). Thus, we obtain that the inequality (38) holds for all (a, b)̸= (0, 0). Now using Lemma 3.4 and equations (37)–(38), we readily obtain the inequality (36) holds for all (a, b)̸= (0, 0). The proof is thus complete. 2

4 Algorithm and Numerical Experiments

In this section, we investigate a derivative free algorithm for complementarity problems based on the new family of NCP-function and its related merit function. In addition, we prove the global convergence of the algorithm.

Algorithm 4.1 (A Derivative Free Algorithm)

Step 0 Given real numbers α > 0, p > 1, θ ∈ (0, 1] and x⁰ ∈ IRⁿ. Choose σ∈ (0, 1) and ρ, γ ∈ (0, 1). Set k := 0.

Step 1 If Ψ_α,θ,p(x^k) = 0, stop, otherwise go to Step 2.

Step 2 Find the smallest nonnegative integer m_k such that

Ψ_α,θ,p(x^k+ ρ^m^kd_k(γ^m^k))≤ (1 − σρ^2m^k)Ψ_α,θ,p(x^k), (40) where d_k(γ^m^k) :=−∂Ψ_α,θ,p(x^k, F (x^k))

∂b − γ^m^k∂Ψ_α,θ,p(x^k, F (x^k))

∂a .

Step 3 Set x^k+1 := x^k+ ρ^m^kdk(γ^m^k), k := k + 1 and go to Step 1.

Proposition 4.1 Let x^k∈ IRⁿand F be a monotone function. Then the search direction deﬁned in Algorithm 4.1 satisﬁes the descent condition ∇Ψα,θ,p(x^k)^Td_k < 0 as long as x^k is not a solution of the NCP. Moreover, if F is strongly monotone with modulus µ > 0, then ∇Ψα,θ,p(x^k)^Td_k<−µ∥dk∥².

Proof. The proof is similar to the one given in [5, Lemma 4.1]. 2

Proposition 4.2 Suppose that F is strongly monotone. Then the sequence {x^k} gener- ated by Algorithm 4.1 has at least one accumulation point and any accumulation point is a solution of the NCP.