A regularization semismooth Newton method based on the generalized Fischer–Burmeister function for $P_0$-NCPs


www.elsevier.com/locate/cam


Jein-Shan Chen^{a,∗,1}, Shaohua Pan^{b}

^a Department of Mathematics, National Taiwan Normal University, Taipei 11677, Taiwan

^b School of Mathematical Sciences, South China University of Technology, Guangzhou 510641, China

Received 1 June 2007; received in revised form 29 August 2007

Abstract

We consider a regularization method for nonlinear complementarity problems with $F$ being a $P_0$-function, which replaces the original problem with a sequence of regularized complementarity problems. In this paper, this sequence of regularized complementarity problems is solved approximately by applying the generalized Newton method to an equivalent augmented system of equations, constructed by the generalized Fischer–Burmeister (FB) NCP-functions $\phi_p$ with $p > 1$. We test the performance of the regularization semismooth Newton method based on this family of NCP-functions by solving all test problems from MCPLIB. Numerical experiments indicate that the method associated with a smaller $p$, for example $p \in [1.1, 2]$, usually has better numerical performance, and that the generalized FB functions $\phi_p$ with $p \in [1.1, 2)$ can be used as substitutes for the FB function $\phi_2$.

Keywords: Nonlinear complementarity problem (NCP); Generalized Fischer–Burmeister function; P0-function; Semismooth Newton method

1. Introduction

The nonlinear complementarity problem (NCP) is to find a point $x \in \mathbb{R}^n$ such that

$$x \ge 0, \quad F(x) \ge 0, \quad \langle x, F(x) \rangle = 0, \tag{1}$$

where $\langle \cdot, \cdot \rangle$ is the Euclidean inner product and $F = (F_1, F_2, \ldots, F_n)^T$ is a map from $\mathbb{R}^n$ to $\mathbb{R}^n$. We assume that $F$ is continuously differentiable throughout this paper. The NCP has attracted much attention due to its various applications in operations research, economics, and engineering [12,17,24]. Many methods have been proposed for solving the NCP, including merit function approaches [16,21,23,33], nonsmooth Newton methods [11,22,34], smoothing methods [5,18,27,32] and regularization methods [9,19,29,30]. All the aforementioned methods usually exploit so-called NCP-functions, defined as follows.

Definition 1.1. A function $\phi : \mathbb{R}^2 \to \mathbb{R}$ is called an NCP-function (or C-function, standing for complementarity function; see [10]) if it satisfies

$$\phi(a, b) = 0 \iff a \ge 0, \ b \ge 0, \ ab = 0. \tag{2}$$

∗ Corresponding author.

E-mail addresses: [email protected] (J. Chen), [email protected] (S. Pan).

¹ Member of Mathematics Division, National Center for Theoretical Sciences, Taipei Office.

doi:10.1016/j.cam.2007.08.020


Over the past two decades, a variety of NCP-functions have been studied; see [15,31] and references therein. Among them, a popular and intensively studied one is the well-known Fischer–Burmeister (FB) NCP-function [13,14], defined as

$$\phi_{FB}(a, b) = \sqrt{a^2 + b^2} - (a + b). \tag{3}$$

Since $\phi_{FB}$ satisfies (2), the NCP is equivalent to the system of nonsmooth equations

$$\Phi_{FB}(x) := \begin{pmatrix} \phi_{FB}(x_1, F_1(x)) \\ \vdots \\ \phi_{FB}(x_n, F_n(x)) \end{pmatrix} = 0. \tag{4}$$

Then we have the merit function $\Psi_{FB} : \mathbb{R}^n \to \mathbb{R}_+$ for the NCP, defined by

$$\Psi_{FB}(x) := \frac{1}{2}\|\Phi_{FB}(x)\|^2 = \frac{1}{2}\sum_{i=1}^{n} \phi_{FB}(x_i, F_i(x))^2. \tag{5}$$

Recently, a family of new NCP-functions based on the FB function (3) was studied in [2,6]. In particular, they define $\phi_p : \mathbb{R}^2 \to \mathbb{R}$ by

$$\phi_p(a, b) := \|(a, b)\|_p - (a + b), \tag{6}$$

where $p$ is any fixed real number in the interval $(1, +\infty)$ and $\|(a, b)\|_p$ denotes the $p$-norm of $(a, b)$, namely, $\|(a, b)\|_p = \sqrt[p]{|a|^p + |b|^p}$. In other words, in the function $\phi_p$, we replace the Euclidean norm of $(a, b)$ in the FB function (3) by a more general $p$-norm with $p \in (1, +\infty)$. Similarly, the NCP is equivalent to the nonsmooth system

$$\Phi_p(x) := \begin{pmatrix} \phi_p(x_1, F_1(x)) \\ \vdots \\ \phi_p(x_n, F_n(x)) \end{pmatrix} = 0, \tag{7}$$

which induces a family of merit functions $\Psi_p : \mathbb{R}^n \to \mathbb{R}$ for the NCP:

$$\Psi_p(x) := \frac{1}{2}\|\Phi_p(x)\|^2 = \frac{1}{2}\sum_{i=1}^{n} \phi_p(x_i, F_i(x))^2. \tag{8}$$
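The definitions (6)–(8) translate directly into a few lines of code. The following is a minimal sketch, assuming NumPy; the function names and the small linear test map $F(x) = Mx + q$ are illustrative choices and do not come from the paper.

```python
import numpy as np

def phi_p(a, b, p=2.0):
    """Generalized Fischer-Burmeister function (6): ||(a, b)||_p - (a + b)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return (np.abs(a) ** p + np.abs(b) ** p) ** (1.0 / p) - (a + b)

def Phi_p(x, F, p=2.0):
    """Componentwise operator (7): the i-th entry is phi_p(x_i, F_i(x))."""
    x = np.asarray(x, dtype=float)
    return phi_p(x, F(x), p)

def Psi_p(x, F, p=2.0):
    """Merit function (8): 0.5 * ||Phi_p(x)||^2."""
    r = Phi_p(x, F, p)
    return 0.5 * float(r @ r)

# Hypothetical toy problem (not from the paper): F(x) = M x + q.
# Its NCP solution is x* = (0, 1), where F(x*) = (2, 0).
M = np.array([[2.0, 1.0], [1.0, 2.0]])
q = np.array([1.0, -2.0])
F = lambda x: M @ x + q
x_star = np.array([0.0, 1.0])
```

Under these assumptions, `Psi_p(x_star, F, p)` vanishes up to rounding for every `p > 1`, while it is strictly positive at points that violate complementarity.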

As seen in [6], the merit function $\Psi_p$ for any given $p > 1$ enjoys all the favorable properties that the FB merit function $\Psi_{FB}$ has. Moreover, numerical experiments there indicate that the descent method based on the merit function $\Psi_p$ performs better as $p$ decreases in $(1, +\infty)$. However, it is still unknown whether this phenomenon occurs in other approaches for the NCP. The main purpose of this paper is to investigate how the generalized FB NCP-functions $\phi_p$ with $p \in (1, +\infty)$ behave in a regularization semismooth Newton method for solving the NCP.

It is well known that the regularization approach is designed to handle ill-posed problems: it substitutes the solution of the original problem with the solutions of a sequence of well-posed problems whose solutions converge to a solution of the original problem; see [4,3,9,19,30] and references therein. In the context of complementarity problems, if we consider the so-called Tikhonov regularization, this scheme consists of solving a sequence of complementarity problems NCP($F_\varepsilon$):

$$x \ge 0, \quad F_\varepsilon(x) \ge 0, \quad \langle x, F_\varepsilon(x) \rangle = 0, \tag{9}$$

where $\varepsilon > 0$ is a parameter tending to zero and $F_\varepsilon$ is given by

$$F_\varepsilon(x) := F(x) + \varepsilon x. \tag{10}$$


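Let $F_{\varepsilon,i}(x)$ denote the $i$th component of $F_\varepsilon(x)$ and define the map $\Phi_{p,\varepsilon} : \mathbb{R}^n \to \mathbb{R}^n$ by

$$\Phi_{p,\varepsilon}(x) := \begin{pmatrix} \phi_p(x_1, F_{\varepsilon,1}(x)) \\ \vdots \\ \phi_p(x_n, F_{\varepsilon,n}(x)) \end{pmatrix}. \tag{11}$$

Then the regularized problem NCP($F_\varepsilon$) for any given $\varepsilon > 0$ can be reformulated as

$$\Phi_{p,\varepsilon}(x) = 0,$$

which leads to a merit function $\Psi_{p,\varepsilon} : \mathbb{R}^n \to \mathbb{R}_+$ for the NCP($F_\varepsilon$):

$$\Psi_{p,\varepsilon}(x) := \frac{1}{2}\|\Phi_{p,\varepsilon}(x)\|^2 = \frac{1}{2}\sum_{i=1}^{n} \phi_p(x_i, F_{\varepsilon,i}(x))^2. \tag{12}$$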

Therefore, solving the original NCP is actually equivalent to solving a sequence of nonsmooth systems of equations $\Phi_{p,\varepsilon}(x) = 0$ with $\varepsilon$ approaching 0. From this, we see that the parameter $\varepsilon$ plays the same role as the smoothing parameter in smoothing methods for the NCP, except that $\varepsilon$ is imposed on the mapping $F$ instead of the NCP-function $\phi_p$.

In this paper, the sequence of subproblems $\Phi_{p,\varepsilon}(x) = 0$ with $\varepsilon$ tending to 0 will be solved approximately by applying the generalized Newton method to an augmented system of equations equivalent to the NCP. Specifically, we let $z := (\varepsilon, x) \in \mathbb{R}_+ \times \mathbb{R}^n$ by viewing $\varepsilon$ as a variable, and define the mapping $H_p : \mathbb{R}_+ \times \mathbb{R}^n \to \mathbb{R}^{n+1}$ by

$$H_p(z) := \begin{bmatrix} \varepsilon \\ \phi_p(x_1, F_{\varepsilon,1}(x)) \\ \vdots \\ \phi_p(x_n, F_{\varepsilon,n}(x)) \end{bmatrix}. \tag{13}$$

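Notice that if the function $\Phi_{p,\varepsilon}(x)$ defined by (11) is viewed as a function of $\varepsilon$ and $x$, then we may denote it as $\Phi_p(z) := \Phi_p(\varepsilon, x) = \Phi_{p,\varepsilon}(x)$. Hence, (13) is the same as

$$H_p(z) = \begin{pmatrix} \varepsilon \\ \Phi_p(z) \end{pmatrix}.$$

It is easily verified that the NCP is equivalent to the augmented system of equations

$$H_p(z) = H_p(\varepsilon, x) = 0, \tag{14}$$

which naturally induces a merit function $G_p : \mathbb{R}^{n+1} \to \mathbb{R}_+$ given by

$$G_p(z) = \frac{1}{2}\|H_p(z)\|^2 = \frac{1}{2}\left(\varepsilon^2 + \|\Phi_{p,\varepsilon}(x)\|^2\right) = \frac{1}{2}\varepsilon^2 + \Psi_p(z), \tag{15}$$

where $\Psi_p(z) := \Psi_p(\varepsilon, x) = \Psi_{p,\varepsilon}(x)$.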
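To make the augmented system concrete, the sketch below evaluates the map in (13) and the merit function in (15) for a user-supplied $F$. It assumes NumPy, stores $z = (\varepsilon, x)$ as a flat array with $\varepsilon$ in position 0, and uses illustrative function names that are not part of the paper.

```python
import numpy as np

def phi_p(a, b, p):
    """Generalized FB function (6), applied componentwise."""
    return (np.abs(a) ** p + np.abs(b) ** p) ** (1.0 / p) - (a + b)

def H_p(z, F, p=2.0):
    """Augmented map (13): first entry eps, then phi_p(x_i, F_i(x) + eps * x_i)."""
    z = np.asarray(z, dtype=float)
    eps, x = z[0], z[1:]
    F_eps = F(x) + eps * x          # Tikhonov-regularized map (10)
    return np.concatenate(([eps], phi_p(x, F_eps, p)))

def G_p(z, F, p=2.0):
    """Merit function (15): 0.5 * ||H_p(z)||^2."""
    h = H_p(z, F, p)
    return 0.5 * float(h @ h)
```

With this layout, a zero of `H_p` necessarily has `eps == 0`, so its remaining entries solve the unregularized system (7).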

The function $H_p$ is locally Lipschitz continuous since $\phi_p$ is locally Lipschitz continuous (see [6]). Furthermore, as shown in Section 3, $H_p$ is semismooth. Hence, we apply the generalized Newton method developed in [26,28] to (14), and establish a regularized semismooth Newton-type algorithm which in each step solves a regularized problem NCP($F_\varepsilon$) approximately. Compared with the semismooth Newton method based on (7), the method has a remarkable advantage in handling $P_0$-NCPs (see Section 4), since the merit function $\Psi_{p,\varepsilon}(x)$ has bounded level sets for such NCPs. We examine the numerical performance of the algorithm by applying it to all test problems from MCPLIB with three specific NCP-functions $\phi_{1.1}$, $\phi_2$ and $\phi_5$. Numerical results indicate that the method associated with a smaller $p$, for example $p \in [1.1, 2]$, usually has better numerical performance, and that the generalized FB functions $\phi_p$ with $p \in [1.1, 2)$ can be used as substitutes for the FB function $\phi_2$.

Throughout this paper, $\mathbb{R}_+$ and $\mathbb{R}_{++}$ denote the set of nonnegative real numbers and the set of positive real numbers, respectively; $\mathbb{R}^n$ represents the space of $n$-dimensional real column vectors; and $^T$ denotes the transpose. For any differentiable function $f : \mathbb{R}^n \to \mathbb{R}$, $\nabla f(x)$ denotes the gradient of $f$ at $x$. For any differentiable mapping $F = (F_1, \ldots, F_m)^T : \mathbb{R}^n \to \mathbb{R}^m$, $F'(x)$ means the Jacobian matrix of $F$ at $x$, while $\nabla F(x) = [\nabla F_1(x) \ \cdots \ \nabla F_m(x)]$ denotes the transposed Jacobian of $F$ at $x$. If $W$ is an $n \times n$ matrix with entries $W_{jk}$, $j, k = 1, 2, \ldots, n$, and $J$ and $K$ are index sets such that $J, K \subseteq \{1, 2, \ldots, n\}$, we denote by $W_{JK}$ the $|J| \times |K|$ submatrix of $W$ consisting of the entries $W_{jk}$, $j \in J$, $k \in K$. We denote by $\|x\|_p$ the $p$-norm of $x$ and by $\|x\|$ the Euclidean norm of $x$. In addition, unless otherwise stated, we always assume that $p$ is any fixed real number in $(1, +\infty)$ and denote by $S^*$ the solution set of the NCP if it is nonempty.

2. Preliminaries

In this section, we recall some background concepts and materials which will be used in the subsequent analysis.

We start with the definitions of $P$-matrix and $P_0$-matrix.

Definition 2.1. A matrix $M \in \mathbb{R}^{n \times n}$ is a

(a) $P_0$-matrix if each of its principal minors is nonnegative;

(b) $P$-matrix if each of its principal minors is positive.

Clearly, a positive semidefinite matrix is a $P_0$-matrix, a positive definite matrix is a $P$-matrix, and every $P$-matrix is also a $P_0$-matrix. For more properties of $P$-matrices and $P_0$-matrices, please refer to [8]. The two concepts can be extended to nonlinear mappings.

Definition 2.2. A mapping $F : \mathbb{R}^n \to \mathbb{R}^n$ is a

(a) monotone function if $\langle x - y, F(x) - F(y) \rangle \ge 0$ for all $x, y \in \mathbb{R}^n$;

(b) $P_0$-function if $\max_{1 \le i \le n,\ x_i \ne y_i} (x_i - y_i)(F_i(x) - F_i(y)) \ge 0$ for all $x, y \in \mathbb{R}^n$ and $x \ne y$;

(c) $P$-function if $\max_{1 \le i \le n} (x_i - y_i)(F_i(x) - F_i(y)) > 0$ for all $x, y \in \mathbb{R}^n$ and $x \ne y$;

(d) uniform $P$-function with modulus $\mu > 0$ if $\max_{1 \le i \le n} (x_i - y_i)(F_i(x) - F_i(y)) \ge \mu \|x - y\|^2$ for all $x, y \in \mathbb{R}^n$.

From the above definitions, it is obvious that $F$ is a $P_0$-function if $F$ is monotone, and that the Jacobian matrix of every continuously differentiable $P_0$-function is a $P_0$-matrix. The following lemma states that the mapping $F_\varepsilon$ is a $P$-function if $F$ is a $P_0$-function.

Lemma 2.1 (Facchinei and Kanzow [9, Lemma 3.2]). For any $\varepsilon > 0$, let $F_\varepsilon : \mathbb{R}^n \to \mathbb{R}^n$ be given by (10). If $F$ is a $P_0$-function, then the Jacobian matrices $F'_\varepsilon(x)$ for all $x \in \mathbb{R}^n$ are $P$-matrices. In particular, the function $F_\varepsilon$ is a $P$-function.
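For small matrices, Definition 2.1 can be checked by brute force, which also illustrates the linear case of Lemma 2.1: shifting a $P_0$-matrix by $\varepsilon I$ gives a $P$-matrix. The sketch below assumes NumPy, enumerates all principal minors (exponential cost, so it is only suitable for tiny $n$), and uses a tolerance `tol` that is an implementation choice rather than part of the definition.

```python
import itertools
import numpy as np

def principal_minors(M):
    """Yield the determinant of every principal submatrix of M."""
    M = np.asarray(M, dtype=float)
    n = M.shape[0]
    for size in range(1, n + 1):
        for idx in itertools.combinations(range(n), size):
            yield np.linalg.det(M[np.ix_(idx, idx)])

def is_P0_matrix(M, tol=1e-12):
    """All principal minors nonnegative (up to tol)."""
    return all(m >= -tol for m in principal_minors(M))

def is_P_matrix(M, tol=1e-12):
    """All principal minors strictly positive (beyond tol)."""
    return all(m > tol for m in principal_minors(M))

# Illustrative matrix: P_0 but not P, since every principal minor is zero.
M0 = np.array([[0.0, 1.0], [0.0, 0.0]])
# Jacobian of the regularized linear map x -> M0 x + eps * x with eps = 0.1.
M_eps = M0 + 0.1 * np.eye(2)
```

Here `M0` passes `is_P0_matrix` but not `is_P_matrix`, whereas `M_eps` passes both.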

Next, we review some favorable properties of $\phi_p$. The proofs of Property 2.1 can be found in [2, Proposition 3.1] and [25, Lemmas 2.1 and 2.2], whereas the proof of Property 2.2 is given in [25, Lemma 3.1].

Property 2.1. Let $\phi_p : \mathbb{R}^2 \to \mathbb{R}$ be defined as in (6). Then, the following results hold.

(a) $\phi_p$ is an NCP-function.

(b) $\phi_p$ is Lipschitz continuous with the Lipschitz constant $L$ given by $L = \sqrt{2} + 2^{(1/p - 1/2)}$ when $1 < p < 2$ and $L = 1 + \sqrt{2}$ when $p \ge 2$.

(c) $\phi_p$ is strongly semismooth.

(d) Given any point $(a, b) \in \mathbb{R}^2$, each element in the generalized gradient $\partial\phi_p(a, b)$ has the representation $(\xi - 1, \zeta - 1)$, where

$$\xi = \frac{\operatorname{sgn}(a) \cdot |a|^{p-1}}{\|(a, b)\|_p^{p-1}} \quad \text{and} \quad \zeta = \frac{\operatorname{sgn}(b) \cdot |b|^{p-1}}{\|(a, b)\|_p^{p-1}} \quad \text{if } (a, b) \ne (0, 0),$$

and otherwise $(\xi, \zeta) \in \mathbb{R}^2$ denotes an arbitrary vector satisfying $|\xi|^{p/(p-1)} + |\zeta|^{p/(p-1)} \le 1$.

(e) If $\{(a^k, b^k)\} \subseteq \mathbb{R}^2$ with $a^k \to -\infty$, or $b^k \to -\infty$, or $a^k \to +\infty$ and $b^k \to +\infty$, then we have $|\phi_p(a^k, b^k)| \to +\infty$ for $k \to +\infty$.
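The representation in Property 2.1(d) can be sanity-checked against finite differences at points where $\phi_p$ is differentiable. The following sketch assumes NumPy; at the origin it returns the element corresponding to $(\xi, \zeta) = (0, 0)$, which is only one admissible choice among many.

```python
import numpy as np

def phi_p(a, b, p):
    """Generalized FB function (6) for scalars a and b."""
    return (abs(a) ** p + abs(b) ** p) ** (1.0 / p) - (a + b)

def grad_phi_p(a, b, p):
    """One element (xi - 1, zeta - 1) of the generalized gradient of phi_p."""
    if a == 0.0 and b == 0.0:
        # Any (xi, zeta) in the dual-norm unit ball is admissible here;
        # (0, 0) is picked purely for illustration.
        return -1.0, -1.0
    norm = (abs(a) ** p + abs(b) ** p) ** (1.0 / p)
    xi = np.sign(a) * abs(a) ** (p - 1) / norm ** (p - 1)
    zeta = np.sign(b) * abs(b) ** (p - 1) / norm ** (p - 1)
    return xi - 1.0, zeta - 1.0

def finite_difference(a, b, p, h=1e-6):
    """Central-difference approximation of the gradient of phi_p."""
    da = (phi_p(a + h, b, p) - phi_p(a - h, b, p)) / (2 * h)
    db = (phi_p(a, b + h, p) - phi_p(a, b - h, p)) / (2 * h)
    return da, db
```

Away from the origin the two functions should agree up to the finite-difference error, for any `p > 1`.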

Property 2.2. Let $\phi_p : \mathbb{R}^2 \to \mathbb{R}$ be defined as in (6). Then, there exist two positive constants $c_1 > 0$ and $c_2 > 0$ such that $c_1 |\min\{a, b\}| \le |\phi_p(a, b)| \le c_2 |\min\{a, b\}|$.

The semismooth property is very important from a computational point of view. In particular, it plays a fundamental role in the superlinear convergence analysis of generalized Newton methods [26,28]. If the mapping $G : \mathbb{R}^n \to \mathbb{R}^m$ is locally Lipschitz continuous, then $G$ is almost everywhere differentiable by Rademacher's theorem (see [7]). In this case, the generalized Jacobian $\partial G(x)$ of $G$ at $x$ (in the Clarke sense) can be defined as the convex hull of the B-subdifferential $\partial_B G(x)$, where

$$\partial_B G(x) := \{ V \in \mathbb{R}^{m \times n} \mid \exists \{x^k\} \subseteq D_G : \{x^k\} \to x \text{ and } G'(x^k) \to V \}$$

with $D_G$ denoting the set of points at which $G$ is differentiable. Assume that $G : \mathbb{R}^n \to \mathbb{R}^m$ is locally Lipschitz continuous. $G$ is called semismooth at $x$ if $G$ is directionally differentiable at $x$ and for any $V \in \partial G(x + h)$ and $h \to 0$,

$$G(x + h) - G(x) - V h = o(\|h\|);$$

$G$ is called strongly semismooth at $x$ if $G$ is semismooth at $x$ and for any $V \in \partial G(x + h)$ and $h \to 0$,

$$G(x + h) - G(x) - V h = O(\|h\|^2); \tag{16}$$

$G$ is called a (strongly) semismooth function if it is (strongly) semismooth everywhere.
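As a small illustration of (16), consider $\phi_p$ at its only nonsmooth point, the origin. Because $\phi_p$ is positively homogeneous of degree one, Euler's identity gives $\phi_p(h) = \nabla\phi_p(h)^T h$ for $h \ne 0$, so the semismoothness residual at the origin is exactly zero. The sketch below, assuming NumPy and using illustrative function names, evaluates that residual numerically.

```python
import numpy as np

def phi_p(a, b, p):
    """Generalized FB function (6) for scalars a and b."""
    return (abs(a) ** p + abs(b) ** p) ** (1.0 / p) - (a + b)

def grad_phi_p(a, b, p):
    """Gradient of phi_p at a point (a, b) different from the origin."""
    norm = (abs(a) ** p + abs(b) ** p) ** (1.0 / p)
    xi = np.sign(a) * abs(a) ** (p - 1) / norm ** (p - 1)
    zeta = np.sign(b) * abs(b) ** (p - 1) / norm ** (p - 1)
    return np.array([xi - 1.0, zeta - 1.0])

def semismooth_residual(h, p):
    """|phi_p(0 + h) - phi_p(0) - V h| with V the gradient taken at h."""
    h = np.asarray(h, dtype=float)
    V = grad_phi_p(h[0], h[1], p)
    return abs(phi_p(h[0], h[1], p) - phi_p(0.0, 0.0, p) - V @ h)
```

The residual should be at the level of rounding error for every nonzero `h`, which is consistent with, though of course not a proof of, Property 2.1(c).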

3. Properties of $H_p(z)$ and $G_p(z)$

In this section, we study the semismoothness of the mapping $H_p$ and characterize its generalized Jacobian matrix at any point $z$. In particular, we also give a sufficient condition for the nonsingularity of all generalized Jacobians at a solution of (14). Then, we investigate some favorable properties of the merit function $G_p(z)$ which are crucial to the convergence analysis of the regularized semismooth Newton algorithm described in the next section.

Proposition 3.1. The mapping $H_p : \mathbb{R}_+ \times \mathbb{R}^n \to \mathbb{R}^{n+1}$ defined as in (13) is semismooth. Moreover, it is strongly semismooth if $F'$ is locally Lipschitz continuous.

Proof. Since a function is (strongly) semismooth if and only if its component functions are (strongly) semismooth, to prove that $H_p$ is (strongly) semismooth we only need to prove that $H_{p,i}$, $i = 1, 2, \ldots, n + 1$, are (strongly) semismooth. Clearly, $H_{p,1}$ is strongly semismooth by formula (16) since $H_{p,1}(z) = \varepsilon$. For $H_{p,i}$, $i = 2, 3, \ldots, n + 1$, since $\phi_p$ is strongly semismooth by Property 2.1(c) and the composite of two (strongly) semismooth functions is (strongly) semismooth by [14, Theorem 19], we conclude that $H_{p,i}$, $i = 2, 3, \ldots, n + 1$, are semismooth. If $F'$ is locally Lipschitz continuous, then $F_\varepsilon$ is strongly semismooth, and consequently, $H_{p,i}$, $i = 2, 3, \ldots, n + 1$, are strongly semismooth.

We next give an estimate of the generalized Jacobian of $H_p$ by means of Property 2.1(d).

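Proposition 3.2. For any $z = (\varepsilon, x) \in \mathbb{R}_+ \times \mathbb{R}^n$, we have

$$(\partial H_p(z))^T \subseteq \begin{pmatrix} 1 & x^T (B(z) - I) \\ 0 & (A(z) - I) + (\nabla F(x) + \varepsilon I)(B(z) - I) \end{pmatrix}, \tag{17}$$

where $A(z)$ and $B(z)$ are possibly multi-valued $n \times n$ diagonal matrices with $i$th diagonal elements $A_{ii}(z)$ and $B_{ii}(z)$ given by

$$A_{ii}(z) = \frac{\operatorname{sgn}(x_i) \cdot |x_i|^{p-1}}{\|(x_i, F_{\varepsilon,i}(x))\|_p^{p-1}}, \quad B_{ii}(z) = \frac{\operatorname{sgn}(F_{\varepsilon,i}(x)) \cdot |F_{\varepsilon,i}(x)|^{p-1}}{\|(x_i, F_{\varepsilon,i}(x))\|_p^{p-1}}$$

if $(x_i, F_{\varepsilon,i}(x)) \ne (0, 0)$; and otherwise given by

$$A_{ii}(z) = \xi_i, \quad B_{ii}(z) = \zeta_i \quad \text{for any } (\xi_i, \zeta_i) \text{ such that } |\xi_i|^{p/(p-1)} + |\zeta_i|^{p/(p-1)} \le 1.$$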

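Proof. By the known rules on the evaluation of the generalized Jacobian (see [7, Proposition 2.6.2(e)]), we have

$$\partial H_p(z)^T \subseteq \partial H_{p,1}(z) \times \partial H_{p,2}(z) \times \cdots \times \partial H_{p,n+1}(z),$$

where the right-hand side denotes a set of matrices whose $i$th column belongs to $\partial H_{p,i}(z)$, and $H_{p,i}$ is the $i$th component function of $H_p$. Clearly,

$$\partial H_{p,1}(z) = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \in \mathbb{R}^{n+1}.$$

For $j = 2, 3, \ldots, n + 1$, letting $i = j - 1$ and applying Property 2.1(d) yields

$$\partial H_{p,j}(z) = \left( \frac{\operatorname{sgn}(x_i) \cdot |x_i|^{p-1}}{\|(x_i, F_{\varepsilon,i}(x))\|_p^{p-1}} - 1 \right) \begin{pmatrix} 0 \\ e_i \end{pmatrix} + \left( \frac{\operatorname{sgn}(F_{\varepsilon,i}(x)) \cdot |F_{\varepsilon,i}(x)|^{p-1}}{\|(x_i, F_{\varepsilon,i}(x))\|_p^{p-1}} - 1 \right) \begin{pmatrix} x_i \\ \nabla F_i(x) + \varepsilon e_i \end{pmatrix}$$

if $(x_i, F_{\varepsilon,i}(x)) \ne (0, 0)$; and otherwise

$$\partial H_{p,j}(z) = (\xi_i - 1) \begin{pmatrix} 0 \\ e_i \end{pmatrix} + (\zeta_i - 1) \begin{pmatrix} x_i \\ \nabla F_i(x) + \varepsilon e_i \end{pmatrix}$$

with $|\xi_i|^{p/(p-1)} + |\zeta_i|^{p/(p-1)} \le 1$, where $e_i$ denotes the vector whose $i$th element is 1 and whose other elements are 0. From these equalities, the conclusion easily follows.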
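The column-by-column description above can be assembled into one matrix. The sketch below, assuming NumPy, builds a single element $V$ of $\partial H_p(z)$ in the untransposed layout, with rows indexed by components of $H_p$ and columns by $(\varepsilon, x)$. The argument `JF` (a callable returning the Jacobian $F'(x)$) and the choice $(\xi_i, \zeta_i) = (0, 0)$ at degenerate indices are assumptions made for illustration.

```python
import numpy as np

def H_p(z, F, p):
    """Augmented map (13) for z = (eps, x) stored as a flat array."""
    eps, x = z[0], z[1:]
    Fe = F(x) + eps * x
    return np.concatenate(
        ([eps], (np.abs(x) ** p + np.abs(Fe) ** p) ** (1.0 / p) - (x + Fe)))

def jacobian_H_p(z, F, JF, p):
    """One element V of the generalized Jacobian of H_p at z."""
    eps, x = z[0], z[1:]
    n = x.size
    Fe = F(x) + eps * x
    norm = (np.abs(x) ** p + np.abs(Fe) ** p) ** (1.0 / p)
    safe = np.where(norm > 0.0, norm, 1.0)   # gives (xi, zeta) = (0, 0) at the origin
    xi = np.sign(x) * np.abs(x) ** (p - 1) / safe ** (p - 1)
    zeta = np.sign(Fe) * np.abs(Fe) ** (p - 1) / safe ** (p - 1)
    V = np.zeros((n + 1, n + 1))
    V[0, 0] = 1.0                            # derivative of the first component eps
    V[1:, 0] = (zeta - 1.0) * x              # derivative with respect to eps
    V[1:, 1:] = np.diag(xi - 1.0) + (zeta - 1.0)[:, None] * (JF(x) + eps * np.eye(n))
    return V
```

At points where every pair $(x_i, F_{\varepsilon,i}(x))$ is nonzero, $H_p$ is differentiable and `jacobian_H_p` should match a finite-difference Jacobian of `H_p`.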

Now, exploiting the estimate of $\partial H_p(z)$ given by (17), we may present a sufficient condition that guarantees the nonsingularity of all generalized Jacobians of $H_p$ at a solution $z^*$ of (14). This result is important for the superlinear (or quadratic) convergence of the semismooth Newton method (see [11]). Let $z^* = (\varepsilon^*, x^*) \in \mathbb{R}_+ \times \mathbb{R}^n$ be a solution of (14). Clearly, $\varepsilon^* = 0$ and $x^*$ is a solution of the NCP. For the sake of notation, let

$$\begin{aligned} I &:= \{ i \in \{1, 2, \ldots, n\} \mid x_i^* > 0, \ F_i(x^*) = 0 \}, \\ J &:= \{ i \in \{1, 2, \ldots, n\} \mid x_i^* = 0, \ F_i(x^*) = 0 \}, \\ K &:= \{ i \in \{1, 2, \ldots, n\} \mid x_i^* = 0, \ F_i(x^*) > 0 \}. \end{aligned}$$

By rearrangement we assume that $\nabla F(x^*)$ can be written as

$$\nabla F(x^*) = \begin{pmatrix} \nabla F_{II}(x^*) & \nabla F_{IJ}(x^*) & \nabla F_{IK}(x^*) \\ \nabla F_{JI}(x^*) & \nabla F_{JJ}(x^*) & \nabla F_{JK}(x^*) \\ \nabla F_{KI}(x^*) & \nabla F_{KJ}(x^*) & \nabla F_{KK}(x^*) \end{pmatrix}. \tag{18}$$

The NCP is called R-regular at $x^*$ if $\nabla F_{II}(x^*)$ is nonsingular and its Schur complement in the matrix

$$\begin{pmatrix} \nabla F_{II}(x^*) & \nabla F_{IJ}(x^*) \\ \nabla F_{JI}(x^*) & \nabla F_{JJ}(x^*) \end{pmatrix}$$

is a $P$-matrix.

Proposition 3.3. Suppose that $z^* = (\varepsilon^*, x^*) \in \mathbb{R}_+ \times \mathbb{R}^n$ is a solution of (14) and that the NCP is R-regular at $x^*$. Then all $V \in \partial H_p(z^*)$ are nonsingular.

Proof. From Proposition 3.2, it is easy to see that for any $V \in \partial H_p(z^*)^T$, there exist a vector $u(z^*) \in \mathbb{R}^n$ and a matrix $W(z^*) \in \mathbb{R}^{n \times n}$ such that

$$V = \begin{pmatrix} 1 & u(z^*)^T \\ 0 & W(z^*) \end{pmatrix},$$

where

$$W(z^*) = (A(z^*) - I) + (\nabla F(x^*) + \varepsilon^* I)(B(z^*) - I)$$

with $A(z^*)$ and $B(z^*)$ characterized as in Proposition 3.2. Therefore, proving that $V$ is nonsingular is equivalent to showing that $W(z^*)$ is nonsingular. Using the expression of $\nabla F(x^*)$ in (18) and noting that $\varepsilon^* = 0$, we can rewrite $W(z^*)$ in the partitioned form

$$W(z^*) = \begin{pmatrix} -\nabla F_{II} & \nabla F_{IJ}(B_{JJ} - I_{JJ}) & 0_{IK} \\ -\nabla F_{JI} & \nabla F_{JJ}(B_{JJ} - I_{JJ}) + (A_{JJ} - I_{JJ}) & 0_{JK} \\ -\nabla F_{KI} & \nabla F_{KJ}(B_{JJ} - I_{JJ}) & -I_{KK} \end{pmatrix},$$

where for convenience we omit the arguments $z^*$ and $x^*$. The rest of the proof is identical to that of [11, Proposition 3.2].

In what follows, we concentrate on the properties of $G_p$. First, applying [6, Proposition 3.2(c)] and Theorem 2.6.6 of [7], we immediately obtain the following conclusion.

Proposition 3.4. For any $\varepsilon \ge 0$, the function $\Psi_{p,\varepsilon}$ defined by (12) is continuously differentiable everywhere, and consequently, $G_p$ defined as in (15) is continuously differentiable everywhere and $\nabla G_p(z) = V^T H_p(z)$ for any $V \in \partial H_p(z)$.

Proposition 3.5. Suppose that $F$ is a $P_0$-function and $\hat\varepsilon$, $\tilde\varepsilon$ are two given positive numbers such that $\hat\varepsilon < \tilde\varepsilon$. Then, the merit function $G_p$ defined as in (15) has the property

$$\lim_{k \to +\infty} G_p(z^k) = +\infty$$

for any sequence $\{z^k = (\varepsilon^k, x^k)\}$ such that $\varepsilon^k \in [\hat\varepsilon, \tilde\varepsilon]$ and $\|x^k\| \to +\infty$.

Proof. We argue by contradiction. Suppose that $\lim_{k \to +\infty} G_p(z^k) \ne +\infty$. Then from (15) and (12) it follows that, passing to a subsequence if necessary, there exists an unbounded sequence $\{x^k\}$ such that $\{\Psi_{p,\varepsilon^k}(x^k)\}$ is bounded. Let

$$J := \{ i \in \{1, 2, \ldots, n\} \mid \{x_i^k\} \text{ is unbounded} \}.$$

Since $\{x^k\}$ is unbounded, we have $J \ne \emptyset$. Without loss of generality, we assume that $\{|x_j^k|\} \to \infty$ for any $j \in J$. Now, we define a bounded sequence $\{y^k\}$ by

$$y_i^k := \begin{cases} 0 & \text{if } i \in J, \\ x_i^k & \text{if } i \notin J. \end{cases}$$

From the definition of $\{y^k\}$ and $F$ being a $P_0$-function, we have

$$\begin{aligned} 0 &\le \max_{1 \le i \le n,\ x_i^k \ne y_i^k} (x_i^k - y_i^k)(F_i(x^k) - F_i(y^k)) \\ &= \max_{i \in J} x_i^k \cdot (F_i(x^k) - F_i(y^k)) \\ &= x_{j_0}^k \cdot (F_{j_0}(x^k) - F_{j_0}(y^k)), \end{aligned} \tag{19}$$

where $j_0$ is one of the indices for which the max is attained. Since $j_0 \in J$, we have $\{|x_{j_0}^k|\} \to +\infty$ as $k \to +\infty$. If $x_{j_0}^k \to -\infty$ as $k \to +\infty$, Property 2.1(e) immediately yields $|\phi_p(x_{j_0}^k, F_{\varepsilon^k, j_0}(x^k))| \to +\infty$. If $x_{j_0}^k \to +\infty$ as $k \to +\infty$, noting that $F_{j_0}(y^k)$ is bounded by the continuity of $F_{j_0}$, we have from (19) that $F_{j_0}(x^k)$ does not tend to $-\infty$, which in turn implies that $\{F_{j_0}(x^k) + \varepsilon^k x_{j_0}^k\} \to +\infty$. Applying Property 2.1(e) with $\{x_{j_0}^k\} \to +\infty$ and $\{F_{j_0}(x^k) + \varepsilon^k x_{j_0}^k\} \to +\infty$, we also obtain $|\phi_p(x_{j_0}^k, F_{\varepsilon^k, j_0}(x^k))| \to +\infty$. Thus, both cases yield $|\phi_p(x_{j_0}^k, F_{\varepsilon^k, j_0}(x^k))| \to +\infty$, which contradicts the boundedness of $\{\Psi_{p,\varepsilon^k}(x^k)\}$. Consequently, $\lim_{k \to +\infty} G_p(z^k) = +\infty$.


Remark 3.1. Proposition 3.5 implies that $\Psi_{p,\varepsilon}$ has bounded level sets under the assumption of $F$ being a $P_0$-function. However, from [6, Proposition 3.5], we know that a stronger condition (i.e., $F$ being a uniform $P$-function) is needed to guarantee that the level sets of $\Psi_p$ are bounded.

To close this section, we present two results which will be used to analyze the global convergence of the algorithm in the next section. The first result is extracted from Theorem 5.4 of [9], while the second result can be obtained by using Property 2.2 and following the same arguments as in [30, Proposition 2.2].

Proposition 3.6. Suppose that $F$ is a $P_0$-function and the solution set $S^*$ of the NCP is nonempty and bounded. Suppose that $\{\varepsilon^k\}$ and $\{\eta^k\}$ are two infinite sequences such that for each $k \ge 0$, $\varepsilon^k > 0$, $\eta^k \ge 0$, with $\lim_{k \to +\infty} \varepsilon^k = 0$ and $\lim_{k \to +\infty} \eta^k = 0$. For each $k \ge 0$, let $x^k \in \mathbb{R}^n$ satisfy $\Psi_p(\varepsilon^k, x^k) \le \eta^k$. Then $\{x^k\}$ remains bounded and every accumulation point of $\{x^k\}$ is a solution of the NCP.

Proposition 3.7. Suppose that $F$ is a monotone function and the solution set $S^*$ of the NCP is nonempty. Suppose that $\{\varepsilon^k\}$ and $\{\eta^k\}$ are two infinite sequences such that for each $k \ge 0$, $\varepsilon^k > 0$, $\eta^k \ge 0$, $\eta^k \le C\varepsilon^k$ and $\lim_{k \to +\infty} \varepsilon^k = 0$, where $C > 0$ is a constant. For each $k \ge 0$, let $x^k \in \mathbb{R}^n$ satisfy $\Psi_p(\varepsilon^k, x^k) \le \eta^k$. Suppose that $x^* = \arg\min_{x \in S^*} \|x\|$ and $F$ is Lipschitz continuous. Then $\{x^k\}$ remains bounded and every accumulation point of $\{x^k\}$ is a solution of the NCP.

4. Regularization semismooth Newton method

From the discussion in the last section, we see that $H_p(z)$ and $G_p(z)$ for all $p > 1$ enjoy the same desirable properties. Sun [30] used $H_2(z)$ and $G_2(z)$ to develop a regularization semismooth Newton method for the NCP. In this section, we develop a regularization semismooth Newton algorithm based on any $H_p(z)$ and $G_p(z)$ with $p > 1$. This algorithm is guaranteed to solve $P_0$-complementarity problems due to Proposition 3.5.

Now we are ready to describe the algorithm. We adopt almost the same notation as in [30]. Choose $\bar\varepsilon \in (0, +\infty)$ and $\gamma \in (0, 1)$ such that $\gamma\bar\varepsilon < 1$. Let $t \in [1/2, 1]$ and $\bar z := (\bar\varepsilon, 0) \in \mathbb{R}_{++} \times \mathbb{R}^n$. Define $\beta : \mathbb{R}_+ \times \mathbb{R}^n \to \mathbb{R}_+$ by

$$\beta(z) := \gamma \min\{1, G_p(z)^t\}. \tag{20}$$

We also denote

$$\Omega := \{ z = (\varepsilon, x) \in \mathbb{R}_+ \times \mathbb{R}^n \mid \varepsilon \ge \beta(z)\bar\varepsilon \}. \tag{21}$$

Note that $\beta(z) \le \gamma$ for any $z \in \mathbb{R}_+ \times \mathbb{R}^n$ by (20). Hence, $(\bar\varepsilon, x) \in \Omega$ for any $x \in \mathbb{R}^n$. In addition, by the definition of $\beta(z)$, it is easily shown that the following relation holds.

Proposition 4.1. Let $H_p$ and $\beta$ be defined as in (13) and (20), respectively. Then, $H_p(z) = 0 \iff \beta(z) = 0 \iff H_p(z) = \beta(z)\bar z$.

Algorithm 4.1 (The Regularization Newton Algorithm).

(Step 0) Given any $p > 1$, choose constants $\delta \in (0, 1)$, $t \in [1/2, 1]$ and $\sigma \in (0, 1/2)$. Let $\varepsilon^0 := \bar\varepsilon$ and let $x^0 \in \mathbb{R}^n$ be an arbitrary point. Set $k := 0$.

(Step 1) If $H_p(z^k) = 0$, then stop. Otherwise, let $\beta_k := \beta(z^k) = \gamma \min\{1, G_p(z^k)^t\}$.

(Step 2) Choose $V_k \in \partial H_p(z^k)$ and compute $\Delta z^k = (\Delta\varepsilon^k, \Delta x^k) \in \mathbb{R} \times \mathbb{R}^n$ by

$$H_p(z^k) + V_k \Delta z^k = \beta_k \bar z. \tag{22}$$

(Step 3) Let $l_k$ be the smallest nonnegative integer $l$ such that

$$G_p(z^k + \delta^l \Delta z^k) \le [1 - 2\sigma(1 - \gamma\bar\varepsilon)\delta^l] G_p(z^k). \tag{23}$$

Set $z^{k+1} := z^k + \delta^{l_k} \Delta z^k$.

(Step 4) Set $k := k + 1$ and go to Step 1.
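The steps of Algorithm 4.1 can be sketched in code as follows. This is a hedged illustration assuming NumPy, not the authors' MATLAB implementation: the exact test $H_p(z^k) = 0$ is replaced by a tolerance `tol`, an iteration cap `max_iter` and a minimal step are added as safeguards, $(\xi_i, \zeta_i) = (0, 0)$ is chosen at degenerate indices, and the argument `JF` (returning the Jacobian $F'(x)$) as well as the small linear test problem are hypothetical.

```python
import numpy as np

def regularized_newton(F, JF, x0, p=2.0, eps_bar=0.1, gamma=0.5, t=0.5,
                       sigma=1e-4, delta=0.5, tol=1e-10, max_iter=200):
    """Illustrative version of Algorithm 4.1; returns (x, eps)."""
    x0 = np.asarray(x0, dtype=float)
    n = x0.size
    z = np.concatenate(([eps_bar], x0))               # Step 0: z^0 = (eps_bar, x^0)
    z_bar = np.concatenate(([eps_bar], np.zeros(n)))

    def split(z):
        eps, x = z[0], z[1:]
        Fe = F(x) + eps * x                           # F_eps(x), see (10)
        norm = (np.abs(x) ** p + np.abs(Fe) ** p) ** (1.0 / p)
        return eps, x, Fe, norm

    def H(z):                                         # augmented map (13)
        eps, x, Fe, norm = split(z)
        return np.concatenate(([eps], norm - (x + Fe)))

    def G(z):                                         # merit function (15)
        h = H(z)
        return 0.5 * float(h @ h)

    def jac(z):                                       # one element of the generalized Jacobian
        eps, x, Fe, norm = split(z)
        safe = np.where(norm > 0.0, norm, 1.0)        # (xi, zeta) = (0, 0) at the origin
        xi = np.sign(x) * np.abs(x) ** (p - 1) / safe ** (p - 1)
        zeta = np.sign(Fe) * np.abs(Fe) ** (p - 1) / safe ** (p - 1)
        V = np.zeros((n + 1, n + 1))
        V[0, 0] = 1.0
        V[1:, 0] = (zeta - 1.0) * x
        V[1:, 1:] = np.diag(xi - 1.0) + (zeta - 1.0)[:, None] * (JF(x) + eps * np.eye(n))
        return V

    for _ in range(max_iter):
        h = H(z)
        if np.linalg.norm(h) <= tol:                  # Step 1, with a tolerance
            break
        g = 0.5 * float(h @ h)
        beta = gamma * min(1.0, g ** t)
        dz = np.linalg.solve(jac(z), beta * z_bar - h)    # Step 2, equation (22)
        step = 1.0                                    # Step 3, line search (23)
        while G(z + step * dz) > (1.0 - 2.0 * sigma * (1.0 - gamma * eps_bar) * step) * g:
            step *= delta
            if step < 1e-25:                          # safeguard, not part of Algorithm 4.1
                break
        z = z + step * dz                             # update, then Step 4 loops back
    return z[1:], z[0]

# Hypothetical test problem (not from the paper): F(x) = M x + q, solution x* = (0, 1).
M = np.array([[2.0, 1.0], [1.0, 2.0]])
q = np.array([1.0, -2.0])
x_sol, eps_final = regularized_newton(lambda x: M @ x + q, lambda x: M, np.zeros(2))
```

Treating $\varepsilon$ as the first entry of $z$ means the regularization parameter is updated by the same Newton step and line search as $x$, so no separate schedule for driving $\varepsilon$ to zero is needed.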

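From Proposition 3.2, we know that for any $V \in \partial H_p(z)$ with $z = (\varepsilon, x) \in \mathbb{R}_{++} \times \mathbb{R}^n$, there exists a $W = (u(z) \ W(z)) \in \partial\Phi_p(z)$ with $u(z) \in \mathbb{R}^n$ and $W(z) \in \mathbb{R}^{n \times n}$ such that

$$V = \begin{pmatrix} 1 & 0 \\ u(z) & W(z) \end{pmatrix}. \tag{24}$$

Suppose that $F$ is a $P_0$-function. Then by Lemma 2.1, $F'_\varepsilon(x)$ is a $P$-matrix. Hence, for any $x \in \mathbb{R}^n$ and $\varepsilon > 0$, $W(z)$ is nonsingular by the proof of Proposition 2 of [20]. It thus follows that all $V \in \partial H_p(z)$ with $z = (\varepsilon, x) \in \mathbb{R}_{++} \times \mathbb{R}^n$ are nonsingular. Therefore, the Newton step in (22) is well-defined, and moreover, from (22), for any $k \ge 0$ and $\varepsilon^k > 0$, there exists a $W_k \in \partial\Phi_p(z^k)$ such that

$$(\nabla\Psi_p(z^k))^T \Delta z^k = \Phi_p(z^k)^T W_k \Delta z^k = -\Phi_p(z^k)^T \Phi_p(z^k) = -2\Psi_p(z^k). \tag{25}$$

Using this equality and Proposition 4.1, we next show that Algorithm 4.1 is well-defined.

Proposition 4.2. Suppose that $F$ is a $P_0$-function and $z^k = (\varepsilon^k, x^k) \in \mathbb{R}_{++} \times \mathbb{R}^n$ for $k \ge 0$. Then $z^{k+1} \in \mathbb{R}_{++} \times \mathbb{R}^n$ and Algorithm 4.1 is well-defined.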

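Proof. Since $\varepsilon^k > 0$, we have $G_p(z^k) > 0$ and hence $\beta_k = \beta(z^k) > 0$. From the first component of the relation (22) in Algorithm 4.1, we have

$$\varepsilon^k + \Delta\varepsilon^k = \beta_k \bar\varepsilon \ \Longrightarrow \ \Delta\varepsilon^k = -\varepsilon^k + \beta_k \bar\varepsilon. \tag{26}$$

Then, for any $\alpha \in [0, 1]$, we have

$$\varepsilon^k + \alpha\Delta\varepsilon^k = (1 - \alpha)\varepsilon^k + \alpha\beta_k \bar\varepsilon > 0. \tag{27}$$

Thus, combining the fact that $\beta(z) \le \gamma G_p(z)^{1/2}$ with (22) and (27) yields

$$\begin{aligned} (\varepsilon^k + \alpha\Delta\varepsilon^k)^2 &= [(1 - \alpha)\varepsilon^k + \alpha\beta_k \bar\varepsilon]^2 \\ &= (1 - \alpha)^2 (\varepsilon^k)^2 + 2\alpha(1 - \alpha)\beta_k \varepsilon^k \bar\varepsilon + \alpha^2 \beta_k^2 \bar\varepsilon^2 \\ &\le (1 - \alpha)^2 (\varepsilon^k)^2 + 2\alpha\beta_k \varepsilon^k \bar\varepsilon + O(\alpha^2) \\ &\le (1 - \alpha)^2 (\varepsilon^k)^2 + 2\alpha\gamma G_p(z^k)^{1/2} \|H_p(z^k)\| \bar\varepsilon + O(\alpha^2) \\ &= (1 - 2\alpha)(\varepsilon^k)^2 + 2\sqrt{2}\,\alpha\gamma\bar\varepsilon\, G_p(z^k) + O(\alpha^2). \end{aligned} \tag{28}$$

Now, we define

$$\theta(\alpha) := \Psi_p(z^k + \alpha\Delta z^k) - \Psi_p(z^k) - \alpha(\nabla\Psi_p(z^k))^T \Delta z^k.$$

Since $\Psi_p$ is continuously differentiable at any $z^k \in \mathbb{R}_{++} \times \mathbb{R}^n$ by Proposition 3.4, we obtain $\theta(\alpha) = o(\alpha)$. On the other hand, from (22) and (25) it follows that

$$\begin{aligned} \frac{1}{2}\|\Phi_p(z^k + \alpha\Delta z^k)\|^2 &= \Psi_p(z^k + \alpha\Delta z^k) \\ &= \Psi_p(z^k) + \alpha(\nabla\Psi_p(z^k))^T \Delta z^k + \theta(\alpha) \\ &= \Psi_p(z^k) - 2\alpha\Psi_p(z^k) + o(\alpha) \\ &= (1 - 2\alpha)\Psi_p(z^k) + o(\alpha) \end{aligned} \tag{29}$$

for any $\alpha \in [0, 1]$. Therefore, using (28) and (29), we obtain

$$\begin{aligned} G_p(z^k + \alpha\Delta z^k) &= \frac{1}{2}\|H_p(z^k + \alpha\Delta z^k)\|^2 \\ &= \frac{1}{2}(\varepsilon^k + \alpha\Delta\varepsilon^k)^2 + \frac{1}{2}\|\Phi_p(z^k + \alpha\Delta z^k)\|^2 \\ &\le \frac{1}{2}(1 - 2\alpha)(\varepsilon^k)^2 + \sqrt{2}\,\alpha\gamma\bar\varepsilon\, G_p(z^k) + (1 - 2\alpha)\Psi_p(z^k) + o(\alpha) \\ &\le (1 - 2\alpha) G_p(z^k) + 2\alpha\gamma\bar\varepsilon\, G_p(z^k) + o(\alpha) \\ &= [1 - 2(1 - \gamma\bar\varepsilon)\alpha] G_p(z^k) + o(\alpha) \end{aligned} \tag{30}$$

for any $\alpha \in [0, 1]$. The inequality (30) implies that there exists $\bar\alpha \in (0, 1]$ such that

$$G_p(z^k + \alpha\Delta z^k) \le [1 - 2\sigma(1 - \gamma\bar\varepsilon)\alpha] G_p(z^k) \quad \forall \alpha \in [0, \bar\alpha],$$

which indicates that Algorithm 4.1 is well-defined.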

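Proposition 4.3. Suppose that $F$ is a $P_0$-function. For each $k \ge 0$, if $\varepsilon^k > 0$ and $z^k \in \Omega$, then for any $\alpha \in [0, 1]$ such that

$$G_p(z^k + \alpha\Delta z^k) \le [1 - 2\sigma(1 - \gamma\bar\varepsilon)\alpha] G_p(z^k), \tag{31}$$

it holds that $z^k + \alpha\Delta z^k \in \Omega$.

Proof. We prove this proposition by considering the following two cases.

Case (i): $G_p(z^k) > 1$. Then $\beta_k = \gamma$. From $z^k \in \Omega$ and $\beta(z) = \gamma\min\{1, G_p(z)^t\} \le \gamma$ for any $z \in \mathbb{R}_+ \times \mathbb{R}^n$, it follows that for any $\alpha \in [0, 1]$,

$$\begin{aligned} (\varepsilon^k + \alpha\Delta\varepsilon^k) - \beta(z^k + \alpha\Delta z^k)\bar\varepsilon &\ge (1 - \alpha)\varepsilon^k + \alpha\beta_k \bar\varepsilon - \gamma\bar\varepsilon \\ &\ge (1 - \alpha)\beta_k \bar\varepsilon + \alpha\beta_k \bar\varepsilon - \gamma\bar\varepsilon \\ &= 0. \end{aligned} \tag{32}$$

Case (ii): $G_p(z^k) \le 1$. Then, for any $\alpha \in [0, 1]$ satisfying (31), we have

$$G_p(z^k + \alpha\Delta z^k) \le [1 - 2\sigma(1 - \gamma\bar\varepsilon)\alpha] G_p(z^k) \le 1. \tag{33}$$

Therefore, for any $\alpha \in [0, 1]$ satisfying (31), $\beta(z^k + \alpha\Delta z^k) = \gamma G_p(z^k + \alpha\Delta z^k)^t$. Using the fact that $z^k \in \Omega$ and the first inequality in (33), we then obtain that for any $\alpha \in [0, 1]$ satisfying (31),

$$\begin{aligned} (\varepsilon^k + \alpha\Delta\varepsilon^k) - \beta(z^k + \alpha\Delta z^k)\bar\varepsilon &\ge (1 - \alpha)\varepsilon^k + \alpha\beta_k \bar\varepsilon - \gamma G_p(z^k + \alpha\Delta z^k)^t \bar\varepsilon \\ &\ge (1 - \alpha)\beta_k \bar\varepsilon + \alpha\beta_k \bar\varepsilon - \gamma[1 - 2\sigma(1 - \gamma\bar\varepsilon)\alpha]^t G_p(z^k)^t \bar\varepsilon \\ &= \beta_k \bar\varepsilon - \gamma[1 - 2\sigma(1 - \gamma\bar\varepsilon)\alpha]^t G_p(z^k)^t \bar\varepsilon \\ &= \gamma G_p(z^k)^t \bar\varepsilon - \gamma[1 - 2\sigma(1 - \gamma\bar\varepsilon)\alpha]^t G_p(z^k)^t \bar\varepsilon \\ &= \gamma\{1 - [1 - 2\sigma(1 - \gamma\bar\varepsilon)\alpha]^t\} G_p(z^k)^t \bar\varepsilon \\ &\ge 0. \end{aligned} \tag{34}$$

Combining (32) and (34) immediately yields the desired result.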

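Proposition 4.4. Suppose that $F$ is a $P_0$-function. Then Algorithm 4.1 generates an infinite sequence $\{z^k\}$ with $z^k \in \Omega$ for all $k$ and

$$0 < \varepsilon^{k+1} \le \varepsilon^k \le \bar\varepsilon \quad \text{for all } k. \tag{35}$$

Proof. Since $z^0 = (\bar\varepsilon, x^0) \in \Omega$, the first part of the conclusion follows by repeatedly applying Propositions 4.2 and 4.3. We next concentrate on the proof of (35). First, $\varepsilon^0 = \bar\varepsilon > 0$. From the design of Algorithm 4.1 and the fact that $\beta(z) = \gamma\min\{1, G_p(z)^t\} \le \gamma$ for any $z \in \mathbb{R}_+ \times \mathbb{R}^n$, it then follows that

$$\varepsilon^1 = (1 - \delta^{l_0})\varepsilon^0 + \delta^{l_0}\beta(z^0)\bar\varepsilon \le (1 - \delta^{l_0})\bar\varepsilon + \delta^{l_0}\gamma\bar\varepsilon \le \bar\varepsilon.$$

Hence (35) holds for $k = 0$. Suppose that (35) holds for $k = i - 1$. We next prove that (35) holds for $k = i$. From the design of Algorithm 4.1, we have

$$\varepsilon^{i+1} = (1 - \delta^{l_i})\varepsilon^i + \delta^{l_i}\beta(z^i)\bar\varepsilon.$$

Noting that $\varepsilon^i \ge \beta(z^i)\bar\varepsilon$ since $z^i \in \Omega$, we then obtain

$$\varepsilon^{i+1} \le (1 - \delta^{l_i})\varepsilon^i + \delta^{l_i}\varepsilon^i = \varepsilon^i$$

and

$$\varepsilon^{i+1} \ge (1 - \delta^{l_i})\beta(z^i)\bar\varepsilon + \delta^{l_i}\beta(z^i)\bar\varepsilon = \beta(z^i)\bar\varepsilon > 0.$$

Therefore, (35) holds for $k = i$. This completes the proof.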

Now, using Propositions 3.5–3.7 and Proposition 4.4 and following the same arguments as in [30], we obtain the following global convergence results for Algorithm 4.1.

Theorem 4.1. Suppose that $F$ is a $P_0$-function and the solution set $S^*$ of the NCP is nonempty and bounded. Then the infinite sequence $\{z^k\}$ generated by Algorithm 4.1 is bounded and any accumulation point of $\{z^k\}$ is a solution of $H_p(z) = 0$.

Theorem 4.2. Suppose that $F$ is a monotone function and that the parameter $t = 1/2$ in Algorithm 4.1. If the iteration sequence $\{z^k\}$ is bounded, then the solution set $S^*$ of the NCP is nonempty. Conversely, if the solution set $S^*$ of the NCP is nonempty and $F$ is Lipschitz continuous, then the infinite sequence $\{z^k\}$ generated by Algorithm 4.1 is bounded and any accumulation point of $\{z^k\}$ is a solution of $H_p(z) = 0$.

In addition, using Proposition 3.1 and a proof similar to that of [30, Theorem 5.1], we obtain the following local superlinear (quadratic) convergence results for Algorithm 4.1.

Theorem 4.3. Suppose that $F$ is a $P_0$-function and the solution set $S^*$ of the NCP is nonempty and bounded. Suppose that $z^* := (\varepsilon^*, x^*) \in \mathbb{R} \times \mathbb{R}^n$ is an accumulation point of the infinite sequence $\{z^k\}$ generated by Algorithm 4.1 and that all $V \in \partial H_p(z^*)$ are nonsingular. Then the whole sequence $\{z^k\}$ converges to $z^*$ with

$$\|z^{k+1} - z^*\| = o(\|z^k - z^*\|), \quad \varepsilon^{k+1} = o(\varepsilon^k).$$

Furthermore, if $F'$ is locally Lipschitz continuous around $x^*$, then

$$\|z^{k+1} - z^*\| = O(\|z^k - z^*\|^2), \quad \varepsilon^{k+1} = O((\varepsilon^k)^2).$$

Moreover, by Proposition 3.3, all the conclusions of Theorem 4.3 hold if the assumption that all $V \in \partial H_p(z^*)$ are nonsingular is replaced by the assumption that the NCP is R-regular at $x^*$.

5. Numerical experiments

We implemented Algorithm 4.1 in MATLAB 6.5 and ran it on the test problems in MCPLIB [1], using the starting points provided there; all problems were included except “pvg105” and “scarfbnum”, which were unavailable. All numerical experiments were done on a PC with a 2.8 GHz CPU and 512 MB of RAM. Throughout the experiments, unless otherwise stated, we adopted the following parameters for Algorithm 4.1:

$$\delta = 0.5, \quad t = 1/2, \quad \sigma = 10^{-4}, \quad \gamma = 0.5, \quad \bar\varepsilon = 0.1.$$

[Figure: "Number of iterations vs. different p"; horizontal axis: value of p; vertical axis: the number of iterations.]

Fig. 1. The number of iterations vs. value of p for Example “bertsekas(3)”.

We terminated the iteration if one of the following conditions was satisfied:

(1) $\|H_p(z^k)\| \le \epsilon_1$ and $\|\min\{x^k, F(x^k)\}\| \le \epsilon_2$;

(2) the step length $\alpha_k = \delta^{l_k}$ is less than $\alpha_{\min}$;

(3) the number of iterations exceeds $k_{\max}$.

In our implementation, the termination parameters were chosen as follows:

$$\epsilon_1 = 10^{-10}, \quad \epsilon_2 = 10^{-6}, \quad \alpha_{\min} = 10^{-25} \quad \text{and} \quad k_{\max} = 1000.$$
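The three stopping rules can be collected in one helper. The sketch below assumes NumPy; the function name, argument names and returned labels are illustrative, and only the default threshold values are taken from the text.

```python
import numpy as np

def should_stop(H_norm, x, Fx, step, k,
                tol_H=1e-10, tol_res=1e-6, step_min=1e-25, k_max=1000):
    """Return a label for the stopping rule that fired, or None to continue."""
    natural_residual = np.linalg.norm(np.minimum(x, Fx))   # ||min{x, F(x)}||
    if H_norm <= tol_H and natural_residual <= tol_res:    # condition (1)
        return "converged"
    if step < step_min:                                    # condition (2)
        return "step too small"
    if k > k_max:                                          # condition (3)
        return "iteration limit"
    return None
```

Only the first rule signals an approximate solution; the other two indicate that the run was abandoned.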

During the experiments, we incorporated some strategies to improve the numerical behavior of Algorithm 4.1 to some extent. These strategies are well-accepted and used in basically all suitable implementations of complementarity solvers.

The first modification is in the line search step. We replaced the standard (monotone) Armijo rule by the nonmonotone line search described in [35] to seek a suitable step length, i.e., we computed the smallest nonnegative integer $l$ such that

$$G_p(z^k + \delta^l d^k) \le W_k - 2\sigma(1 - \gamma\bar\varepsilon)\delta^l G_p(z^k)$$

for all $k \ge 0$, where $W_k$ is given by

$$W_k = (\eta_{k-1} Q_{k-1} W_{k-1} + G_p(z^k)) / Q_k$$

with

$$Q_k = \eta_{k-1} Q_{k-1} + 1.$$

In our implementation, we used $W_{-1} = G_p(z^0)$, $Q_{-1} = 1$, $\eta_{-1} = 0.85$ and $\eta_k \equiv 0.85$.
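The recursion for $W_k$ and $Q_k$ is a weighted running average of past merit values. The following sketch computes the reference values $W_0, W_1, \ldots$ from a given list of merit values, assuming a constant weight of 0.85 as stated above; the function name and the sample values in the usage note are hypothetical.

```python
def nonmonotone_reference(G_values, eta=0.85):
    """Return [W_0, W_1, ...] for merit values [G_p(z^0), G_p(z^1), ...]."""
    W, Q = G_values[0], 1.0          # W_{-1} = G_p(z^0), Q_{-1} = 1
    refs = []
    for Gk in G_values:
        Q_new = eta * Q + 1.0        # Q_k = eta * Q_{k-1} + 1
        W = (eta * Q * W + Gk) / Q_new
        Q = Q_new
        refs.append(W)
    return refs
```

For instance, for the merit values `[4.0, 1.0, 2.0]` each reference value is a convex combination of the values seen so far, so it lies between their minimum and maximum; this is what allows an occasional increase of $G_p$ to be accepted.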

The second modification is necessary since the mapping $F$ is often not defined outside the positive orthant, whereas our algorithm assumes that $F$ can be evaluated on the whole space $\mathbb{R}^n$. Hence, in order to avoid possible domain violations, we employed a simple backtracking strategy: given an iterate $z^k = (\varepsilon^k, x^k) \in \mathbb{R}_{++} \times \mathbb{R}^n$ and a search direction $d^k \in \mathbb{R}^{n+1}$, we first compute the exponent $j_k := \min\{0, 1, 2, \ldots\}$ such that

$F(x^k + \delta^{j_k} d^k(2 : n + 1))$