4 Merit functions for circular cone complementarity problem

(1)

to appear in Computational Optimization and Applications, 2015

Constructions of complementarity functions and merit functions for circular cone complementarity problem

Xin-He Miao¹

Department of Mathematics School of Science Tianjin University Tianjin 300072, P.R. China E-mail: xinhemiao@tju.edu.cn

Shengjuan Guo Department of Mathematics

School of Science Tianjin University Tianjin 300072, P.R. China E-mail: gshengjuan@163.com

Nuo Qi

Department of Mathematics School of Science Tianjin University Tianjin 300072, P.R. China

E-mail: qinuo@163.com

Jein-Shan Chen ² Department of Mathematics National Taiwan Normal University

Taipei 11677, Taiwan E-mail: jschen@math.ntnu.edu.tw

March 30, 2015

1The author’s work is supported by National Young Natural Science Foundation (No. 11101302 and No. 61002027) and National Natural Science Foundation of China (No. 11471241).

2Corresponding author. The author’s work is supported by Ministry of Science and Technology, Taiwan.

(2)

(revised on August 7, 2015)

Abstract In this paper, we consider complementarity problem associated with circular cone, which is a type of nonsymmetric cone complementarity problem. The main purpose of this paper is to show the readers how to construct complementarity functions for such nonsymmetric cone complementarity problem, and propose a few merit functions for solving such a complementarity problem. In addition, we study the conditions under which the level sets of the corresponding merit functions are bounded, and we also show that these merit functions provide an error bound for the circular cone complementarity problem. These results ensure that the sequence generated by descent methods has at least one accumulation point, and build up a theoretical basis for designing the merit function method for solving circular cone complementarity problem.

Keywords. circular cone complementarity problem, complementarity function, merit function, the level sets, strong coerciveness.

1 Motivation and Introduction

The general conic complementarity problem is to find an element x ∈ IRⁿ such that x ∈ K, F (x) ∈ K^∗ and hx, F (x)i = 0, (1) where h·, ·i denotes the Euclidean inner product, F : IRⁿ→ IRⁿ is a continuously differentiable mapping, K represents a closed convex cone, and K^∗ is the dual cone of K given by

K^∗ := {v ∈ IRⁿ| hv, xi ≥ 0, ∀x ∈ K}.

When K is a symmetric cone, the problem (1) is called the symmetric cone complementarity problem [12, 14, 18, 20]. In particular, when K is the so-called second-order cone which is defined as

Kⁿ:= {(x₁, x₂) ∈ IR × IRⁿ⁻¹| kx₂k ≤ x₁},

the problem (1) reduces to the second-order cone complementarity problem [1, 3, 4, 5, 10, 11]. In contrast to symmetric cone programming and symmetric cone complementarity problem, we are not familiar with their nonsymmetric counterparts. Referring the reader to [16, 19] and the bibliographies therein, we observe that there is no any unified way to handle nonsymmetric cone constraints, and the study on each item for such problems usually uses certain specific features of the nonsymmetric cones under consideration.

In this paper, we pay attention to a special nonsymmetric cone K for problem (1).

In particular, we focus on the case of K being the circular cone defined as below, which

(3)

enables the problem (1) reduce to the circular cone complementarity problem (CCCP for short). Indeed in IRⁿ, the circular cone [7, 23] is a pointed closed convex cone having hyper-spherical sections orthogonal to its axis of revolution about which the cone is invariant to rotation. Let its half-aperture angle be θ with θ ∈ (0,^π₂). Then, the circular cone denoted by L_θ can be expressed as

L_θ := x = (x1, x₂) ∈ IR × IRⁿ⁻¹| kxk cos θ ≤ x₁

(2)

= x = (x₁, x₂) ∈ IR × IRⁿ⁻¹| kx₂k ≤ x₁tan θ .

When θ = ^π₄, the circular cone is exactly the second-order cone, which means the circular cone complementarity problem is actually the second-order cone complementarity problem. Thus, the circular cone complementarity problem (CCCP) can be viewed as the generalization of the second-order cone complementarity problem. Moreover, the CCCP includes the KKT system of the circular programming problem [13] as a special case. For real world applications of optimization problems involving circular cones, please refer to [6]. Note that in [23], Zhou and Chen characterize the relation between circular cone L_θ and second-order cone as follows:

L_θ = A⁻¹Kⁿ and Kⁿ= AL_θ with A =tan θ 0

0 I

.

In other words, for any x = (x₁, x₂) ∈ IR × IRⁿ⁻¹ and y = (y₁, y₂) ∈ IR × IRⁿ⁻¹, there have

x ∈ L_θ ⇐⇒ Ax ∈ Kⁿ, y ∈ L^∗_θ ⇐⇒ A⁻¹y ∈ Kⁿ. (3) Relation (3) indicates that after scaling the circular cone complementarity problem and the second-order cone complementarity problem are equivalent. However, when dealing with the circular cone complementarity problem, this approach may not be acceptable from both theoretical and numerical viewpoints. Indeed, if the appropriate scaling is not found or checked, some scaling step can cause undesirable numerical performance due to round-off errors in computers, which has been confirmed by experiments. Moreover, it usually need to exploits its associated merit functions or complementarity functions, which plays an important role in tackling complementarity problem. To this end, we are devoted to seeking a way to construct complementarity functions and merit functions for the circular cone complementarity problem directly. Thus, we pay our attention to the circular cone complementarity problem and the structure of L_θ mainly. There is another relationship between the circular cone and the (nonsymmetric) matrix cone introduced in [8, 9], where the authors study the epigraph of six different matrix norms, such as the Frobeninus norm, the l∞ norm, l₁ norm, the spectral or the operator norm, the nuclear norm, the Ky Fan k-norm. If we regard a matrix as a high-dimensional vector, then the circular cone is equivalent to the matrix cone with Frobeninus norm, see [24] for more details.

(4)

While there have been much attention to the symmetric cone complementarity problem and the second-order cone complementarity problem, the study about nonsymmetric cone complementarity problem is very limited. The main difficulty is that the idea for constructing complementarity functions (C-functions for short) and merit functions is not clear. Hence, The main goal of this paper is showing the readers how to construct C-functions and merit functions for such complementarity problem, and studying the properties of these merit functions. To our best knowledge, the idea is new and we be- lieve that it will help in analyzing other types of nonsymmetric cone complementarity problems.

Recall that for solving the problem (1), a popular approach is to reformulate it as an unconstrained smooth minimization problem or a system of nonsmooth equations. In this category of methods, it is important to adapt a merit function. Officially, a merit function for the circular cone complementarity problem is a function h : IRⁿ→ [0, +∞), provided that

h(x) = 0 ⇐⇒ x solves the CCCP (1).

Hence, solving the problem (1) is equivalent to handling the unconstrained minimization problem

x∈IRminⁿh(x)

with the optimal value zero. For constructing the merit functions in finite dimensional vector space, please refer to [17]. Until now, for solving symmetric cone complementarity problem, a number of merit functions have been proposed. Among them, one of the most popular merit functions is the natural residual (NR) merit function ΨN R : IRⁿ → IR, which is defined as

ΨN R(x) := 1

2kφ_NR(x, F (x))k² = 1

2kx − (x − F (x))+k²,

where (·)₊ denotes the projection onto the symmetric cone K. It is well known that Ψ_{N R}(x) = 0 if and only if x is a solution to the symmetric cone complementarity problem. In this paper, we present two classes of complementarity functions and four types of merit functions for the circular cone complementarity problem. Moreover, we investigate the properties of these proposed merit functions, and study conditions under which these merit functions provide bounded level sets. Note that such properties will guarantee that the sequence generated by descent methods has at least one accumulation point, and build up a theoretical basis for designing the merit function method for solving circular cone complementarity problem.

(5)

2 Preliminaries

In this section, we briefly review some basic concepts and background materials about the circular cone and second-order cone, which will be extensively used in subsequent analysis.

As defined in (2), the circular cone L_θ is a pointed closed convex cone and has a revolution axis which is the ray generated by the canonical vector e₁ := (1, 0, · · · , 0)^T ∈ IRⁿ. Its dual cone denoted by L^∗_θ is given as

L^∗_θ := {y = (y₁, y₂) ∈ IR × IRⁿ⁻¹| kyk sin θ ≤ y₁}.

Note that the circular cone L_θ is not a self-dual cone when θ 6= ^π₄, that is, L^∗_θ 6= L_θ, whenever θ 6= 45^◦. Hence, L_θ is not a symmetric cone for θ ∈ 0,^π₂ \{^π₄}. It is also known from [23] that the dual cone of L_θ can be expressed as

L^∗_θ = {y = (y₁, y₂) ∈ IR × IRⁿ⁻¹| ky₂k ≤ y₁cot θ} = L^π

2−θ.

Now, we talk about the projection onto L_θ and L^∗_θ. To this end, we let x₊ denote the projection of x onto the circular cone L_θ, and x− be the projection of −x onto the dual cone L^∗_θ. With these notations, for any x ∈ IRⁿ, it can be verified that x = x+− x−. Moreover, due to the special structure of the circular cone L_θ, the explicit formula of projection of x ∈ IRⁿ onto L_θ is obtained in [23] as below:

x+=







x if x ∈ L_θ, 0 if x ∈ −L^∗_θ, u otherwise,

(4)

where

u =







x₁ + kx₂k tan θ 1 + tan²θ

x₁+ kx₂k tan θ 1 + tan²θ tan θ

x₂ kx₂k





. Similarly, we can obtain the expression of x− as below:

x₋=







0 if x ∈ L_θ,

−x if x ∈ −L^∗_θ, w otherwise,

(5)

where

w =







−x₁− kx₂k cot θ 1 + cot²θ

x₁− kx2k cot θ 1 + cot²θ cot θ

x2

kx₂k





.

(6)

From the expressions (4)-(5) for x₊ and x−, it is easy to verity that hx₊, x−i = 0 for any x ∈ IRⁿ.

Next, we introduce the Jordan product associated with second-order cone. As mentioned earlier, the SOC in IRⁿ (also called Lorentz cone or ice-cream cone) is defined by

Kⁿ:= {x = (x1, x2) ∈ IR × IRⁿ⁻¹| kx2k ≤ x1}.

It is well known that the dual cone of Kⁿ is itself, and the second-order cone Kⁿ belongs to a class of symmetric cones. In addition, Kⁿ is a special case of L_θ corresponding to θ = ^π₄. In fact, there is a relationship between L_θ and Kⁿ, which is described in (3). In the SOC setting, there is so-called Jordan algebra associated with SOC. More specifically, for any x = (x₁, x₂) ∈ IR × IRⁿ⁻¹ and y = (y₁, y₂) ∈ IR × IRⁿ⁻¹, in the setting of the SOC, the Jordan product of x and y is defined as

x ◦ y :=

hx, yi y₁x₂+ x₁y₂

.

The Jordan product “◦”, unlike scalar or matrix multiplication, is not associative. The identity element under Jordan product is e = (1, 0, · · · , 0)^T ∈ IRⁿ. In this paper, we write x² to mean x ◦ x. It is known that x² ∈ Kⁿ for any x ∈ IRⁿ, and if x ∈ Kⁿ, there exists a unique vector denoted by x¹² in Kⁿ such that (x¹²)² = x¹² ◦ x¹² = x. For any x ∈ IRⁿ, we denote |x| := √

x² and x^soc₊ means the orthogonal projection of x onto the second-order cone Kⁿ. Then, it follows that x^soc₊ = x + |x|

2 . For further details regarding the SOC and Jordan product, please refer to [1, 3, 5, 10].

Lemma 2.1. ([10, Proposition 2.1]) For any x, y ∈ IRⁿ, the following holds:

x ∈ Kⁿ, y ∈ Kⁿ, and hx, yi = 0 ⇐⇒ x ∈ Kⁿ, y ∈ Kⁿ, and x ◦ y = 0.

With the help of (3) and Lemma 2.1, we obtain the following theorem which explains the relationship between SOCCP and CCCP.

Theorem 2.1. Let A =tan θ 0

0 I

. For any x = (x₁, x₂) ∈ IR × IRⁿ⁻¹ and y = (y₁, y₂) ∈ IR × IRⁿ⁻¹, the following are equivalent:

(a) x ∈ Lθ, y ∈ L^∗_θ and hx, yi = 0.

(b) Ax ∈ Kⁿ, A⁻¹y ∈ Kⁿ and hAx, A⁻¹yi = 0.

(c) Ax ∈ Kⁿ, A⁻¹y ∈ Kⁿ and Ax ◦ A⁻¹y = 0.

(d) x ∈ L_θ, y ∈ L^∗_θ and Ax ◦ A⁻¹y = 0.

(7)

In each case, elements x and y satisfy the condition that either y₂ is a multiple of x₂ or x2 is a multiple of y2.

Proof. From the relation between Kⁿ and L_θ given as in (3), we know that x ∈ L_θ ⇐⇒ Ax ∈ Kⁿ and y ∈ L^∗_θ ⇐⇒ A⁻¹y ∈ Kⁿ. Moreover, under condition (a), there holds

hAx, A⁻¹yi = hA⁻¹Ax, yi = hx, yi = 0.

Hence, it follows that (a) and (b) are equivalent. The equivalence of (b) and (c) has been shown in Lemma 2.1. In addition, based on the relation between Kⁿ and L_θ again, the equivalence of (c) and (d) is obvious.

Now, under condition (a), we prove that either y₂ is a multiple of x₂ or x₂ is a multiple of y₂. To see this, note that x ∈ L_θ and y ∈ L^∗_θ which gives

kx₂k ≤ x₁tan θ and ky₂k ≤ y₁cot θ.

This together with hx, yi = 0 yields

0 = hx, yi

= x₁y₁+ hx₂, y₂i

≥ x1y1− kx2kky2k

≥ x₁y₁− x₁y₁

= 0

which implies hx₂, y₂i = kx₂kky₂k. This says that either y₂ is a multiple of x₂ or x₂ is a multiple of y₂. Thus, the proof is complete. 2

3 C-functions for CCCP

In this section, we define C-functions for CCCP and the product of elements in the setting of the circular cone. Moreover, based on the product of elements, we construct some C- functions which play an important role in solving the circular cone complementarity problems by merit function methods.

Definition 3.1. Given a mapping φ : IRⁿ × IRⁿ → IRⁿ, we call φ an C-function for CCCP if, for any (x, y) ∈ IRⁿ× IRⁿ, it satisfies

φ(x, y) = 0 ⇐⇒ x ∈ L_θ, y ∈ L^∗_θ, hx, yi = 0.

(8)

When θ = ^π₄, an C-function for CCCP reduces to an C-function for SOCCP, i.e., φ(x, y) = 0 ⇐⇒ x ∈ Kⁿ, y ∈ Kⁿ, hx, yi = 0.

Two popular and well-known C-functions for SOCCP are Fischer-Burmeister (FB) function and natural residual (NR) function:

φ_FB(x, y) = (x²+ y²)^1/2− (x + y), φ_NR(x, y) = x − (x − y)^soc₊ .

We may ask whether we can modify the above two C-functions for SOCCP to form C-functions for CCCP. The answer is affirmative. In fact, we consider

φg_FB(x, y) := (Ax)²+ (A⁻¹y)²¹₂

− (Ax + A⁻¹y), φg_NR(x, y) := Ax − [Ax − A⁻¹y]^soc₊ .

Then, these two functions are C-functions for CCCP.

Proposition 3.1. Let gφ_FB and gφ_NR be defined as above where (Ax)² equals (Ax) ◦ (Ax) under Jordan product. Then, gφ_FB and gφ_NR are both C-functions for CCCP.

Proof. In view of Theorem 2.1 and Definition 3.1, it is not hard to verify that gφ_FB(x, y) = 0 ⇐⇒ x ∈ L_θ, y ∈ L^∗_θ, hx, yi = 0,

φg_NR(x, y) = 0 ⇐⇒ x ∈ Lθ, y ∈ L^∗_θ, hx, yi = 0, which says that these two functions are C-functions for CCCP. 2

We point out that if we consider directly the FB function φ_FB(x, y) for CCCP, un- fortunately, it cannot be C-function for CCCP because x² is not well-defined associated with the circular cone L_θ for any x ∈ IRⁿ. More specifically, because x² is defined under the Jordan product in the setting of SOC, i.e.,

x² := x ◦ x =

hx, yi x₁y₂+ y₁x₂

,

it follows that x² ∈ Kⁿ, which implies x² may not belong to L_θ or L^∗_θ. Furthermore, when φ_FB(x, y) = 0, we have x + y = (x²+ y²)¹² ∈ Kⁿ, which yields that x, y ∈ Kⁿ. This says that either x /∈ L_θ or y /∈ L^∗_θ. All the above explains that the FB function φ_FB cannot be an C-function for CCCP. Nonetheless, the NR function φ_NR : IRⁿ× IRⁿ → IRⁿ given by

φ_NR(x, y) := x − (x − y)₊ (6)

is always an C-function for CCCP. Moreover, it is also an C-function for general cone complementarity problem, see [11, Proposition 1.5.8].

(9)

Are there any other types of C-functions for CCCP and how to construct an C- function for CCCP? As mentioned earlier, The FB function φ_FB cannot serve as C- functions for CCCP because “x²” is not well-defined in the setting of circular cone. This inspires us to define a special product associated with circular cone, and find other C- functions for CCCP.

For any x = (x1, x2) ∈ IR × IRⁿ⁻¹ and y = (y1, y2) ∈ IR × IRⁿ⁻¹, we define one type of product of x and y as follows:

x • y = x₁ x2

• y₁ y2

=

hx, yi

max{tan²θ, 1} x1y2+ max{cot²θ, 1} y1x2

. (7) From the above product and direct calculation, it is easy to verify that

hx • y, zi = hx, z • yi, ∀z ∈ IRⁿ with θ ∈ 0,π

4 i

(8) and

hx • y, zi = hy, x • zi, ∀z ∈ IRⁿ with θ ∈hπ 4,π

2

. (9)

Moreover, we also obtain the following inequalities which are crucial to establishing our main results.

Lemma 3.1. For any x, y ∈ IRⁿ,

(a) if θ ∈ (0,^π₄], we have hx−, x₊• (−y)−i ≤ 0;

(b) if θ ∈ [^π₄,^π₂), we have h(−y)+, x+• (−y)−i ≤ 0.

Proof. (a) When θ ∈ (0,^π₄], let x₊ := (s, u) ∈ IR × IRⁿ⁻¹, x₋ := (t, v) ∈ IR × IRⁿ⁻¹ and (−y)− := (k, w) ∈ IR × IRⁿ⁻¹. For the elements x₊, x− and (−y)−, if there exist at least one in them is zero, it is easy to obtain

hx−, x₊• (−y)−i = 0.

If all the three elements are not equal to zero, from the definition of x+, x−, and (−y)−, we have k cot θ ≥ kwk, s tan θ = kuk, t cot θ = kvk and

u = αv or v = αu with α < 0.

Without loss of generality, we consider the case u = αv with α < 0 for the following analysis. In fact, using this, we know that

hx−, x₊• (−y)−i

= stk + thu, wi + shv, wi + khu, vi cot²θ

= kukkvkk − kkukkvk cot²θ − kukhv, wi tan θ + kukhv, wi cot θ

= (1 − cot²θ)kkukkvk − (1 − cot²θ)(kukhv, wi tan θ)

= (1 − cot²θ)[kkukkvk − kukhv, wi tan θ]

≤ (1 − cot²θ)[kkukkvk − kukkvkkwk tan θ]

= (1 − cot²θ)kukkvk[k − kwk tan θ]

≤ 0.

(10)

Here the second equality is true due to αt = αkvk tan θ = −kuk tan θ. The last inequality holds due to k cot θ ≥ kwk and θ ∈ (0,^π₄]. Hence, the desired result follows.

(b) When θ ∈ [^π₄,^π₂), with the same skills, we also conclude that h(−y)+, x+• (−y)−i ≤ 0.

Then, the desired result follows. 2

Besides the inequalities in Lemma 3.1, “•” defined as in (7) plays the similar role like what “◦” does in the setting of second-order cone. This is shown as below.

Theorem 3.1. For any x = (x₁, x₂) ∈ IR × IRⁿ⁻¹ and y = (y₁, y₂) ∈ IR × IRⁿ⁻¹, the following statements are equivalent:

(a) x ∈ Lθ, y ∈ L^∗_θ and hx, yi = 0.

(b) x ∈ L_θ, y ∈ L^∗_θ and x • y = 0.

In each case, x and y satisfy the condition that either y₂ is a multiple of x₂ or x₂ is a multiple of y₂.

Proof. In view of Theorem 2.1, we know that part (a) is equivalent to x ∈ L_θ, y ∈ L^∗_θ and Ax ◦ A⁻¹y = 0.

To proceed the proof, we discuss the following two cases.

Case 1: For θ ∈ (0,^π₄], from the definition of the product of x and y, we have x • y =

hx, yi

x₁y₂+ cot²θ y₁x₂

which implies

Ax ◦ A⁻¹y =

hx, yi

x₁tan θ y₂+ cot θ y₁x₂

= 1 0

0 (tan θ)I

(x • y).

This together with Theorem 2.1 yields the conclusion.

Case 2: For θ ∈ [^π₄,^π₂), from the definition of the product of x and y again, we have x • y =

hx, yi

tan²θ x₁y₂+ y₁x₂

which says

Ax ◦ A⁻¹y = 1 0 0 (cot θ)I

(x • y).

(11)

Then, applying Theorem 2.1 again, the desired result follows. 2

Based on the product x • y of x and y. we now introduce a class of functions φ_p : IRⁿ× IRⁿ→ IRⁿ, which is called the penalized natural residual function and defined as

φ_p(x, y) = x − (x − y)₊+ p (x₊• (−y)−) , p > 0. (10) Note that when p = 0, φ_p(x, y) reduces to φ_NR(x, y). In the following, we show that the function φp is an C-function for CCCP. To achieve the conclusion, a technical lemma is needed.

Lemma 3.2. Let φ_p : IRⁿ× IRⁿ → IRⁿ be defined as in (10). Then, for any x, y ∈ IRⁿ, we have

kφp(x, y)k ≥ max {kx−k, k(−y)+k} .

Proof. First, we prove that kφ_p(x, y)k ≥ kx−k. To see this, we observe that kφ_p(x, y)k²

= hx − (x − y)₊+ p x₊• (−y)−, x − (x − y)₊+ p x₊• (−y)−i

= hx₊− x₋− (x − y)₊+ p x₊• (−y)₋, x₊− x₋− (x − y)₊+ p x₊• (−y)₋i

= kx−k²+ kx+− (x − y)++ p x+• (−y)−k²− 2 hx−, x+− (x − y)++ p x+• (−y)−i

≥ kx−k²− 2hx−, x₊i + 2 hx−, (x − y)₊i − 2 hx−, p x₊• (−y)−i

≥ kx−k²− 2p hx−, x₊• (−y)−i .

Here, the last inequality is true due to x₊, (x − y)₊∈ L_θ, x− ∈ L^∗_θ, hx₊, x−i = 0 and the relation between Lθ and L^∗_θ. When θ ∈ (0,^π₄], by Lemma 3.1(a), we have

hx−, x₊• (−y)−i ≤ 0.

When θ ∈ [^π₄,^π₂), from equation (9), we have

hx−, x+• (−y)−i = h(−y)−, x+• x−i = 0

where the second equality holds due to x₊• x−= 0. In summary, from all the above, we prove that

kφ_p(x, y)k² ≥ kx−k².

(12)

With similar arguments, we also obtain kφ_p(x, y)k²

= hx − (x − y)₊+ p x₊• (−y)−, x − (x − y)₊+ p x₊• (−y)−i

= hy − (x − y)−+ p x₊• (−y)−, y − (x − y)−+ p x₊• (−y)−i

= h(−y)−− (−y)₊− (x − y)−+ p x₊• (−y)−, (−y)−− (−y)₊− (x − y)−

+px₊ • (−y)₋i

= k(−y)+k²+ k(−y)−− (x − y)−+ p x+• (−y)−k²− 2h(−y)+, (−y)−− (x − y)−

+px₊ • (−y)−i

≥ k(−y)₊k²− 2h(−y)₊, (−y)−i + 2h(−y)₊, (x − y)−i − 2h(−y)₊, p x₊• (−y)−i

≥ k(−y)₊k²− 2p h(−y)₊, x₊• (−y)₋i

≥ k(−y)+k²,

where the second inequality holds due to due to (−y)₊ ∈ L_θ, (−y)−, (x − y)− ∈ L^∗_θ, h(−y)₊, (−y)−i = 0 and the relation between L_θ and L^∗_θ. The last inequality holds due to equation (8) and Lemma 3.1(b). Therefore, we prove that kφ_p(x, y)k ≥ k(−y)₊k.

Then, the proof is complete. 2

Remark 3.1. From the proof of Lemma 3.2, it also can be seen that kφ_NR(x, y)k ≥ max{kx−k, k(−y)₊k}.

Theorem 3.2. Let φ_p : IRⁿ× IRⁿ→ IRⁿ be defined as in (10). Then, φ_p is an C-function for CCCP, i.e., for any x, y ∈ IRⁿ,

φ_p(x, y) = 0 ⇐⇒ x ∈ L_θ, y ∈ L^∗_θ and hx, yi = 0.

Proof. “=⇒” Suppose that φ_p(x, y) = 0. If either x /∈ L_θ or y /∈ L^∗_θ, applying Lemma 3.2 yields

kφ_p(x, y)k ≥ max{kx₋k, k(−y)₊k} > 0.

This contradicts with φ_p(x, y) = 0. Hence, there must have x ∈ L_θ and y ∈ L^∗_θ. Next, we argue that hx, yi = 0. To see this, we consider the first component of φp(x, y), which is denoted by [φ_p(x, y)]₁. In other words,

[φ_p(x, y)]₁ = [x − (x − y)₊+ p x • y]₁

=







y₁+ p hx, yi if x − y ∈ L_θ, x₁+ p hx, yi if x − y ∈ −L^∗_θ, w + p hx, yi otherwise,

(13)

where

w = x1− x₁− y₁+ kx₂− y₂k tan θ

1 + tan²θ = x₁tan²θ + y₁− kx₂− y₂k tan θ 1 + tan²θ . Since x ∈ L_θ and y ∈ L^∗_θ, it follows that x₁, y₁ ≥ 0, hx, yi ≥ 0 and

x₁tan²θ + y₁− kx₂− y₂k tan θ

1 + tan²θ ≥ tan θ(x₁tan θ − kx₂k + y₁cot θ − ky₂k)

1 + tan²θ ≥ 0.

This together with φ_p(x, y) = 0 gives phx, yi = 0. Thus, we conclude that hx, yi = 0 because p > 0.

“⇐=” Suppose that x ∈ Lθ, y ∈ L^∗_θ and hx, yi = 0. Since φ_NR is always an C-function for CCCP, we have x − (x − y)₊= 0. Using Theorem 3.1 again yields x₊• (−y)− = x • y = 0, which says φ_p(x, y) = 0. 2

Remark 3.2. In fact, for any x = (x₁, x₂) ∈ IR × IRⁿ⁻¹ and y = (y₁, y₂) ∈ IR × IRⁿ⁻¹, we define another type of product of x and y as follows:

x • y = x₁ x₂

• y₁ y₂

=

hx, yi

min{tan²θ, 1} x₁y₂+ min{cot²θ, 1} y₁x₂

. With the same skills, we may obtain the same results.

Motivated by the construction of φ_p given as in (10), we consider another function φ_r : IRⁿ× IRⁿ→ IRⁿ defined by

φ_r(x, y) = x − (x − y)₊+ r (x • y)^Ω₊ r > 0, (11)

where Ω := L_θ∩ L^∗_θ = L_θ if θ ∈ (0,^π₄],

L^∗_θ if θ ∈ [^π₄,^π₂). We point out that the function φ_r defined as in (11) is not an C-function for CCCP. The reason come from that if φ_r(x, y) = 0, we have φ_NR(x, y) = x − (x − y)₊ = −r (x • y)^Ω₊. Combining with the expression of φ_p, this implies that

−r (x • y)^Ω₊+ p (x₊• (−y)−) 6= 0

due to (x • y)^Ω₊ ∈ Ω = L_θ ∩ L^∗_θ and x₊ • (−y)− ∈ K/ ⁿ ⊇ L_θ (or L^∗_θ) when θ ∈ (0,^π₄] (or θ ∈ [^π₄,^π₂)). This explains that φ_p(x, y) 6= 0, which contradicts φ_p(x, y) being an C-function for CCCP.

However, there is a merit function related to φ_r which possesses property of bounded level sets. We will explore it in next section.

(14)

4 Merit functions for circular cone complementarity problem

In this section, based on the product (7) of x and y in IRⁿ, we propose four classes of merit functions for the circular cone complementarity problem and investigate their important properties, respectively.

First, we recall that a function F : IRⁿ → IRⁿ is said to be monotone if, for any x, y ∈ IRⁿ, there holds

hx − y, F (x) − F (y)i ≥ 0;

and strictly monotone if, for any x 6= y, the above inequality holds strictly; and strongly monotone with modulus ρ > 0 if, for any x, y ∈ IRⁿ, the following inequality holds

hx − y, F (x) − F (y)i ≥ ρkx − yk².

The following technical lemma is crucial for achieving the property of bounded level sets.

Lemma 4.1. Suppose that CCCP has a strictly feasible point ¯x, i.e., ¯x ∈ int(L_θ) and F (¯x) ∈ int(L^∗_θ) and that F is a monotone function. Then, for any sequence {x^k} satisfying

x^k

→ ∞, lim sup

k→∞

x^k₋

< ∞ and lim sup

k→∞

(−F (x^k))₊ < ∞, we have

x^k, F (x^k) → ∞ and x^k₊, (−F (x^k))− → ∞.

Proof. Since F is monotone, for all x^k ∈ IRⁿ, we know x^k− ¯x, F (x^k) − F (¯x) ≥ 0, which says

x^k, F (x^k) + h¯x, F (¯x)i ≥x^k, F (¯x) + ¯x, F (x^k) . (12) Using x^k = x^k₊− x^k₋ and F (x^k) = (−F (x^k))−− (−F (x^k))₊, it follows from (12) that

x^k, F (x^k) + h¯x, F (¯x)i

≥ x^k₊, F (¯x) − x^k₋, F (¯x) + ¯x, (−F (x^k))− − ¯x, (−F (x^k))₊ . (13) We look into the first term in the right-hand side of (13).

x^k₊, F (¯x)

= x^k₊

1[f (¯x)]₁+x^k₊

2, [f (¯x)]₂

≥ x^k₊

1[f (¯x)]₁− x^k₊

2

· k[f (¯x)]₂k

≥ x^k₊

1[f (¯x)]₁−x^k₊

1tan θ k[f (¯x)]₂k

= x^k₊

1{[f (¯x)]₁− tan θ k[f (¯x)]₂k} . (14)

(15)

Note that x^k= x^k₊−x^k₋, it gives kx^k₊k ≥ kx^kk−kx^k₋k. From the assumptions on {x^k}, i.e., kx^kk → ∞, and lim sup_k→∞kx^k₋k < ∞, we see that kx^k₊k → ∞, and hence [x^k₊]1 → ∞.

Because CCCP has a strictly feasible point ¯x, we have [f (¯x)]₁− tan θk[f (¯x)]₂k > 0, which together with (14) implies that

hx^k₊, F (¯x)i → ∞ (k → ∞). (15)

On the other hand, we observe that lim sup

k→∞

hx^k₋, F (¯x)i ≤ lim sup

k→∞

kx^k₋kkF (¯x)k < ∞ lim sup

k→∞

h¯x, (−F (x^k))+i ≤ lim sup

k→∞

k¯xkk(−F (x^k))+k < ∞ and h¯x, (−F (x^k))−i ≥ 0. All of these together with (13) and (15) yield

x^k, F (x^k) → ∞, which is the first part of the desired result.

Next, we prove thatx^k₊, (−F (x^k))₋ → ∞. Suppose not, that is, limk→∞x^k₊, (−F (x^k))₋ <

∞. Then, we obtain

x^k₊, (−F (x^k))₋ kx^k₊k =

x^k₊

kx^k₊k, (−F (x^k))−

→ 0.

This means that there exists ¯x ∈ IRⁿ such that x^k₊

kx^k₊k → x¯₊ k¯x₊k and

x¯₊

k¯x₊k, (−F (¯x))−

= 0. (16)

Denote z := x¯₊

k¯x₊k and apply Theorem 3.1, there exists α ∈ IR such that [(−F (¯x))−]₂ = αz₂ or αz₂ = [(−F (¯x))−]₂.

It is obvious that z ∈ L_θ and (−F (¯x))− ∈ L^∗_θ. Hence, equation (16) implies that α < 0, which says that z₂ and [(−F (¯x))−]₂ are in opposite direction to each other. From the expression of (−F (¯x))₊and (−F (¯x))₋again, it follows that [(−F (¯x))₊]₂ and [(−F (¯x))₋]₂ are in the opposite direction, to each other. These conclude that z₂ and [(−F (¯x))₊]₂ are in the same direction, which means [¯x₊]₂ and [(−F (¯x))₊]₂ are also in the same direction.

Now, combining with the fact that ¯x₊, (−F (¯x))₊∈ L_θ, we have h¯x+, (−F (¯x))+i ≥ 0.

Similarly, by the the relation between ¯x₊ and ¯x−, we know [¯x−]₂ and [(−F (¯x))−]₂ are in the same direction. Then, combining with ¯x−, (−F (¯x))−∈ L^∗_θ, it leads to

h¯x−, (−F (¯x))−i ≥ 0.

(16)

Moreover, writing out the expression for h¯x, F (¯x)i, we see that

h¯x, F (¯x)i = h¯x₊, (−F (¯x))₋i − h¯x₊, (−F (¯x))₊i − h¯x₋, (−F (¯x))₋i + h¯x₋, (−F (¯x))₊i.

Note that the second and third terms of the right-hand side are nonpositive and the fourth is bounded from above. Hence, from the assumptions lim_k→∞x^k₊, (−F (x^k))− < ∞, we conclude that h¯x, F (¯x)i < ∞, which contradict

h¯x, F (¯x)i = lim

k→∞x^k, F (x^k) = ∞.

Thus, we prove that x^k₊, (−F (x^k))− → ∞. 2

4.1 The first class of merit functions

For any x ∈ IRⁿ, from the analysis of the section 3, we know that the function φ_p and φ_NR are complementarity function for CCCP. In this subsection, we focus on the property of bounded level sets of merit functions based on φ_NR and φ_p with the product of elements, which is a property to guarantee that the existence of accumulation points of sequence generated by some descent algorithms.

Theorem 4.1. Let φ_p be defined as in (10). Suppose that CCCP has a strictly feasible point and that F is monotone. Then, the level set

L_p(α) = {x ∈ IRⁿ| kφ_p(x, F (x))k ≤ α}

is bounded for all α ≥ 0.

Proof. We prove this result by contradiction. Suppose there exists an unbounded sequence {x^k} ⊂ L_p(α) for some α ≥ 0. If kx^k₋k → ∞ or k(−F (x^k))₊k → ∞, by Lemma 3.2, we have kφ_p(x^k, F (x^k))k → ∞, which contradicts kφ_p(x^k, F (x^k))k ≤ α. On the other hand, if

lim sup

k→∞

kx^k₋k < ∞ and lim sup

k→∞

(−F (x^k))+

< ∞,

it follows from Lemma 4.1 that x^k₊, (−F (x^k))− → ∞. From the proof of Lemma 4.1, there exists a constant κ₀ such that

φ_NR(x^k, f (x^k))

1

≥











[x^k₊]₁− κ₀ if x^k− F (x^k) ∈ −L^∗_θ,

(−F (x^k))−

1− κ₀ if x^k− F (x^k) ∈ L_θ,

[x^k₊]1tan²θ+[^{(−F (x}^k⁾⁾⁻]₁^−k[x^k+]2k tan θ−k[(−F (x^k))−]2k tan θ 1+tan²θ

−^2κ_1+tan⁰^{(1+tan θ)}2θ , if x^k− F (x^k) /∈ L_θ∪ −L^∗_θ,

(17)

which means lim infφ_NR(x^k, f (x^k))

1 > −∞. Hence, it follows that

φp(x^k, f (x^k))

1 = φ_NR(x^k, f (x^k))

1+(x^k₊• (−F (x^k))₋

1

= φ_NR(x^k, f (x^k))

1+x^k₊, (−F (x^k))−

→ ∞, where the limit comes from

x^k₊, (−F (x^k))− → ∞ and lim inf φ_NR(x^k, f (x^k))

1 > −∞.

Thus, we obtain that kφ_p(x^k, F (x^k))k → ∞ which contradicts kφ_p(x^k, F (x^k))k ≤ α.

Then, the proof is complete. 2

Note that, under the conditions of Lemma 4.1 or Theorem 4.1, we cannot guarantee the boundedness of the level set on the NR function φ_NR. For example, let F (x) = 1 −1 and x > 0, it is easy to verify that the level set x

L_NR(2) = {x ∈ IRⁿ| kφ_NR(x, F (x))k ≤ 2}

is unbounded. In fact, In order to establish the boundedness of the level set on the natural residual function φ_NR, we need the following concept.

Definition 4.1. A mapping F : IRⁿ → IRⁿ is said to be strongly coercive if

kxk→∞lim

hF (x), x − yi kx − yk = ∞.

holds for all y ∈ IRⁿ.

Theorem 4.2. Suppose that F is strongly coercive. Then, the level set L_NR(α) = {x ∈ IRⁿ| kφ_NR(x, F (x))k ≤ α}

Proof. Again, we prove this result by contradiction. Suppose there exists an unbounded sequence {x^k} ⊂ L_NR(α) for some α ≥ 0, i.e.,

x^k

→ ∞. Note that the sequence

φ_NR(x^k, F (x^k)) = x^k− (x^k− F (x^k))₊ is bounded. It follows from the unboundedness of the sequence {x^k} that the sequence {(x^k− F (x^k))+} is also unbounded. Then, for any y ∈ L_θ, there exist N ∈ N and β > 0 such that

(x^k− F (x^k))₊− y

> β, ∀k > N.

(18)

From the property of projection mapping, we have

x^k− F (x^k) − (x^k− F (x^k))₊, y − (x^k− F (x^k))₊ ≤ 0 (17) for each k > N . On the other hand,

x^k− F (x^k) − (x^k− F (x^k))₊, y − (x^k− F (x^k))₊

= x^k− (x^k− F (x^k))₊, y − (x^k− F (x^k))₊ + F (x^k), (x^k− F (x^k))₊− y

≥ −

x^k− (x^k− F (x^k))₊ ·

y − (x^k− F (x^k))₊

+F (x^k), (x^k− F (x^k))₊− y

≥

y − (x^k− F (x^k))+

hF (x^k), (x^k− F (x^k))₊− yi ky − (x^k− F (x^k))₊k − α

. Plugging in y^k := x^k− (x^k− F (x^k))₊− y, we obtain

lim

k→∞

F (x^k), (x^k− F (x^k))₊− y

ky − (x^k− F (x^k))+k = lim

k→∞

F (x^k), x^k− y^k kx^k− y^kk = ∞,

where the last equality holds due to the strong coercivity of F and [22, Theorem 2.1].

This implies that

k→∞lim x^k− F (x^k) − (x^k− F (x^k))+, y − (x^k− F (x^k))+ = ∞, which contradicts (17). Therefore, the level set

L_NR(α) = {x ∈ IRⁿ| kφ_NR(x, F (x))k ≤ α}

is bounded for all α ≥ 0. 2

4.2 The second class of merit functions

For any x ∈ IRⁿ, LT (standing for Luo-Tseng) merit function for the circular cone complementarity problem is given as follows:

f_LT(x) := ϕ(hx, F (x)i) + 1

2k(x)−k²+ k(−F (x))₊k² , (18) where ϕ : IR → IR₊ is an arbitrary smooth function satisfying

ϕ(t) = 0, ∀t ≤ 0 and ϕ⁰(t) > 0, ∀t > 0.

Notice that we have ϕ(t) ≥ 0 for all t ∈ IR from the above condition. Indeed, this class of functions has been considered for the SDCP case (positive semidefinite complementarity problem) by Tseng in [21], for the SOCCP case (second-order cone complementarity problem) by Chen in [2] and for the general SCCP case by Pan and Chen in [18], respectively. For the case of generally closed convex cone complementarity problems, the LT merit function has been studied by Lu and Huang in [15]. In view of the results in [15], it is easy to obtain the following results directly for the circular cone complementarity problem.

(19)

Proposition 4.1. Let f_LT : IRⁿ → IR be given as in (18). Then, the following results hold.

(a) For all x ∈ IRⁿ, we have f_LT(x) ≥ 0; and f_LT(x) = 0 if and only if x solves the circular cone complementarity problem.

(b) If F (·) is differentiable, then so is f_LT(·). Moreover,

∇f_LT(x) = ∇ϕ(hx, F (x)i)[F (x) + x∇F (x)] − x−− ∇F (x)(−F (x))₊ for all x ∈ IRⁿ.

Proof. See Lemma 3.1 and Theorem 3.4 in [15]. 2

Proposition 4.2. Let f_LT be given as in (18). Suppose that F : IRⁿ→ IRⁿ is a strongly monotone mapping and that the circular cone complementarity problem has a solution x^∗. Then, there exists a constant τ > 0 such that

τ kx − x^∗k² ≤ max{0, hx, F (x)i} + kx−k + k(−F (x))+k, ∀x ∈ IRⁿ. Moreover,

τ kx − x^∗k² ≤ ϕ⁻¹(f_LT(x)) + 2[f_LT(x)]¹², ∀x ∈ IRⁿ. Proof. See Theorem 3.6 in [15]. 2

In the following theorem, we present the condition which ensures the boundedness of the level sets for LT merit function f_LT to solve the circular cone complementarity problem.

Theorem 4.3. Suppose that the circular cone complementarity problem has a strictly feasible point and that F is monotone. Then, the level set

L_f_LT(α) := {x ∈ IRⁿ| f_LT(x) ≤ α}

Proof. We prove this result by contradiction. Suppose there exists an unbounded sequence {x^k} ⊆ L_f_LT(α) for some α ≥ 0. We may assert that the sequences {x^k₋} and {(−F (x^k))₊} are bounded. If not, from the expression (18) of LT merit function f_LT and the property ϕ(t) ≥ 0 for all t ∈ IR, it follows that

fLT(x^k) ≥ 1

2[kx^k₋k²+ k(−F (x^k))+k²] → ∞,

which contradicts {x^k} ⊆ L_f_LT(α), i.e., f_LT(x^k) ≤ α. Therefore, we have lim sup

k→∞

kx^k₋k < ∞ and lim sup

k→∞

k(−F (x^k))₊k < ∞.

(20)

Then, by Lemma 4.1, we get that

hx^k, F (x^k)i → ∞.

By the properties of the function ϕ again, we obtain that ϕ(hx^k, F (x^k)i) → ∞, which implies f_LT(x^k) → ∞. This contradicts {x^k} ⊆ L_f_LT(α). Hence, the level set L_f_LT(α) is bounded for all α ≥ 0. 2

4.3 The third class of merit functions

To achieve the third class of merit functions, we make a slight modification of LT merit function f_LT for the circular cone complementarity problem. More specifically, we consider the set Ω as follows:

Ω := L_θ∩ L^∗_θ = L_θ for 0 < θ ≤ ^π₄, L^∗_θ for ^π₄ < θ < ^π₂.

Indeed, Ω is also a closed convex cone. In light of this Ω, another function is considered:

fd_LT(x) := 1

2k(x • F (x))^Ω₊k²+ 1

2kx−k²+ k(−F (x))₊k² , (19) where (x • y)^Ω₊denotes the projection of x • y onto Ω. Then, together with the expressions (7) of x • y, we can verify that the function df_LT is also a type of merit function for the circular cone complementarity problem, which will be shown in following theorem.

Theorem 4.4. Let the function dfLT be given by (19). Then, for all x ∈ IRⁿ, we have fd_LT(x) = 0 ⇐⇒ x ∈ L_θ, F (x) ∈ L^∗_θ and hx, F (x)i = 0,

where L^∗_θ denotes the dual cone of L_θ, i.e., L^∗_θ = L^π

2−θ.

Proof. By the definition of the function dfLT given by (19), we have fd_LT(x) = 0 ⇔ k(x • F (x))^Ω₊k = 0, kx−k = 0 and k(−F (x))₊k = 0,

⇔ (x • F (x))^Ω₊ = 0, x−= 0 and (−F (x))₊ = 0,

⇔ x • F (x) ∈ −L_θ or x • F (x) ∈ −L^∗_θ, x ∈ L_θ, and F (x) ∈ L^∗_θ,

⇔ x ∈ Lθ, F (x) ∈ L^∗_θ and hx, F (x)i = 0,

where the last equivalence holds due to the properties of the cone −L_θ or −L^∗_θ. Thus, the proof is complete. 2

From Theorem 4.4, we know that the function df_LT is a merit function for the circular cone complementarity problem. As below, according to the type of dot product (7), we establish the differentiability of df_LT.

(21)

Theorem 4.5. Let df_LT : IRⁿ → IR be given by (19). Suppose that the type of dot product (7) is employed. If F (·) is differentiable, then so is df_LT(·). Moreover, for all x ∈ IRⁿ, we have

∇df_LT(x) = (L_y+ ∇F (x)L_x) · (x • F (x))^Ω₊− x−− ∇F (x)(−F (x))₊, where

L_x =

y1 y₂^T

max{tan²θ, 1}y₂ max{cot²θ, 1}y₁I

and

L_y =

x₁ x^T₂

max{tan²θ, 1}x₂ max{cot²θ, 1}x₁I

with I being the identity matrix.

Proof. From the proof of Lemma 3.1(b) in [15], we have

∇ 1

2k(z)^Ω₊k²

= (z)^Ω₊, ∀z ∈ IRⁿ. Then, by the chain rule again, it follows that

∇ 1

2k(x • F (x))^Ω₊k²

= ∇x(x • F (x)) · (x • F (x))^Ω₊

= [L_y+ ∇F (x)L_x] · (x • F (x))^Ω₊, where

L_x =

y₁ y₂^T

max{tan²θ, 1}y₂ max{cot²θ, 1}y₁I

and

L_y =

x₁ x^T₂

max{tan²θ, 1}x₂ max{cot²θ, 1}x₁I

with I being the identity matrix. Thus, we obtain that

∇df_LT(x) = (L_y+ ∇F (x)L_x) · (x • F (x))^Ω₊− (x)−− ∇F (x)(−F (x))₊ for all x ∈ IRⁿ. 2

In order to establish error bound property of the merit function df_LT for the circular cone complementarity problem, we need a technical lemma as below.

Lemma 4.2. Let x = (x₁, x₂) ∈ IR × IRⁿ⁻¹ and y = (y₁, y₂) ∈ IR × IRⁿ⁻¹. Then, we have hx, yi ≤ max 1 + tan²θ

√2 ,1 + cot²θ

√2

(x • y)^Ω₊ where • is defined as in (7).