
to appear in Pacific Journal of Optimization, 2017

Applying a type of SOC-functions to solve a system of equalities and inequalities under the order induced by second-order cone

Xin-He Miao 1
Department of Mathematics, Tianjin University, Tianjin 300072, China

Nuo Qi 2
Department of Mathematics, Tianjin University, Tianjin 300072, China

B. Saheya 3
College of Mathematical Science, Inner Mongolia Normal University, Hohhot 010022, Inner Mongolia, China

Jein-Shan Chen 4
Department of Mathematics, National Taiwan Normal University, Taipei 11677, Taiwan

April 28, 2016 (revised on July 15, 2017)

Abstract. In this paper, we introduce a special type of SOC-function, which is a vector-valued function associated with the second-order cone. Using it, we construct a class of smoothing functions that converge to the projection function onto the second-order cone. We then reformulate the system of equalities and inequalities under the order induced by the second-order cone as a system of parameterized smooth equations. Accordingly, we propose a smoothing-type Newton algorithm to solve the reformulation, and show that the proposed algorithm is globally convergent and locally quadratically convergent under suitable assumptions. Preliminary numerical results demonstrate that the approach is effective. A numerical comparison based on various smoothing functions is reported as well.

1 E-mail: xinhemiao@tju.edu.cn. The author's work is supported by the National Natural Science Foundation of China (No. 11471241).

2 E-mail: qinuotju@126.com.

3 E-mail: saheya@imnu.edu.cn. The author's work is supported by the National Natural Science Foundation of China (No. 11402127, 11401326).

4 Corresponding author. E-mail: jschen@math.ntnu.edu.tw. The author's work is supported by the Ministry of Science and Technology, Taiwan.

Keywords. System of equalities and inequalities, second-order cone, SOC-function, smoothing algorithm, global convergence.

1 Introduction

The second-order cone (SOC for short and denoted by Kn) in IRn (n ≥ 1), also called the Lorentz cone, is defined as

Kn := { (x1, x2) ∈ IR × IR^{n−1} | ‖x2‖ ≤ x1 },

where ‖ · ‖ denotes the Euclidean norm. By the definition of Kn, if n = 1, then K1 is the set of nonnegative reals IR+. Moreover, a general second-order cone K is a Cartesian product of SOCs, i.e.,

K := K^{n1} × K^{n2} × · · · × K^{nr}.

Since all the analysis carries over to the Cartesian product setting, we focus on a single second-order cone Kn for simplicity. It is well known that the second-order cone Kn is a symmetric cone. During the past decade, optimization problems involving SOC constraints and their corresponding solution methods have been studied extensively, see [1, 5, 8, 13, 19, 20, 24, 29, 30, 31, 34, 35] and references therein.

There is a spectral decomposition with respect to the second-order cone Kn in IRn, which plays a very important role in the study of second-order cone optimization problems. For any vector x = (x1, x2) ∈ IR × IR^{n−1}, the spectral decomposition (or spectral factorization) of x with respect to Kn is given by

x = λ1(x) u_x^{(1)} + λ2(x) u_x^{(2)},   (1)

where λ1(x), λ2(x) and u_x^{(1)}, u_x^{(2)} are called the spectral values and the spectral vectors of x, respectively, with the corresponding formulas given below:

λi(x) = x1 + (−1)^i ‖x2‖,  i = 1, 2,   (2)

u_x^{(i)} = (1/2) ( 1, (−1)^i x2/‖x2‖ )  if x2 ≠ 0,   u_x^{(i)} = (1/2) ( 1, (−1)^i w )  if x2 = 0,   (3)

for i = 1, 2, with w being any vector in IR^{n−1} satisfying ‖w‖ = 1. Moreover, { u_x^{(1)}, u_x^{(2)} } is called a Jordan frame and satisfies the following properties:

u_x^{(1)} + u_x^{(2)} = e,   ⟨u_x^{(1)}, u_x^{(2)}⟩ = 0,   u_x^{(1)} ◦ u_x^{(2)} = 0   and   u_x^{(i)} ◦ u_x^{(i)} = u_x^{(i)}  (i = 1, 2),

where e = (1, 0, · · · , 0)^T ∈ IR^n is the unit element and the Jordan product x ◦ y is defined by x ◦ y := (⟨x, y⟩, x1 y2 + y1 x2) ∈ IR × IR^{n−1} for any x = (x1, x2), y = (y1, y2) ∈ IR × IR^{n−1}. For more details about the Jordan product, please refer to [11].
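The following is a small illustrative sketch (ours, not part of the paper; the authors' experiments use MATLAB) of the spectral decomposition (1)-(3) and the Jordan frame properties above, written in Python with NumPy; the helper names spectral and jordan are our own.

import numpy as np

def spectral(x):
    # spectral values lambda_i(x) = x1 + (-1)^i ||x2|| and spectral vectors u_x^(i) as in (2)-(3)
    x1, x2 = x[0], x[1:]
    nx2 = np.linalg.norm(x2)
    lam = np.array([x1 - nx2, x1 + nx2])
    w = x2 / nx2 if nx2 > 0 else np.eye(len(x2))[0]   # any unit vector w when x2 = 0
    u = [0.5 * np.concatenate(([1.0], -w)), 0.5 * np.concatenate(([1.0], w))]
    return lam, u

def jordan(x, y):
    # Jordan product x o y = (<x, y>, x1*y2 + y1*x2)
    return np.concatenate(([x @ y], x[0] * y[1:] + y[0] * x[1:]))

x = np.array([1.0, 2.0, -0.5])
lam, u = spectral(x)
e = np.eye(len(x))[0]                                  # unit element e = (1, 0, ..., 0)
print(np.allclose(x, lam[0] * u[0] + lam[1] * u[1]))   # decomposition (1)
print(np.allclose(u[0] + u[1], e),
      np.isclose(u[0] @ u[1], 0.0),
      np.allclose(jordan(u[0], u[1]), 0.0),
      np.allclose(jordan(u[0], u[0]), u[0]))           # Jordan frame properties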

In [5, 6], for any real-valued function f : IR → IR and x = (x1, x2) ∈ IR × IRn−1, based on the spectral factorization of x with respect to Kn, a type of vector-valued function associated with Kn (also called SOC-function) is introduced. More specifically, if we apply f to the spectral values of x in (1), then we obtain the function fsoc : IRn → IRn given by

fsoc(x) = f(λ1(x)) u_x^{(1)} + f(λ2(x)) u_x^{(2)}.   (4)

From the expression (4), it is clear that the SOC-function fsoc is well defined whether x2 = 0 or x2 ≠ 0. Further properties regarding fsoc were discussed in [3, 4, 5, 7, 17, 32].

It is also known that such SOC-functions fsoc associated with the second-order cone play a crucial role in the theory and numerical algorithms for second-order cone programming, see [1, 5, 8, 13, 19, 20, 24, 29, 30, 31, 34, 35] again.

In this paper, in light of the definition of fsoc, we define another type of SOC-function Φµ (see Section 2 for details). In particular, using the SOC-function Φµ, we will solve the following system of equalities and inequalities under the order induced by the second-order cone:

fI(x) ⪯_{Km} 0,
fE(x) = 0,   (5)

where fI(x) = (f1(x), · · · , fm(x))^T, fE(x) = (fm+1(x), · · · , fn(x))^T, and "x ⪯_{Km} 0" means "−x ∈ Km". Likewise, x ⪰_{Km} 0 means x ∈ Km and x ≻_{Km} 0 means x ∈ int(Km), where int(Km) denotes the interior of Km. Throughout this paper, we assume that fi is continuously differentiable for any i ∈ {1, 2, ..., n}. We also define

f(x) := ( fI(x) ; fE(x) ),

and hence f is continuously differentiable. When Km = IR^m_+, the system (5) reduces to the standard system of equalities and inequalities over IR^m. The corresponding standard


system (5) has been studied extensively due to its various applications, and there are many methods for solving such problems, see [10, 27, 28, 33, 37]. In the setting of the second-order cone, the KKT conditions of second-order cone constrained optimization problems can be expressed in the form of (5), i.e., as a system of equalities and inequalities under the order induced by second-order cones. For example, for the second-order cone optimization problem

min h(x)
s.t. −g(x) ∈ Km,

the KKT conditions are

∇h(x) + ∇g(x)λ = 0,   λ^T g(x) = 0,
−λ ⪯_{Km} 0,   g(x) ⪯_{Km} 0,

where ∇g(x) denotes the gradient matrix of g. Now, by denoting

fI(x, λ) := ( −λ ; g(x) )   and   fE(x, λ) := ( ∇h(x) + ∇g(x)λ ; λ^T g(x) ),

it is clear that the KKT conditions of the second-order cone optimization problem are in the form of (5). From this viewpoint, the investigation of the system (5) provides a theoretical way for solving second-order cone optimization problems. Hence, the study of the system (5) is important, and that is the main motivation for this paper.

So far, there are many kinds of numerical methods for solving second-order cone optimization problems. Among them is a class of popular numerical methods, the so-called smoothing-type algorithms. This kind of algorithm has also been a powerful tool for solving many other optimization problems, including symmetric cone complementarity problems [15, 16, 20, 21, 22], symmetric cone linear programming [23, 26], the system of inequalities under the order induced by symmetric cones [18, 25, 38], and so on. In most of these recent studies, the existing smoothing-type algorithms were designed on the basis of a monotone line search. In order to achieve better computational results, a nonmonotone line search technique is sometimes adopted in the numerical implementation of smoothing-type algorithms [15, 36, 37]. The main reason is that the nonmonotone line search scheme can improve the likelihood of finding a global optimal solution and the convergence speed in cases where the function involved is highly nonconvex or has a valley in a small neighborhood of some point. In view of this, in this paper we also develop a nonmonotone smoothing-type algorithm for solving the system of equalities and inequalities under the order induced by second-order cones.


The remaining parts of this paper are organized as follows. In Section 2, some background concepts and preliminary results about the second-order cone are given. In Section 3, we reformulate (5) as a system of smoothing equations in which Φµ is employed.

In Section 4, we propose a nonmonotone smoothing-type algorithm for solving (5), and show that the algorithm is well defined. Moreover, we also discuss the global convergence and local quadratic convergence of the proposed algorithm. Preliminary numerical results demonstrating that the proposed algorithm is effective are reported in Section 5. Some numerical comparisons in light of performance profiles are presented, which indicate the difference in numerical performance when various smoothing functions are used.

2 Preliminaries

In this section, we briefly review some basic properties of the second-order cone and of the vector-valued functions associated with SOC, which will be used extensively in the subsequent analysis. More details about the second-order cone and the vector-valued functions can be found in [3, 4, 5, 13, 14, 17].

First, we review the projection of x ∈ IRn onto the second-order cone Kn ⊂ IRn. For the second-order cone Kn, let (Kn)* denote its dual cone. Then, (Kn)* is given by

(Kn)* := { y = (y1, y2) ∈ IR × IR^{n−1} | ⟨x, y⟩ ≥ 0, ∀x ∈ Kn }.

Moreover, it is well known that the second-order cone Kn is a self-dual cone, i.e., (Kn)* = Kn. Let x+ denote the projection of x ∈ IRn onto the second-order cone Kn, and x− denote the projection of −x onto the dual cone (Kn)*. With these notations, for any x ∈ IRn, it is not hard to verify that x = x+ − x−. In particular, due to the special structure of Kn, the explicit formula of the projection of x ∈ IRn onto Kn is obtained in [14] as below:

x+ = x  if x ∈ Kn,   x+ = 0  if x ∈ −(Kn)* = −Kn,   x+ = u  otherwise,   (6)

where

u = ( (x1 + ‖x2‖)/2 ,  ((x1 + ‖x2‖)/2) · x2/‖x2‖ ).

In fact, according to the spectral decomposition of x, the projection x+ onto Kn can alternatively be expressed as (see [13, Prop. 3.3(b)])

x+ = (λ1(x))_+ u_x^{(1)} + (λ2(x))_+ u_x^{(2)},

where (α)_+ = max{0, α} for any α ∈ IR.
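As a quick illustration (ours, not from the paper), the following Python/NumPy sketch computes x+ both by the explicit formula (6) and by the spectral formula above, and checks the decomposition x = x+ − x−; the function names are hypothetical.

import numpy as np

def proj_soc_explicit(x):
    x1, x2 = x[0], x[1:]
    nx2 = np.linalg.norm(x2)
    if nx2 <= x1:                      # x in K^n
        return x.copy()
    if nx2 <= -x1:                     # x in -K^n
        return np.zeros_like(x)
    t = (x1 + nx2) / 2.0               # otherwise: the "u" case in (6)
    return np.concatenate(([t], t * x2 / nx2))

def proj_soc_spectral(x):
    x1, x2 = x[0], x[1:]
    nx2 = np.linalg.norm(x2)
    lam = np.array([x1 - nx2, x1 + nx2])
    w = x2 / nx2 if nx2 > 0 else np.eye(len(x2))[0]
    u1 = 0.5 * np.concatenate(([1.0], -w))
    u2 = 0.5 * np.concatenate(([1.0], w))
    return max(lam[0], 0.0) * u1 + max(lam[1], 0.0) * u2

x = np.array([-0.3, 1.2, 0.4, -0.7])
print(np.allclose(proj_soc_explicit(x), proj_soc_spectral(x)))             # the two formulas agree
print(np.allclose(x, proj_soc_explicit(x) - proj_soc_explicit(-x)))        # x = x_+ - x_-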

From the definition (4) of the vector-valued function associated with Kn, we know that the projection x+ onto Kn is a vector-valued function. Moreover, it is known that the projection x+ and the scalar function (α)_+ share many of the same properties, such as continuity, directional differentiability, semismoothness, and so on. Indeed, these properties are established for general vector-valued functions associated with SOC. In particular, Chen, Chen and Tseng [5] showed that many properties of fsoc are inherited from the function f, as presented in the following proposition.

Proposition 2.1. Suppose that x = (x1, x2) ∈ IR × IR^{n−1} has the spectral decomposition given as in (1)-(3). For any function f : IR → IR and the vector-valued function fsoc defined by (4), the following hold.

(a) fsoc is continuous at x ∈ IRn with spectral values λ1(x), λ2(x) ⇐⇒ f is continuous at λ1(x), λ2(x);

(b) fsoc is directionally differentiable at x ∈ IRn with spectral values λ1(x), λ2(x) ⇐⇒ f is directionally differentiable at λ1(x), λ2(x);

(c) fsoc is differentiable at x ∈ IRn with spectral values λ1(x), λ2(x) ⇐⇒ f is differentiable at λ1(x), λ2(x);

(d) fsoc is strictly continuous at x ∈ IRn with spectral values λ1(x), λ2(x) ⇐⇒ f is strictly continuous at λ1(x), λ2(x);

(e) fsoc is semismooth at x ∈ IRn with spectral values λ1(x), λ2(x) ⇐⇒ f is semismooth at λ1(x), λ2(x);

(f ) fsoc is continuously differentiable at x ∈ IRn with spectral values λ1(x), λ2(x) ⇐⇒ f is continuously differentiable at λ1(x), λ2(x).

Note that the projection function x+ onto Kn is not a smooth function on the whole space IRn. From Proposition 2.1, we can construct smoothing functions for the projection x+ onto Kn if we smooth the functions (λi(x))_+ for i = 1, 2. More specifically, we consider a family of smoothing functions φ(µ, ·) : IR → IR with respect to (α)_+ satisfying

lim_{µ↓0} φ(µ, α) = (α)_+   and   0 ≤ ∂φ/∂α (µ, α) ≤ 1   (7)

for all α ∈ IR. Are there functions satisfying the above conditions? Yes, there are many.


We illustrate three of them here:

φ1(µ, α) = ( √(α² + 4µ²) + α ) / 2,   (µ > 0)

φ2(µ, α) = µ ln( e^{α/µ} + 1 ),   (µ > 0)

φ3(µ, α) = α  if α ≥ µ,   φ3(µ, α) = (α + µ)²/(4µ)  if −µ < α < µ,   φ3(µ, α) = 0  if α ≤ −µ.   (µ > 0)

In fact, the functions φ1 and φ2 were considered in [13, 17], while the function φ3 was employed in [18, 37]. In addition, as for the function φ3, there is a more general function φp(µ, ·) : IR → IR given by

φp(µ, α) = α  if α ≥ µ/(p−1),   φp(µ, α) = (µ/(p−1)) [ (p−1)(α + µ)/(pµ) ]^p  if −µ < α < µ/(p−1),   φp(µ, α) = 0  if α ≤ −µ,

where µ > 0 and p ≥ 2. This function φp was recently studied in [9], and it is not hard to verify that φp also satisfies the above conditions (7). All the functions φ1, φ2 and φ3 will play the role of smoothing functions for f(λi(x)) in (4). In other words, based on these smoothing functions, we define a type of SOC-function Φµ(·) on IRn associated with Kn (n ≥ 1) as

Φµ(x) := φ(µ, λ1(x)) u_x^{(1)} + φ(µ, λ2(x)) u_x^{(2)}   ∀x = (x1, x2) ∈ IR × IR^{n−1},   (8)

where λ1(x), λ2(x) are given by (2) and u_x^{(1)}, u_x^{(2)} are given by (3). In light of the properties of φ(µ, α), we show below that the SOC-function Φµ(x) is a smoothing function for the projection function x+ onto Kn.

We depict the graphs of φi(µ, α) for i = 1, 2, 3 in Figure 1. From Figure 1, we see that φ3 is the one that best approximates the function (α)_+, in the sense that it is closest to (α)_+ among all φi(µ, α) for i = 1, 2, 3.
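To make the construction concrete, here is a small Python/NumPy sketch (ours, not the authors' MATLAB code) of φ1, φ2, φ3 and of Φµ in (8), illustrating the smoothing behavior stated in Proposition 2.2(c) below: as µ decreases, Φµ(x) approaches the projection x+.

import numpy as np

def phi1(mu, a): return (np.sqrt(a**2 + 4*mu**2) + a) / 2.0
def phi2(mu, a): return mu * np.logaddexp(0.0, a / mu)        # mu * ln(exp(a/mu) + 1), overflow-safe
def phi3(mu, a):
    if a >= mu:  return a
    if a <= -mu: return 0.0
    return (a + mu)**2 / (4.0 * mu)

def Phi(mu, x, phi):
    # Phi_mu(x) = phi(mu, lambda_1) u^(1) + phi(mu, lambda_2) u^(2) as in (8)
    x1, x2 = x[0], x[1:]
    nx2 = np.linalg.norm(x2)
    lam1, lam2 = x1 - nx2, x1 + nx2
    w = x2 / nx2 if nx2 > 0 else np.eye(len(x2))[0]
    u1 = 0.5 * np.concatenate(([1.0], -w))
    u2 = 0.5 * np.concatenate(([1.0], w))
    return phi(mu, lam1) * u1 + phi(mu, lam2) * u2

x = np.array([0.2, -1.0, 0.5])
x_plus = Phi(1.0, x, lambda mu, a: max(a, 0.0))               # projection via the spectral formula
for mu in (0.5, 0.1, 0.01, 0.001):
    errs = [np.linalg.norm(Phi(mu, x, p) - x_plus) for p in (phi1, phi2, phi3)]
    print(mu, errs)                                           # errors shrink as mu decreases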

Proposition 2.2. Suppose that x = (x1, x2) ∈ IR × IR^{n−1} has the spectral decomposition given as in (1)-(3), and that φ(µ, ·) with µ > 0 is a continuously differentiable function satisfying (7). Then, the following hold.

(a) The function Φµ : IRn → IRn defined as in (8) is continuously differentiable.

Moreover, its Jacobian matrix at x is described as

∂Φµ(x)/∂x = (∂φ/∂α)(µ, x1) I   if x2 = 0,

∂Φµ(x)/∂x = [ b ,  c x2^T/‖x2‖ ;  c x2/‖x2‖ ,  a I + (b − a) x2 x2^T/‖x2‖² ]   if x2 ≠ 0,   (9)


Figure 1: Graphs of max(0, t) and all three φi(µ, t) with µ = 0.2.

where

a = ( φ(µ, λ2(x)) − φ(µ, λ1(x)) ) / ( λ2(x) − λ1(x) ),
b = (1/2) [ (∂φ/∂α)(µ, λ2(x)) + (∂φ/∂α)(µ, λ1(x)) ],
c = (1/2) [ (∂φ/∂α)(µ, λ2(x)) − (∂φ/∂α)(µ, λ1(x)) ];   (10)

(b) Both ∂Φµ(x)/∂x and I − ∂Φµ(x)/∂x are positive semi-definite matrices;

(c) lim_{µ→0} Φµ(x) = x+ = (λ1(x))_+ u_x^{(1)} + (λ2(x))_+ u_x^{(2)}.

Proof. (a) From the expression (8) and the assumption of φ(µ, ·) being continuously differentiable, it is easy to verify that the function Φµ is continuously differentiable. The Jacobian matrix (9) of Φµ(x) can be obtained by adopting the same arguments as in [13, Proposition 5.2]. Hence, we omit the details here.

(b) First, we prove that the matrix ∂Φµ(x)/∂x is positive semi-definite. For the case x2 = 0, we know that ∂Φµ(x)/∂x = (∂φ/∂α)(µ, x1) I. Then, from 0 ≤ (∂φ/∂α)(µ, α) ≤ 1, it is clear that the matrix ∂Φµ(x)/∂x is positive semi-definite. For the case x2 ≠ 0, from (∂φ/∂α)(µ, α) ≥ 0 and (10), we have b ≥ 0. In order to prove that the matrix ∂Φµ(x)/∂x is positive semi-definite, we only need to verify that the Schur complement of b with respect to ∂Φµ(x)/∂x is positive semi-definite. Note that the Schur complement of b has the form

a I + (b − a) x2 x2^T/‖x2‖² − (c²/b) x2 x2^T/‖x2‖² = a ( I − x2 x2^T/‖x2‖² ) + ((b² − c²)/b) x2 x2^T/‖x2‖².

Since (∂φ/∂α)(µ, α) ≥ 0, the function φ(µ, α) is increasing in α, which leads to a ≥ 0. Besides, from (10), we observe that

b² − c² = (∂φ/∂α)(µ, λ2(x)) · (∂φ/∂α)(µ, λ1(x)) ≥ 0.

With this, it follows that the Schur complement of b with respect to ∂Φµ(x)/∂x is a nonnegative linear combination of the matrices x2 x2^T/‖x2‖² and I − x2 x2^T/‖x2‖², both of which are positive semi-definite. Thus, the Schur complement of b is positive semi-definite, which says the matrix ∂Φµ(x)/∂x is positive semi-definite.

Combining this with (∂φ/∂α)(µ, α) ≤ 1 and following similar arguments as above, we can also show that the matrix I − ∂Φµ(x)/∂x is positive semi-definite.

(c) By the definition of the function Φµ(x), it can be verified directly. □

We point out that the definition (8) includes, as a special case, the way smoothing functions are defined in [13, Section 4], and hence [13, Prop. 4.1] is covered by Proposition 2.2. Indeed, Proposition 2.2 can also be verified from a geometric viewpoint. More specifically, from Figures 2, 3 and 4, we see that as µ ↓ 0, φi gets closer to (α)_+, which verifies Proposition 2.2(c).
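The following Python/NumPy sketch (ours, for illustration only) checks the Jacobian formula (9)-(10) for Φµ built from φ1 against a finite-difference Jacobian, and verifies Proposition 2.2(b) numerically via the eigenvalues of the (symmetric) Jacobian.

import numpy as np

mu = 0.2
phi  = lambda a: (np.sqrt(a**2 + 4*mu**2) + a) / 2.0          # phi_1(mu, .)
dphi = lambda a: (a / np.sqrt(a**2 + 4*mu**2) + 1.0) / 2.0    # d phi_1 / d alpha

def Phi(x):
    x1, x2 = x[0], x[1:]; nx2 = np.linalg.norm(x2)
    lam1, lam2 = x1 - nx2, x1 + nx2
    w = x2 / nx2 if nx2 > 0 else np.eye(len(x2))[0]
    u1 = 0.5 * np.concatenate(([1.0], -w)); u2 = 0.5 * np.concatenate(([1.0], w))
    return phi(lam1) * u1 + phi(lam2) * u2

def jac_formula(x):
    # Jacobian (9) with a, b, c as in (10)
    x1, x2 = x[0], x[1:]; nx2 = np.linalg.norm(x2)
    if nx2 == 0:
        return dphi(x1) * np.eye(len(x))
    lam1, lam2 = x1 - nx2, x1 + nx2
    a = (phi(lam2) - phi(lam1)) / (lam2 - lam1)
    b = 0.5 * (dphi(lam2) + dphi(lam1))
    c = 0.5 * (dphi(lam2) - dphi(lam1))
    w = x2 / nx2; P = np.outer(w, w)
    J = np.zeros((len(x), len(x)))
    J[0, 0] = b; J[0, 1:] = c * w; J[1:, 0] = c * w
    J[1:, 1:] = a * (np.eye(len(x2)) - P) + b * P             # a I + (b - a) x2 x2^T / ||x2||^2
    return J

x = np.array([0.3, -0.8, 0.6, 0.1])
J = jac_formula(x)
Jfd = np.column_stack([(Phi(x + 1e-6 * e) - Phi(x - 1e-6 * e)) / 2e-6 for e in np.eye(len(x))])
print(np.allclose(J, Jfd, atol=1e-5))                         # formula matches finite differences
eig = np.linalg.eigvalsh(J)
print(eig.min() >= -1e-10, eig.max() <= 1 + 1e-10)            # Prop 2.2(b): J and I - J are PSD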

Figure 2: Graphs of φ1(µ, α) with µ = 0.01, 0.1, 0.3, 0.5.

3 Applying Φµ to solve the system (5)

In this section, in light of the smoothing vector-valued function Φµ, we reformulate (5) as a system of smoothing equations. To this end, we need the partial order induced by SOC. More specifically, for any x ∈ IRn, using the definition of the partial order "⪯_{Km}" and the projection function x+ in (6), we have

fI(x) ⪯_{Km} 0 ⇐⇒ −fI(x) ∈ Km ⇐⇒ fI(x) ∈ −Km ⇐⇒ (fI(x))_+ = 0.

Hence, the system (5) is equivalent to the following system of equations:

 (fI(x))+ = 0,

fE(x) = 0. (11)


Figure 3: Graphs of φ2(µ, α) with µ = 0.01, 0.1, 0.3, 0.5.

Figure 4: Graphs of φ3(µ, α) with µ = 0.01, 0.1, 0.3, 0.5.

Note that the function (fI(·))_+ in the above equation (11) is nonsmooth. Therefore, smoothing-type Newton methods cannot be directly applied to solve equation (11).

To overcome this, we employ the smoothing function Φµ(·) defined in (8), and define the following function:

F(µ, x, y) := ( fI(x) − y ;  fE(x) ;  Φµ(y) ).


From Proposition 2.2(c), it follows that

F(µ, x, y) = 0 and µ = 0
⇐⇒ y = fI(x), fE(x) = 0, Φµ(y) = 0 and µ = 0
⇐⇒ y = fI(x), fE(x) = 0 and y_+ = 0
⇐⇒ (fI(x))_+ = 0, fE(x) = 0
⇐⇒ fI(x) ⪯_{Km} 0, fE(x) = 0.

In other words, once the system F(µ, x, y) = 0 with µ = 0 is solved, the corresponding x is a solution to the original system (5). In view of Proposition 2.2(a), we can obtain a solution to the system (5) by applying a smoothing-type Newton method to F(µ, x, y) = 0 while driving µ ↓ 0 at the same time. To do this, for any z = (µ, x, y) ∈ IR++ × IRn × IRm, we further define a continuously differentiable function H : IR++ × IRn × IRm → IR++ × IRn × IRm as follows:

H(z) := ( µ ;  fI(x) − y + µ xI ;  fE(x) + µ xE ;  Φµ(y) + µ y ),   (12)

where xI := (x1, x2, ..., xm)^T ∈ IRm, xE := (xm+1, ..., xn)^T ∈ IR^{n−m}, x := (xI^T, xE^T)^T ∈ IRn and y ∈ IRm. Then, it is clear that when H(z) = 0, we have µ = 0 and x is a solution to the system (5). Now, letting H′(z) denote the Jacobian matrix of the function H at z, for any z ∈ IR++ × IRn × IRm we obtain

H′(z) = [ 1 ,  0_n^T ,  0_m^T ;
          xI ,  fI′(x) + µU ,  −Im ;
          xE ,  fE′(x) + µV ,  0_{(n−m)×m} ;
          ∂Φµ(y)/∂µ + y ,  0_{m×n} ,  ∂Φµ(y)/∂y + µIm ],   (13)

where U := [ Im , 0_{m×(n−m)} ], V := [ 0_{(n−m)×m} , I_{n−m} ], 0_l denotes the l-dimensional zero vector, and 0_{l×q} denotes the l × q zero matrix for any positive integers l and q. In summary, we apply a smoothing-type Newton method to the smoothed equation H(z) = 0, keeping µ > 0 at each iteration while driving H(z) → 0, in order to find a solution of the system (5).
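To illustrate (12), here is a small Python/NumPy sketch (ours; the functions fI and fE below are toy data, not from the paper) that assembles H(z) for an instance with m = 2 (one K² inequality block) and one equality, with Φµ built from φ1.

import numpy as np

m, n = 2, 3
def fI(x): return np.array([x[0] - 1.0, x[1] + x[2]])          # toy inequality block (order induced by K^2)
def fE(x): return np.array([x[0] + x[1] + x[2] - 1.0])         # toy equality

def Phi(mu, y):
    # Phi_mu from phi_1 via (8)
    y1, y2 = y[0], y[1:]; ny2 = np.linalg.norm(y2)
    lam = np.array([y1 - ny2, y1 + ny2])
    w = y2 / ny2 if ny2 > 0 else np.eye(len(y2))[0]
    u = [0.5 * np.concatenate(([1.0], -w)), 0.5 * np.concatenate(([1.0], w))]
    phi = lambda a: (np.sqrt(a**2 + 4 * mu**2) + a) / 2.0
    return phi(lam[0]) * u[0] + phi(lam[1]) * u[1]

def H(z):
    # H(z) = (mu; fI(x) - y + mu*xI; fE(x) + mu*xE; Phi_mu(y) + mu*y) as in (12)
    mu, x, y = z[0], z[1:1 + n], z[1 + n:]
    xI, xE = x[:m], x[m:]
    return np.concatenate(([mu], fI(x) - y + mu * xI, fE(x) + mu * xE, Phi(mu, y) + mu * y))

z0 = np.concatenate(([1.0], np.zeros(n), np.zeros(m)))
print(H(z0))     # driving H(z) to 0 (which forces mu = 0) certifies a solution of (5)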

4 A smoothing-type Newton algorithm

Now, we consider a smoothing-type Newton algorithm with a nonmonotone line search, and show that the algorithm is well defined. For convenience, we denote the merit function Ψ by Ψ(z) := ‖H(z)‖² for any z ∈ IR++ × IRn × IRm.


Algorithm 4.1. (A smoothing-type Newton Algorithm)

Step 0 Choose γ ∈ (0, 1), ξ ∈ (0, 1/2). Take η > 0, σ ∈ (0, 1) such that ση < 1. Let µ0 = η and (x0, y0) ∈ IRn × IRm be an arbitrary vector. Set z0 = (µ0, x0, y0), e0 := (1, 0, ..., 0) ∈ IR × IRn × IRm, G0 := ‖H(z0)‖² = Ψ(z0) and S0 := 1. Choose βmin and βmax such that 0 ≤ βmin ≤ βmax < 1. Set τ(z0) := σ min{1, Ψ(z0)} and k := 0.

Step 1 If ‖H(zk)‖ = 0, stop. Otherwise, go to Step 2.

Step 2 Compute ∆zk := (∆µk, ∆xk, ∆yk) ∈ IR × IRn × IRm by

H′(zk) ∆zk = −H(zk) + ητ(zk) e0.   (14)

Step 3 Let αk be the maximum of the values 1, γ, γ², ... such that

Ψ(zk + αk ∆zk) ≤ [1 − 2ξ(1 − ση)αk] Gk.   (15)

Step 4 Set zk+1 := zk + αk ∆zk. If ‖H(zk+1)‖ = 0, stop. Otherwise, go to Step 5.

Step 5 Choose βk ∈ [βmin, βmax]. Set

Sk+1 := βk Sk + 1,
τ(zk+1) := min{ σ, σΨ(zk+1), τ(zk) },
Gk+1 := ( βk Sk Gk + Ψ(zk+1) ) / Sk+1,   (16)

and set k := k + 1. Go to Step 2.

The nonmonotone line search technique in Algorithm 4.1 was introduced in [36]. From the first and third equations in (16), we know that Gk+1 is a convex combination of Gk and Ψ(zk+1). In fact, Gk is expressed as a convex combination of Ψ(z0), Ψ(z1), ..., Ψ(zk).

Moreover, the main role of βk is to control the degree of non-monotonicity. If βk = 0 for every k, then the corresponding line search is the usual monotone Armijo line search.
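A compact Python/NumPy sketch of Algorithm 4.1 is given below (ours, for illustration; the authors' implementation is in MATLAB). For brevity the Jacobian of H is approximated by central differences instead of the explicit H′(z) in (13), and the problem data at the end are a toy K²-inequality instance.

import numpy as np

def num_jac(H, z, h=1e-7):
    # central-difference Jacobian; the paper uses the explicit formula (13) instead
    return np.column_stack([(H(z + h * e) - H(z - h * e)) / (2 * h) for e in np.eye(len(z))])

def smoothing_newton(H, z0, gamma=0.3, xi=1e-4, eta=1.0, sigma=1e-5, beta=0.01, tol=1e-6, maxit=500):
    z = z0.copy()
    Psi = lambda v: np.linalg.norm(H(v))**2
    G, S = Psi(z), 1.0                                  # Step 0
    tau = sigma * min(1.0, Psi(z))
    e0 = np.zeros(len(z)); e0[0] = 1.0
    for _ in range(maxit):
        if np.linalg.norm(H(z)) <= tol:                 # Steps 1 / 4: stopping test
            break
        dz = np.linalg.solve(num_jac(H, z), -H(z) + eta * tau * e0)     # Newton equation (14)
        alpha = 1.0
        while Psi(z + alpha * dz) > (1 - 2 * xi * (1 - sigma * eta) * alpha) * G and alpha > 1e-12:
            alpha *= gamma                              # nonmonotone line search (15)
        z = z + alpha * dz
        S_new = beta * S + 1.0                          # updates (16), with beta_k fixed to beta
        tau = min(sigma, sigma * Psi(z), tau)
        G = (beta * S * G + Psi(z)) / S_new
        S = S_new
    return z

# toy instance with inequalities only (m = n = 2): f(x) = M x + q  <=_{K^2}  0
M = np.array([[2.0, 0.5], [0.5, 1.0]]); q = np.array([1.0, 1.0])
def Phi1(mu, y):
    y1, y2 = y[0], y[1:]; ny2 = np.linalg.norm(y2)
    lam = np.array([y1 - ny2, y1 + ny2])
    w = y2 / ny2 if ny2 > 0 else np.eye(len(y2))[0]
    u = [0.5 * np.concatenate(([1.0], -w)), 0.5 * np.concatenate(([1.0], w))]
    phi = lambda a: (np.sqrt(a**2 + 4 * mu**2) + a) / 2.0
    return phi(lam[0]) * u[0] + phi(lam[1]) * u[1]
def H(z):
    mu, x, y = z[0], z[1:3], z[3:]
    return np.concatenate(([mu], M @ x + q - y + mu * x, Phi1(mu, y) + mu * y))

z = smoothing_newton(H, np.concatenate(([1.0], np.zeros(2), np.zeros(2))))
print(np.linalg.norm(H(z)), z[1:3])    # final residual and a computed x satisfying M x + q  <=_{K^2}  0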

Proposition 4.1. Suppose that the sequences {zk}, {µk}, {Gk}, {Ψ(zk)} and {τ (zk)}

are generated by Algorithm 4.1. Then, the following hold.

(a) The sequence {Gk} is monotonically decreasing and Ψ(zk) ≤ Gk for all k ∈ N;

(b) The sequence {τ (zk)} is monotonically decreasing;

(c) ητ (zk) ≤ µk for all k ∈ N;

(d) The sequence {µk} is monotonically decreasing and µk > 0 for all k ∈ N.

(13)

Proof. The proof is similar to that of Remark 3.1 in [37]; we omit the details. □

Next, we show that Algorithm 4.1 is well-defined and establish its local quadratic convergence. For simplicity, we denote the Jacobian matrix of the function f by

f′(x) := ( fI′(x) ; fE′(x) )

and use the following assumption.

Assumption 4.1. f′(x) + µIn is invertible for any x ∈ IRn and µ ∈ IR++.

We point out that Assumption 4.1 is only a mild condition, and there are many functions satisfying it. For example, if f is a monotone function, then f′(x) is a positive semi-definite matrix for any x ∈ IRn. Thus, Assumption 4.1 is satisfied.

Theorem 4.1. Suppose that f is a continuously differentiable function and Assumption 4.1 is satisfied. Then, Algorithm 4.1 is well-defined.

Proof. In order to show that Algorithm 4.1 is well-defined, we need to prove that the Newton equation (14) is solvable and that the line search (15) is well-defined.

First, we prove that the Newton equation (14) is solvable. By the expression of the Jacobian matrix H′(z) in (13), we see that the determinant of H′(z) satisfies

det(H′(z)) = det( f′(x) + µIn ) · det( ∂Φµ(y)/∂y + µIm )

for any z ∈ IR++ × IRn × IRm. Moreover, from Proposition 2.2(b), we know that ∂Φµ(y)/∂y is positive semi-definite for µ ∈ IR++. Hence, combining this with Assumption 4.1, we obtain that H′(z) is nonsingular for any z ∈ IR++ × IRn × IRm with µ > 0. Applying Proposition 4.1(d), it follows that the Newton equation (14) is solvable.

Secondly, we prove that the line search (15) is well-defined. For notational convenience, we denote

wk(α) := Ψ(zk + α∆zk) − Ψ(zk) − αΨ′(zk)∆zk.

From the Newton equation (14) and the definition of Ψ, we have

Ψ(zk + α∆zk) = wk(α) + Ψ(zk) + αΨ′(zk)∆zk
             = wk(α) + Ψ(zk) + 2α H(zk)^T ( −H(zk) + ητ(zk) e0 )
             ≤ wk(α) + (1 − 2α)Ψ(zk) + 2αητ(zk)‖H(zk)‖.

If Ψ(zk) ≤ 1, then we have ‖H(zk)‖ ≤ 1. Hence, it follows that

τ(zk)‖H(zk)‖ ≤ σΨ(zk)‖H(zk)‖ ≤ σΨ(zk).


If Ψ(zk) > 1, then Ψ(zk) = ‖H(zk)‖² ≥ ‖H(zk)‖, which yields τ(zk)‖H(zk)‖ ≤ σ‖H(zk)‖ ≤ σΨ(zk).

Thus, from all the above, we obtain

Ψ(zk + α∆zk) ≤ wk(α) + (1 − 2α)Ψ(zk) + 2αησΨ(zk)
             = wk(α) + [1 − 2(1 − ση)α]Ψ(zk)   (17)
             ≤ wk(α) + [1 − 2(1 − ση)α]Gk.

Since the function H is continuously differentiable for any z ∈ IR++ × IRn × IRm, we have wk(α) = o(α) for all k ∈ N. Combining this with (17) indicates that the line search (15) is well-defined. □

Theorem 4.2. Suppose that f is a continuously differentiable function and Assumption 4.1 is satisfied. Then the sequence {zk} generated by Algorithm 4.1 is bounded; and any accumulation point of the sequence {xk} is a solution of the system (5).

Proof. The proof is similar to that of [37, Theorem 4.1], and we omit it. □

In Theorem 4.2, we give the global convergence of Algorithm 4.1. Now, we analyze the convergence rate of Algorithm 4.1. We start by introducing the following concepts. A locally Lipschitz function F : IRn → IRm is said to be semismooth (or strongly semismooth) at x ∈ IRn if F is directionally differentiable at x and

F(x + h) − F(x) − V h = o(‖h‖)   (or = O(‖h‖²))

holds for any V ∈ ∂F(x + h), where ∂F(x) is the generalized Jacobian of the function F at x ∈ IRn in the sense of Clarke [2]. Many functions are semismooth, such as convex functions, smooth functions, piecewise linear functions, and so on.

In addition, it is known that the composition of semismooth functions is still a semismooth function, and the composition of strongly semismooth functions is still a strongly semismooth function [12]. From Proposition 2.2(a), we know that Φµ(x) defined by (8) is smooth on IRn.

With the definition (12) of H, by mimicking the arguments in [37, Theorem 5.1], we obtain the local quadratic convergence of Algorithm 4.1.

Theorem 4.3. Suppose that the conditions given in Theorem 4.2 are satisfied, and that z* = (µ*, x*, y*) is an accumulation point of the sequence {zk} generated by Algorithm 4.1.

(a) If all V ∈ ∂H(z*) are nonsingular, then the sequence {zk} converges to z*, and ‖zk+1 − z*‖ = o(‖zk − z*‖), µk+1 = o(µk);

(b) If the functions f and Φµ are such that f′ and Φµ′ are Lipschitz continuous on IRn, then ‖zk+1 − z*‖ = O(‖zk − z*‖²) and µk+1 = O(µk²).

5 Numerical experiments

In this section, we present some numerical examples to demonstrate the efficiency of Algorithm 4.1 for solving the system (5). In our tests, all experiments are done on a PC with a 1.9 GHz CPU and 8.0 GB of RAM, and all program codes are written and run in MATLAB. We point out that if the index set I ∪ E does not consist of exactly n indices, we can adopt an approach similar to the one given in [37]: the system (5) is transformed into a new problem, and we solve the new problem using Algorithm 4.1. By this approach, a solution of the original problem can be found.

Throughout the following experiments, we employ three functions φ1, φ2 and φ3 along with the proposed algorithm to implement each example. Note that, for the function φ1, its corresponding SOC-function Φµ can be alternatively expressed as

Φ̃µ(x) = ( x + √(x² + 4µ² e) ) / 2   with e = (1, 0, · · · , 0)^T ∈ Kn,

where x² = x ◦ x and the square root is taken with respect to the Jordan product. This form is simpler than the Φµ(x) induced from (8). Hence, we adopt it in our implementation. Moreover, the parameters used in the algorithm are chosen as follows:

γ = 0.3, ξ = 10^{−4}, η = 1.0, β0 = 0.01, µ0 = 1.0, S0 = 1.0,

and the parameters c and σ are chosen as listed in Table 1 and Table 4. In the implementation, we stop when ‖H(z)‖ ≤ 10^{−6}, when the step length satisfies ν ≤ 10^{−6}, or when the number of iterations exceeds 500; the starting points are randomly generated from the interval [−1, 1].
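Returning to the alternative expression Φ̃µ above, the following Python/NumPy check (ours, not from the paper; the square root below is the SOC square root taken eigenvalue-wise in the Jordan frame) confirms numerically that Φ̃µ coincides with the Φµ obtained from φ1 via (8).

import numpy as np

def spectral(x):
    x1, x2 = x[0], x[1:]; nx2 = np.linalg.norm(x2)
    lam = np.array([x1 - nx2, x1 + nx2])
    w = x2 / nx2 if nx2 > 0 else np.eye(len(x2))[0]
    return lam, [0.5 * np.concatenate(([1.0], -w)), 0.5 * np.concatenate(([1.0], w))]

def jordan(x, y):
    return np.concatenate(([x @ y], x[0] * y[1:] + y[0] * x[1:]))

def soc_sqrt(z):
    # square root in the Jordan algebra: sqrt(lam_1) u^(1) + sqrt(lam_2) u^(2), for z in K^n
    lam, u = spectral(z)
    return np.sqrt(lam[0]) * u[0] + np.sqrt(lam[1]) * u[1]

def Phi_from_phi1(mu, x):
    lam, u = spectral(x)
    phi = lambda a: (np.sqrt(a**2 + 4 * mu**2) + a) / 2.0
    return phi(lam[0]) * u[0] + phi(lam[1]) * u[1]

mu, x = 0.3, np.array([0.4, -1.1, 0.7])
e = np.eye(len(x))[0]
Phi_tilde = 0.5 * (x + soc_sqrt(jordan(x, x) + 4 * mu**2 * e))
print(np.allclose(Phi_tilde, Phi_from_phi1(mu, x)))    # the two expressions agree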

Now, we present the test examples. We first consider two examples in which the system (5) only includes inequalities, i.e., m = n. Note that a similar way to construct the two examples was given in [25].

Example 5.1. Consider the system (5) with inequalities only, where f(x) := Mx + q ⪯_{Kn} 0 and Kn := K^{n1} × · · · × K^{nr}. Here M is generated by M = BB^T with B ∈ IR^{n×n} being a matrix whose every component is randomly chosen from the interval [0, 1], and q ∈ IRn being a vector whose every component is 1.
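For illustration, the data of Example 5.1 can be generated as in the following Python/NumPy sketch (ours, not the authors' MATLAB code; the seed and the helper name example_5_1 are our own choices), with the sizes and block structure described in the next paragraph.

import numpy as np

def example_5_1(n, ni=10, seed=0):
    # M = B B^T with entries of B uniform on [0, 1]; q is the all-ones vector
    rng = np.random.default_rng(seed)
    B = rng.uniform(0.0, 1.0, size=(n, n))
    M = B @ B.T
    q = np.ones(n)
    blocks = [ni] * (n // ni)          # K^n = K^{n_1} x ... x K^{n_r} with each n_i = ni
    return M, q, blocks

M, q, blocks = example_5_1(500)
print(M.shape, len(blocks))            # (500, 500) and 50 cones of size 10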

For Example 5.1, the tested problems are generated with sizes n = 500, 1000, ..., 4500 and each ni = 10. The random problems of each size are generated 10 times. Besides using the three functions along with Algorithm 4.1 for solving Example 5.1, we have also


n     fun   suc   iter    cpu      res
500   φ1    10    5.000   0.251    8.864e-09
500   φ2    10    7.800   1.496    2.600e-07
500   φ3    10    3.500   0.707    3.762e-07
1000  φ1    10    5.000   0.632    2.165e-08
1000  φ2    10    7.200   5.240    8.657e-08
1000  φ3    10    3.400   3.093    4.853e-07
1500  φ1    9     5.000   1.224    1.537e-07
1500  φ2    9     8.111   13.232   3.124e-07
1500  φ3    9     4.222   8.781    2.706e-07
2000  φ1    10    5.000   2.145    1.599e-07
2000  φ2    10    7.700   24.130   2.234e-07
2000  φ3    10    4.200   16.925   1.923e-07
2500  φ1    9     5.000   3.519    3.897e-08
2500  φ2    9     6.889   34.849   2.016e-07
2500  φ3    9     4.000   27.870   1.479e-07
3000  φ1    10    5.000   5.161    9.769e-08
3000  φ2    10    8.300   69.723   1.714e-07
3000  φ3    10    4.100   45.891   1.608e-07
3500  φ1    7     5.000   7.415    2.226e-07
3500  φ2    7     7.857   102.272  4.037e-07
3500  φ3    7     4.429   75.068   2.334e-07
4000  φ1    9     5.000   9.974    5.795e-08
4000  φ2    9     6.444   106.850  3.132e-07
4000  φ3    9     4.000   98.983   7.743e-08
4500  φ1    8     5.000   13.075   2.374e-07
4500  φ2    8     10.250  240.602  3.115e-07
4500  φ3    8     4.250   147.863  3.070e-07

Table 1: Average performance of Algorithm 4.1 for Example 5.1 (c = 0.01, σ = 10^{−5})

tested it by using the smoothing-type algorithm with the monotone line search which was introduced in [25] (for this case, we choose the function φ1). Table 1 shows the numerical results where


      Non-monotone                           Monotone
n     suc   iter   cpu     res          n     suc   iter   cpu     res
500   10    5.000  0.251   8.864e-09    500   10    5.500  0.289   4.905e-07
1000  10    5.000  0.632   2.165e-08    1000  10    5.500  0.616   7.184e-08
1500  9     5.000  1.224   1.537e-07    1500  9     6.000  1.466   4.654e-09
2000  10    5.000  2.145   1.599e-07    2000  10    6.500  2.866   3.151e-08
2500  9     5.000  3.519   3.897e-08    2500  10    6.000  4.477   4.320e-08
3000  10    5.000  5.161   9.769e-08    3000  10    6.500  7.348   1.743e-07
3500  7     5.000  7.415   2.226e-07    3500  10    8.000  11.957  5.674e-07
4000  9     5.000  9.974   5.795e-08    4000  10    7.000  14.875  2.166e-08
4500  8     5.000  13.075  2.374e-07    4500  10    7.000  19.204  2.433e-08

Table 2: Comparisons of the non-monotone Algorithm 4.1 and the monotone algorithm in [25] for Example 5.1

“fun” denotes the three functions,

“suc” denotes the number that Algorithm 4.1 successfully solves every generated problem,

“iter” denotes the average iteration numbers,

“cpu” denotes the average CPU time in seconds,

“res” denotes the average residual norm kH(z)k for 9 test problems.

The initial points are also randomly generated. In light of “iter” and “cpu” in Table 1, we can conclude that

φ3(µ, α) > φ1(µ, α) > φ2(µ, α)

" means">
where ">" means "better performance". In Table 2, we compare Algorithm 4.1 with the nonmonotone line search against the smoothing-type algorithm with the monotone line search studied in [25]. Although the number of generated problems that Algorithm 4.1 solves successfully is overall smaller than that of the monotone algorithm, our proposed algorithm outperforms it in terms of CPU time and iteration counts. This indicates that Algorithm 4.1 has some advantages over the method with the monotone line search in [25].

Another way to compare the performance of the functions φi(µ, α), i = 1, 2, 3, is via the so-called "performance profile", introduced in [39]. In this approach, we regard Algorithm 4.1 equipped with a smoothing function φi(µ, α), i = 1, 2, 3, as a solver, and assume that there are ns solvers and np test problems from the test set P, which is generated randomly. We are interested in using the iteration number as a performance measure for Algorithm 4.1 with different φi(µ, α). For each problem p and solver s, let

fp,s = iteration number required to solve problem p by solver s.


We employ the performance ratio

r_{p,s} := f_{p,s} / min{ f_{p,s} : s ∈ S },

where S is the set of solvers. We assume that a parameter rM with r_{p,s} ≤ rM for all p, s is chosen, and that r_{p,s} = rM if and only if solver s does not solve problem p. In order to obtain an overall assessment for each solver, we define

ρ_s(τ) := (1/np) size{ p ∈ P : r_{p,s} ≤ τ },

which is called the performance profile of the number of iterations for solver s. Then, ρ_s(τ) is the probability for solver s ∈ S that the performance ratio r_{p,s} is within a factor τ ∈ IR of the best possible ratio.
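The profile ρ_s(τ) can be computed as in the following Python/NumPy sketch (ours, not the authors' code); the iteration counts below are toy numbers, with np.inf marking a failure.

import numpy as np

def performance_profile(f, taus):
    # f[p, s] = iteration count of solver s on problem p (np.inf if s fails on p)
    best = f.min(axis=1, keepdims=True)                 # best solver per problem
    r = f / best                                        # performance ratios r_{p, s}
    return np.array([[np.mean(r[:, s] <= tau) for s in range(f.shape[1])] for tau in taus])

# toy iteration counts for three solvers (phi_1, phi_2, phi_3) on five problems
f = np.array([[5.0, 8.0, 4.0],
              [5.0, 7.0, 3.0],
              [6.0, 9.0, 4.0],
              [5.0, np.inf, 4.0],
              [5.0, 8.0, 5.0]])
taus = np.linspace(1.0, 3.0, 21)
rho = performance_profile(f, taus)
print(rho[0])     # rho_s(1): fraction of problems on which each solver is (one of) the best
print(rho[-1])    # rho_s(3): fraction of problems solved within a factor 3 of the best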

We then test the three functions on Example 5.1. In particular, the random problems of each size are generated 50 times. In order to obtain an overall assessment of the three functions, we are interested in using the number of iterations as a performance measure for Algorithm 4.1 with φ1(µ, α), φ2(µ, α), and φ3(µ, α), respectively. The performance plot based on iteration numbers is presented in Figure 5. From this figure, we also see that φ3(µ, α) working with Algorithm 4.1 has the best numerical performance, followed by φ1(µ, α). In other words, in view of "iteration numbers", we have

φ3(µ, α) > φ1(µ, α) > φ2(µ, α) where “>” means “better performance”.

We are also interested in using the computing time as a performance measure for Algorithm 4.1 with different φi(µ, α), i = 1, 2, 3. The performance plot based on computing time is presented in Figure 6. From this figure, we can also see that the function φ3(µ, α) has the best performance. In other words, in view of "computing time", we have

φ3(µ, α) > φ1(µ, α) > φ2(µ, α) where “>” means “better performance”.

In summary, for Example 5.1, whether the number of iterations or the computing time is taken into account, the function φ3(µ, α) is the best choice for Algorithm 4.1.

Example 5.2. Consider the system (5) with inequalities only, where x ∈ IR^5, K^5 = K^3 × K^2 and

f(x) := ( 24(2x1 − x2)³ + exp(x1 + x3) − 4x4 + x5 ;
          −12(2x1 − x2)³ + 3(3x2 + 5x3)/√(1 + (3x2 + 5x3)²) − 6x4 − 7x5 ;
          −exp(x1 − x3) + 5(3x2 + 5x3)/√(1 + (3x2 + 5x3)²) − 3x4 + 5x5 ;
          4x1 + 6x2 + 3x3 − 1 ;
          −x1 + 7x2 − 5x3 + 2 )  ⪯_{K^5} 0.


Figure 5: Performance profile of iteration numbers for Example 5.1.

Figure 6: Performance profile of computing time for Example 5.1.

This problem is taken from [17].

Example 5.2 is tested 20 times with 20 random starting points. Similar to the case of Example 5.1, besides using Algorithm 4.1 to test Example 5.2, we have also tested it using the monotone smoothing-type algorithm in [25]. From Table 3, we see that there is no big difference in performance between these two algorithms for Example 5.2.

Moreover, Figure 7 shows the performance profile of iteration number in Algorithm 4.1 for Example 5.2 on 100 test problems with random starting points. The three solvers


Non-monotone                       Monotone
suc   iter    cpu    res           suc   iter   cpu    res
20    13.500  0.002  5.835e-08     20    8.750  0.005  1.2510e-07

Table 3: Comparisons of the non-monotone Algorithm 4.1 and the monotone algorithm in [25] for Example 5.2

Figure 7: Performance profile of iteration number for Example 5.2.

correspond to Algorithm 4.1 with φ1(µ, α), φ2(µ, α), and φ3(µ, α), respectively. From this figure, we see that φ3(µ, α) working with Algorithm 4.1 has the best numerical performance, followed by φ2(µ, α). In summary, from the viewpoint of "iteration numbers", we conclude that

φ3(µ, α) > φ2(µ, α) > φ1(µ, α), where “>” means “better performance”.

Example 5.3. Consider the system of equalities and inequalities (5), where f(x) := ( fI(x)^T, fE(x)^T )^T, x ∈ IR^6,


with

fI(x) = ( −x1⁴ ;  3x2³ + 2x2 − x3 − 5x3² ;  −4x2² − 7x3 + 10x3³ ;  −x4³ − x5 ;  x5 + x6 )  ⪯_{K^5 = K^3 × K^2} 0,

fE(x) = 2x1 + 5x2² − 3x3² + 2x4 − x5 x6 − 7.

Example 5.4. Consider the system of equalities and inequalities (5), where f(x) := ( fI(x)^T, fE(x)^T )^T, x ∈ IR^6, with

fI(x) = ( −e^{5x1} + x2 ;  x2 + x3³ ;  −3e^{x4} ;  5x5 − x6 )  ⪯_{K^4 = K^2 × K^2} 0,

fE(x) = ( 3x1 + e^{x2 + x3} − 2x4 − 7x5 + x6 − 3 ;  2x1² + x2 + 3x3 − (x4 − x5)² + 2x6 − 13 ) = 0.

Example 5.5. Consider the system of equalities and inequalities (5), where f(x) := ( fI(x)^T, fE(x)^T )^T, x ∈ IR^7, with

fI(x) = ( 3x1³ ;  x2 − x3 ;  −2(x4 − 1)² ;  sin(x5 + x6) ;  2x6 + x7 )  ⪯_{K^5 = K^2 × K^3} 0,

fE(x) = ( x1 + x2 + 2x3 x4 + sin x5 + cos x6 + 2x7 ;  x1³ + x2 + √(x3² + 3) + 2x4 + x5 + x6 + 6x7 ) = 0.

Table 4 shows the numerical results, including the smoothing function (fun) used to solve each problem, the number (suc) of generated problems that Algorithm 4.1 solves successfully, the parameters c and σ, the average number of iterations (iter), the average CPU time (cpu) in seconds and the average residual norm ‖H(z)‖ (res) for Examples 5.2-5.5 with random initializations, respectively. Performance profiles are provided below.

Figure 8 and Figure 9 are the performance profiles in terms of iteration numbers for Example 5.3 and Example 5.5. From Figure 8, we see that although the best


Exam  fun   suc   c     σ      iter     cpu    res
5.2   φ1    20    5     0.02   13.500   0.002  5.835e-08
5.2   φ2    20    5     0.02   8.450    0.001  5.134e-07
5.2   φ3    20    5     0.02   8.600    0.002  2.260e-07
5.3   φ1    20    1     0.02   21.083   0.009  8.165e-07
5.3   φ2    17    1     0.02   14.647   0.001  2.899e-07
5.3   φ3    17    1     0.02   18.529   0.002  7.167e-07
5.4   φ1    20    0.5   0.002  46.750   0.033  1.648e-07
5.4   φ2    2     0.5   0.002  420.000  0.499  9.964e-07
5.4   φ3    0     0.5   0.002  Fail     Fail   Fail
5.5   φ1    20    0.1   0.002  14.250   0.009  6.251e-07
5.5   φ2    20    0.1   0.002  13.250   0.001  6.532e-07
5.5   φ3    20    0.1   0.002  12.650   0.001  6.016e-07

Table 4: Average performance of Algorithm 4.1 for Examples 5.2-5.5

Figure 8: Performance profile of iteration number for Example 5.3.

probability of the function φ3 is lower, the proportion of problems it can solve within a larger factor is higher than that of the other two. In this case, the difference among the three functions is not obvious. From Figure 9, we can also see that the function φ3 has the best performance.

Figure 9: Performance profile of iteration number for Example 5.5.

In summary, below are our numerical observations and conclusions.

1. Algorithm 4.1 is effective. In particular, the numerical results show that our method is better than the algorithm with the monotone line search studied in [25] when solving the system of inequalities under the order induced by the second-order cone.

2. For Examples 5.1 and 5.2, φ3 performs much better than the others. For the remaining problems, the differences in numerical performance are very marginal.

3. As future topics, it would be interesting to discover more efficient smoothing functions and to apply this type of SOC-function to other optimization problems involving second-order cones.

References

[1] F. Alizadeh and D. Goldfarb, Second-order cone programming, Mathematical Programming, vol. 95, pp. 3-51, 2003.

[2] F.H. Clarke, Optimization and Nonsmooth Analysis, Wiley, New York, 1983.

[3] J.-S. Chen, The convex and monotone functions associated with second-order cone, Optimization, vol. 55, pp. 363-385, 2006.

[4] J.-S. Chen, X. Chen, S.-H. Pan, and J. Zhang, Some characterizations for SOC-monotone and SOC-convex functions, Journal of Global Optimization, vol. 45, pp. 259-279, 2009.


[5] J.-S. Chen, X. Chen, and P. Tseng, Analysis of nonsmooth vector-valued functions associated with second-order cones, Mathematical Programming, vol. 101, pp. 95-117, 2004.

[6] J.-S. Chen and P. Tseng, An unconstrained smooth minimization reformulation of second-order cone complementarity problem, Mathematical Programming, vol. 104, pp. 293-327, 2005.

[7] J.-S. Chen, T.-K. Liao, and S.-H. Pan, Using Schur Complement Theorem to prove convexity of some SOC-functions, Journal of Nonlinear and Convex Analysis, vol. 13, pp. 421-431, 2012.

[8] J.-S. Chen and S.-H. Pan, A survey on SOC complementarity functions and solution methods for SOCPs and SOCCPs, Pacific Journal of Optimization, vol. 8, pp. 33-74, 2012.

[9] J.-S. Chen, C.-H. Ko, Y.-D. Liu, and S.-P. Wang, New smoothing functions for solving a system of equalities and inequalities, Pacific Journal of Optimization, vol. 12, pp. 185-206, 2016.

[10] J.W. Daniel, Newton’s method for nonlinear inequalities, Numerische Mathematik, vol. 21, pp. 381-387, 1973.

[11] J. Faraut and A. Korányi, Analysis on Symmetric Cones, Oxford Mathematical Monographs, Oxford University Press, New York, 1994.

[12] A. Fischer, Solution of monotone complementarity problems with locally Lipschitzian functions, Mathematical Programming, vol. 76, pp. 513-532, 1997.

[13] M. Fukushima, Z.Q. Luo, and P. Tseng, Smoothing functions for second-order cone complementarity problems, SIAM Journal on Optimization, vol. 12, pp. 436-460, 2002.

[14] F. Facchinei and J.S. Pang, Finite-Dimensional Variational Inequalities and Complementarity Problems, Volume-I, Springer, New York, 2003.

[15] Z.-H. Huang, S.-L. Hu, and J. Han, Global convergence of a smoothing algorithm for symmetric cone complementarity problems with a nonmonotone line search, Science in China Series A, vol. 52, pp. 833-848, 2009.

[16] Z.-H. Huang and T. Ni, Smoothing algorithms for complementarity problems over symmetric cones, Computational Optimization Applications, vol. 45, pp. 557-579, 2010.
