An approximate lower order penalty approach for solving second-order cone linear complementarity problems

Zijun Hao 1
School of Mathematics and Information Science
North Minzu University
Yinchuan 750021, China

Chieu Thanh Nguyen 2
Department of Mathematics
National Taiwan Normal University
Taipei 11677, Taiwan

Jein-Shan Chen 3
Department of Mathematics
National Taiwan Normal University
Taipei 11677, Taiwan

March 24, 2020

(revised on August 12, 2021)

Abstract Based on a class of smoothing approximations to the projection function onto the second-order cone, an approximate lower order penalty approach for solving second-order cone linear complementarity problems (SOCLCPs) is proposed, and four kinds of specific smoothing approximations are considered. In light of this approach, the SOCLCP is approximated by asymptotic lower order penalty equations with a penalty parameter and a smoothing parameter. When the penalty parameter tends to positive infinity and the smoothing parameter monotonically decreases to zero, we show that the solution sequence of the asymptotic lower order penalty equations converges to the solution of the SOCLCP at an exponential rate under a mild assumption. A corresponding algorithm is constructed and numerical results are reported to illustrate the feasibility of this approach. The performance profile of the four specific smoothing approximations is presented, and the generalization of two approximations is also investigated.

1E-mail: zijunhao@126.com. The author’s work is supported by the National Natural Science Foundation of China (Nos. 11661002, 11871383), the Natural Science Fund of Ningxia (No. 2020AAC03236), and the First-class Disciplines Foundation of Ningxia (No. NXYLXK2017B09).

2E-mail: thanhchieu90@gmail.com.

3Corresponding author. E-mail: jschen@math.ntnu.edu.tw. The author’s work is supported by the Ministry of Science and Technology, Taiwan.


Keywords Second-order cone; linear complementarity problem; lower order penalty approach; exponential convergence rate

MSC codes 90C25; 90C30; 90C33

1 Introduction

This paper targets the following second-order cone linear complementarity problem (SOCLCP): find x ∈ IRn such that

x ∈ K, Ax − b ∈ K, xT(Ax − b) = 0, (1)

where A is an n × n matrix, b is a vector in IRn, and K is the Cartesian product of second-order cones (SOCs), also called Lorentz cones [7,18]. In other words,

K := Kn1 × · · · × Knr (2)

with r, n1, . . . , nr ≥ 1, n1 + · · · + nr = n, and

Kni := {(x1, x2) ∈ IR × IRni−1 | ‖x2‖ ≤ x1}, i = 1, . . . , r,

where ‖ · ‖ denotes the Euclidean norm and (x1, x2) := (x1, x2T)T. Note that K1 denotes the set of nonnegative real numbers IR+. The SOCLCP, as an extension of the linear complementarity problem (LCP), has a wide range of applications in linear and quadratic programming, computer science, game theory, economics, finance, engineering, and network equilibrium problems [3,15,17,26,27,30].

During the past several years, many methods have been proposed for solving the SOCLCPs (1)-(2), including the interior-point method [1,28,32], the smoothing Newton method [14,19,24], the smoothing-regularization method [23], the semismooth Newton method [25,33], the merit function method [5,10,12], and the matrix splitting method [22,41], etc. Although the effectiveness of some methods has improved substantially in recent years, many complementarity problems still require efficient and accurate numerical methods. Penalty methods are well known for solving constrained optimization problems and possess many nice properties. More specifically, the l1 exact penalty function method and the lower order penalty function method hold many nice properties and attract much attention [2,20,29,34,39,40]. Smoothed versions of exact penalty methods have also been proposed [35,37,38]. Besides, Wang and Yang [36] focus on the power of the lower order penalty function and propose a power penalty method for solving the LCP based on approximating the LCP by nonlinear equations. They show that, under some mild assumptions, the solution sequence of the nonlinear equations converges to the solution of the LCP at an exponential rate as the penalty parameter tends to positive infinity. Based


on the method in [36], Hao et al. [21] propose a power penalty method for solving the SOCLCP with a single K = Kn, i.e.,

x ∈ Kn, Ax − b ∈ Kn, xT(Ax − b) = 0, (3)

where A ∈ IRn×n and b ∈ IRn. In particular, they consider the power penalty equations:

Ax − α[x]−^{1/k} = b, (4)

where k ≥ 1 and α ≥ 1 are parameters, and

[x]−^{1/k} = [λ1(x)]−^{1/k} u(1)x + [λ2(x)]−^{1/k} u(2)x

with [t]− = max{0, −t} and the spectral decomposition to be introduced later in (5). Under a mild assumption on the matrix A, as α → +∞, the solution sequence of (4) converges to the solution of the SOCLCP (3) at an exponential rate.

In this paper, we further improve and extend the method and the problem studied in [21]. We first generalize [x]−^{1/k} in (4) to the general lower order penalty function [x]−^σ with σ ∈ (0, 1], and then focus on a class of functions approximating [x]−^σ for solving the general SOCLCP (1) instead of the SOCLCP (3) with a single SOC constraint. In addition, we construct a class of functions Φ−(µ, x)^σ that approximate [x]−^σ as µ → 0+. Four kinds of specific smoothing approximations are studied. Theoretically, we prove that the solution sequence of the approximating lower order penalty equations converges to the solution of the SOCLCP (1) at an exponential rate O(α^{−1/σ}) when α → +∞ and µ → 0+. This generalizes all its counterparts in the literature. Moreover, a corresponding algorithm is constructed and numerical results are reported to examine the feasibility of the proposed method. The performance profile of the specific smoothing approximations is presented, and the generalization of two approximations is also investigated.

This paper is organized as follows: In Section 2, we review some properties related to the single SOC, which are the basis for our subsequent analysis. In Section 3, a class of approximating functions for the lower order penalty function is constructed, and four kinds of specific smoothing approximations are investigated. In Section 4, we study the approximating lower order penalty equations for solving the SOCLCP (1) and prove the convergence analysis. In Section 5, a corresponding algorithm is constructed and preliminary numerical experiments are presented. The performance profiles of the four specific smoothing approximations considered and the generalization of two approximations are also discussed. Finally, we draw the conclusion in Section 6.

For simplicity, we denote the interior of the single SOC Kn by int(Kn). For any x, y in IRn, we write x ⪰Kn y if x − y ∈ Kn and write x ≻Kn y if x − y ∈ int(Kn). In other words, we have x ⪰Kn 0 if and only if x ∈ Kn, and x ≻Kn 0 if and only if x ∈ int(Kn). We usually denote (x, y) := (xT, yT)T for the concatenation of two column vectors x, y for simplicity. The notation ‖ · ‖p denotes the usual lp-norm on IRn for any p ≥ 1. In particular, it is the Euclidean norm ‖ · ‖ when p = 2.


2 Preliminary results

In this section, we first recall some basic concepts and preliminary results related to a single SOC K = Kn that will be used in the subsequent analysis. All of the analysis then carries over to the general structure K in (2). For any x = (x1, x2) ∈ IR × IRn−1 and y = (y1, y2) ∈ IR × IRn−1, their Jordan product [7,18] is defined as

x ◦ y := (⟨x, y⟩, y1x2 + x1y2).

We write x + y to mean the usual componentwise addition of vectors and x² to mean x ◦ x. The identity element under this product is e = (1, 0, . . . , 0)T ∈ IRn. It is known that x² ∈ Kn for all x ∈ IRn. Moreover, if x ∈ Kn, then there is a unique vector in Kn, denoted by x^{1/2}, such that (x^{1/2})² = x^{1/2} ◦ x^{1/2} = x. For any x ∈ IRn, we define x⁰ = e if x ≠ 0. For any integer k ≥ 1, we recursively define the powers of an element as x^k = x ◦ x^{k−1}, and define x^{−k} = (x^k)^{−1} if x ∈ int(Kn). The Jordan product is not associative for n > 2, but it is power associative, i.e., x ◦ (x ◦ x) = (x ◦ x) ◦ x. Thus, for any positive integer p, the power x^p is well-defined, and x^{m+n} = x^m ◦ x^n for all positive integers m and n. Note that Kn is not closed under the Jordan product for n > 2.
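As a quick sanity check of these algebraic facts, the following NumPy sketch (the helper name `jordan` is ours, not from the paper) verifies the identity element, commutativity, a nontrivial instance of power associativity (x² ◦ x² = x ◦ x³), and the failure of full associativity for n = 3:

```python
import numpy as np

def jordan(x, y):
    # Jordan product: x ∘ y = (<x, y>, y1*x2 + x1*y2)
    return np.concatenate(([x @ y], y[0] * x[1:] + x[0] * y[1:]))

x = np.array([1.0, 2.0, 0.5])
y = np.array([0.3, -1.0, 2.0])
z = np.array([2.0, 0.0, 1.0])
e = np.array([1.0, 0.0, 0.0])

assert np.allclose(jordan(x, e), x)             # e is the identity element
assert np.allclose(jordan(x, y), jordan(y, x))  # the product is commutative
# power associativity: x^2 ∘ x^2 = x ∘ x^3
x2 = jordan(x, x)
x3 = jordan(x, x2)
assert np.allclose(jordan(x2, x2), jordan(x, x3))
# ... but the Jordan product is not associative for n > 2:
assert not np.allclose(jordan(x, jordan(y, z)), jordan(jordan(x, y), z))
```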

In the following, we recall the spectral decomposition of x with respect to the SOC; see [5,6,7,8,10,11,12,18,19,33]. For x = (x1, x2) ∈ IR × IRn−1, the spectral decomposition of x with respect to the SOC is given by

x = λ1(x) u(1)x + λ2(x) u(2)x, (5)

where, for i = 1, 2,

λi(x) = x1 + (−1)^i ‖x2‖,
u(i)x = (1/2)(1, (−1)^i x2/‖x2‖) if ‖x2‖ ≠ 0, and u(i)x = (1/2)(1, (−1)^i w) if ‖x2‖ = 0, (6)

with w ∈ IRn−1 being any unit vector. The two scalars λ1(x) and λ2(x) are called the spectral values of x, while the two vectors u(1)x and u(2)x are called the spectral vectors of x. Moreover, it is obvious that the spectral decomposition of x ∈ IRn is unique if x2 ≠ 0.
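The decomposition (5)-(6) is straightforward to compute. The sketch below (an illustrative helper of ours, not code from the paper) builds the spectral values and vectors of a vector in IR³ and checks that x is recovered exactly:

```python
import numpy as np

def spectral_decomposition(x):
    """Spectral decomposition (5)-(6) of x = (x1, x2) w.r.t. K^n:
    returns lam1, lam2 and the spectral vectors u1, u2."""
    x1, x2 = x[0], x[1:]
    nx2 = np.linalg.norm(x2)
    if nx2 > 0:
        w = x2 / nx2
    else:
        w = np.zeros_like(x2)
        if w.size > 0:
            w[0] = 1.0              # any unit vector works when x2 = 0
    u1 = 0.5 * np.concatenate(([1.0], -w))
    u2 = 0.5 * np.concatenate(([1.0], w))
    return x1 - nx2, x1 + nx2, u1, u2

x = np.array([1.0, 3.0, -4.0])      # here ||x2|| = 5
lam1, lam2, u1, u2 = spectral_decomposition(x)
assert (lam1, lam2) == (-4.0, 6.0)
assert np.allclose(lam1 * u1 + lam2 * u2, x)   # x is recovered
assert abs(u1 @ u1 - 0.5) < 1e-12              # ||u_i||^2 = 1/2, cf. Prop. 2.1(a)
```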

Some basic properties of the spectral decomposition in the Jordan algebra associated with SOC are stated as below, whose proofs can be found in [6, 7,18, 19].

Proposition 2.1. For any x = (x1, x2) ∈ IR × IRn−1 with the spectral values λ1(x), λ2(x) and spectral vectors u(1)x, u(2)x given as in (6), we have:

(a) u(1)x ◦ u(2)x = 0 and u(i)x ◦ u(i)x = u(i)x, ‖u(i)x‖² = 1/2 for i = 1, 2.

(b) λ1(x), λ2(x) are nonnegative (positive) if and only if x ∈ Kn (x ∈ int(Kn)).

(c) For any x ∈ IRn, x ⪰Kn 0 if and only if ⟨x, y⟩ ≥ 0 for all y ⪰Kn 0.


The spectral decomposition (5)-(6) and Proposition 2.1 indicate that x^k can be written as x^k = λ1(x)^k u(1)x + λ2(x)^k u(2)x. For any x ∈ IRn, let [x]+ denote the projection of x onto Kn, and [x]− the projection of −x onto the dual cone (Kn)∗ of Kn, where the dual cone is defined by (Kn)∗ := {y ∈ IRn | ⟨x, y⟩ ≥ 0, ∀x ∈ Kn}. In fact, by Proposition 2.1, the dual cone of Kn is itself, i.e., (Kn)∗ = Kn. Due to the special structure of Kn, the explicit formula for the projection of x = (x1, x2) ∈ IR × IRn−1 onto Kn is obtained in [14,17,19] as below:

[x]+ = x if x ∈ Kn, [x]+ = 0 if x ∈ −Kn, and [x]+ = u otherwise, where u = ( (x1 + ‖x2‖)/2 , ((x1 + ‖x2‖)/2) · x2/‖x2‖ ).

Similarly, the expression of [x]− can be written out as

[x]− = 0 if x ∈ Kn, [x]− = −x if x ∈ −Kn, and [x]− = v otherwise, where v = ( −(x1 − ‖x2‖)/2 , ((x1 − ‖x2‖)/2) · x2/‖x2‖ ).

It is easy to verify that x = [x]+ − [x]− and

[x]+ = [λ1(x)]+ u(1)x + [λ2(x)]+ u(2)x, [x]− = [λ1(x)]− u(1)x + [λ2(x)]− u(2)x,

where [α]+ = max{0, α} and [α]− = max{0, −α} for α ∈ IR. Thus, it can be seen that [x]+, [x]− ∈ Kn and [x]+ ◦ [x]− = 0.

Putting these analyses on each single SOC Kni, i = 1, . . . , r in (2) together, we can extend them to the general case K = Kn1 × · · · × Knr. More specifically, for any x = (x1, . . . , xr) ∈ IRn1 × · · · × IRnr and y = (y1, . . . , yr) ∈ IRn1 × · · · × IRnr, their Jordan product is defined as

x ◦ y := (x1 ◦ y1, . . . , xr ◦ yr).

Let [x]+, [x]− respectively denote the projection of x onto K and the projection of −x onto the dual cone K∗ = K; then

[x]+ := ([x1]+, . . . , [xr]+), [x]− := ([x1]−, . . . , [xr]−), (7)

where [xi]+, [xi]− for i = 1, . . . , r respectively denote the projection of xi onto the single SOC Kni and the projection of −xi onto (Kni)∗.
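A small numerical illustration of these projection formulas (helper names are ours): computing [x]+ and [x]− via the spectral values and checking x = [x]+ − [x]− and [x]+ ◦ [x]− = 0 on a vector lying outside both Kn and −Kn:

```python
import numpy as np

def soc_projections(x):
    """[x]_+ = [lam1]_+ u1 + [lam2]_+ u2 and [x]_- = [lam1]_- u1 + [lam2]_- u2
    on a single SOC K^n (taking w = 0 still gives the right projections
    in the degenerate case x2 = 0, since then lam1 = lam2 = x1)."""
    x1, x2 = x[0], x[1:]
    nx2 = np.linalg.norm(x2)
    w = x2 / nx2 if nx2 > 0 else np.zeros_like(x2)
    lam = (x1 - nx2, x1 + nx2)
    u = (0.5 * np.concatenate(([1.0], -w)), 0.5 * np.concatenate(([1.0], w)))
    xp = max(lam[0], 0.0) * u[0] + max(lam[1], 0.0) * u[1]
    xm = max(-lam[0], 0.0) * u[0] + max(-lam[1], 0.0) * u[1]
    return xp, xm

def jordan(x, y):
    # Jordan product x ∘ y = (<x, y>, y1*x2 + x1*y2)
    return np.concatenate(([x @ y], y[0] * x[1:] + x[0] * y[1:]))

x = np.array([1.0, 3.0, -4.0])                 # neither in K^3 nor in -K^3
xp, xm = soc_projections(x)
assert np.allclose(xp - xm, x)                 # x = [x]_+ - [x]_-
assert np.allclose(jordan(xp, xm), 0.0)        # [x]_+ ∘ [x]_- = 0
assert xp[0] >= np.linalg.norm(xp[1:]) - 1e-12 # [x]_+ indeed lies in K^3
```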

3 Approximation of projection function with power

This section is devoted to presenting a way to generate smoothing functions for the plus function [t]+ = max{0, t} and the minus function [t]− = max{0, −t} via convolution, as proposed by Chen and Mangasarian [4]. First, we consider a piecewise continuous function d(t) with a finite number of pieces, which is a density (kernel) function. In other words, it satisfies

d(t) ≥ 0 and ∫_{−∞}^{+∞} d(t)dt = 1. (8)

Next, we define ŝ(µ, t) := (1/µ) d(t/µ), where µ is a positive parameter. If ∫_{−∞}^{+∞} |t| d(t)dt < +∞, then a smoothing approximation for [t]+ is formed. In particular,

φ+(µ, t) = ∫_{−∞}^{+∞} (t − s)+ ŝ(µ, s)ds = ∫_{−∞}^{t} (t − s) ŝ(µ, s)ds ≈ [t]+. (9)

The following proposition states the properties of φ+(µ, t), whose proofs can be found in [4, Proposition 2.2].
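As an illustration of the convolution formula (9), the sketch below evaluates φ+(µ, t) by midpoint-rule quadrature for the uniform kernel on [−1/2, 1/2] (the kernel d2 appearing later in Section 3) and compares it with the closed form (t + µ/2)²/(2µ) on (−µ/2, µ/2), which one obtains by carrying out the integral by hand. All helper names are ours:

```python
import numpy as np

def d2(t):
    # uniform density on [-1/2, 1/2]; satisfies (8)
    return np.where(np.abs(t) <= 0.5, 1.0, 0.0)

def phi_plus_numeric(mu, t, m=200000):
    # phi_+(mu, t) = ∫ (t - s)_+ (1/mu) d(s/mu) ds, cf. (9),
    # by the midpoint rule over the support [-mu/2, mu/2] of d(./mu)
    edges = np.linspace(-mu, mu, m + 1)
    s = 0.5 * (edges[:-1] + edges[1:])
    h = 2 * mu / m
    return float(np.sum(np.maximum(t - s, 0.0) * d2(s / mu) / mu) * h)

def phi_plus_exact(mu, t):
    # closed form of (9) for this kernel, obtained by hand
    if t >= mu / 2:
        return t
    if t <= -mu / 2:
        return 0.0
    return (t + mu / 2) ** 2 / (2 * mu)

mu = 0.1
for t in (-1.0, -0.03, 0.0, 0.02, 0.5):
    assert abs(phi_plus_numeric(mu, t) - phi_plus_exact(mu, t)) < 1e-4
```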

Proposition 3.1. Let d(t) be a density function satisfying (8), and let ŝ(µ, t) = (1/µ) d(t/µ) with positive parameter µ. If d(t) is piecewise continuous with a finite number of pieces and ∫_{−∞}^{+∞} |t| d(t)dt < +∞, then the function φ+(µ, t) defined by (9) possesses the following properties.

(a) φ+(µ, t) is continuously differentiable.

(b) −D2µ ≤ φ+(µ, t) − [t]+ ≤ D1µ, where D1 = ∫_{−∞}^{0} |t| d(t)dt and D2 = max{ ∫_{−∞}^{+∞} t d(t)dt, 0 }.

(c) ∂tφ+(µ, t) is bounded, satisfying 0 ≤ ∂tφ+(µ, t) ≤ 1.

From Proposition 3.1(b), we have

lim_{µ→0+} φ+(µ, t) = [t]+

under the assumptions of this proposition. Applying the above way of generating smoothing functions to approximate [t]− = max{0, −t}, which appears in equation (4), we also obtain a smoothing approximation as follows:

φ−(µ, t) = ∫_{−∞}^{−t} (−t − s) ŝ(µ, −s)ds = ∫_{t}^{+∞} (s − t) ŝ(µ, s)ds ≈ [t]−. (10)

Similar to Proposition 3.1, we have the properties below for φ−(µ, t).

Proposition 3.2. Let d(t) and ŝ(µ, t) be as in Proposition 3.1 with the same assumptions. Then the function φ−(µ, t) defined by (10) possesses the following properties.

(a) φ−(µ, t) is continuously differentiable.

(b) −D2µ ≤ φ−(µ, t) − [t]− ≤ D1µ, where D1 = ∫_{0}^{+∞} |t| d(t)dt and D2 = max{ −∫_{−∞}^{+∞} t d(t)dt, 0 }.

(c) ∂tφ−(µ, t) is bounded, satisfying −1 ≤ ∂tφ−(µ, t) ≤ 0.

Similar to Proposition 3.1, we also obtain lim_{µ→0+} φ−(µ, t) = [t]−. Therefore, in view of Propositions 3.1 and 3.2, we know that φ+(µ, t) defined by (9) and φ−(µ, t) defined by (10) are smoothing functions of [t]+ and [t]−, respectively. Accordingly, using the continuity of the composite function and φ+(µ, t) ≥ 0, φ−(µ, t) ≥ 0, we can generate approximating functions (not necessarily smooth) for [t]+^σ and [t]−^σ; see the lemma below.

Lemma 3.1. Under the assumptions of Proposition 3.1, let φ+(µ, t), φ−(µ, t) be the smoothing functions of [t]+, [t]− defined by (9) and (10), respectively. Then, for any σ > 0, we have

(a) lim_{µ→0+} φ+(µ, t)^σ = [t]+^σ,  (b) lim_{µ→0+} φ−(µ, t)^σ = [t]−^σ.

By modifying the smoothing functions used in [4,9,31], we have four specific smoothing functions for [t]− as well:

φ−1(µ, t) = −t + µ ln(1 + e^{t/µ}), (11)

φ−2(µ, t) = 0 if t ≥ µ/2;  (1/(2µ))(µ/2 − t)² if −µ/2 < t < µ/2;  −t if t ≤ −µ/2, (12)

φ−3(µ, t) = (√(4µ² + t²) − t)/2, (13)

φ−4(µ, t) = 0 if t > 0;  t²/(2µ) if −µ ≤ t ≤ 0;  −t − µ/2 if t < −µ, (14)

where the corresponding kernel functions are

d1(t) = e^t/(1 + e^t)²,  d2(t) = 1 if −1/2 ≤ t ≤ 1/2 and 0 otherwise,
d3(t) = 2/(t² + 4)^{3/2},  d4(t) = 1 if −1 ≤ t ≤ 0 and 0 otherwise.
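The four approximations (11)-(14) are easy to tabulate numerically. The sketch below (our own helper names, not code from the paper) checks on a grid, for µ = 0.1, the pointwise ordering φ3 ≥ φ1 ≥ φ2 ≥ [t]− ≥ φ4 stated in (15) below, and the sup-norm gaps (ln 2)µ, µ/8, µ, µ/2 derived afterwards:

```python
import numpy as np

# The four smoothing approximations (11)-(14) of [t]_- = max(0, -t).
def phi1(mu, t):
    return -t + mu * np.log1p(np.exp(t / mu))

def phi2(mu, t):
    return np.where(t >= mu / 2, 0.0,
           np.where(t <= -mu / 2, -t, (mu / 2 - t) ** 2 / (2 * mu)))

def phi3(mu, t):
    return (np.sqrt(4 * mu ** 2 + t ** 2) - t) / 2

def phi4(mu, t):
    return np.where(t > 0, 0.0,
           np.where(t >= -mu, t ** 2 / (2 * mu), -t - mu / 2))

mu = 0.1
t = np.linspace(-1.0, 1.0, 2001)
tm = np.maximum(0.0, -t)                         # [t]_-
# pointwise ordering (15): phi3 >= phi1 >= phi2 >= [t]_- >= phi4
assert np.all(phi3(mu, t) >= phi1(mu, t) - 1e-12)
assert np.all(phi1(mu, t) >= phi2(mu, t) - 1e-12)
assert np.all(phi2(mu, t) >= tm - 1e-12)
assert np.all(tm >= phi4(mu, t) - 1e-12)
# sup-norm gaps: (ln 2)*mu, mu/8, mu, mu/2 for phi1, ..., phi4
gaps = [np.max(np.abs(f(mu, t) - tm)) for f in (phi1, phi2, phi3, phi4)]
```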


Figure 1: Graphs of [t]− and φ−i(µ, t), i = 1, 2, 3, 4, with µ = 0.1.

For those specific functions (11)-(14), they certainly obey Proposition 3.2 and Lemma 3.1. The graphs of [t]− and φ−i(µ, t), i = 1, 2, 3, 4, with µ = 0.1 are depicted in Figure 1. From Figure 1, we see that, for a fixed µ > 0, the function φ−2(µ, t) seems to be the one that best approximates [t]− among all φ−i(µ, t), i = 1, 2, 3, 4. Indeed, for a fixed µ > 0 and all t ∈ IR, we have

φ−3(µ, t) ≥ φ−1(µ, t) ≥ φ−2(µ, t) ≥ [t]− ≥ φ−4(µ, t). (15)

Furthermore, we shall show that φ−2(µ, t) is the function closest to [t]− in the sense of the infinity norm. For any fixed µ > 0, it is clear that

lim_{|t|→∞} |φ−i(µ, t) − [t]−| = 0, i = 1, 2, 3.

The functions φ−i(µ, t) − [t]−, i = 1, 3, have no stationary point other than the unique non-differentiable point t = 0, and φ−2(µ, t) − [t]− is non-zero only on the interval (−µ/2, µ/2), with max_{t∈(−µ/2,µ/2)} |φ−2(µ, t) − [t]−| = φ−2(µ, 0). These imply that

max_{t∈IR} |φ−i(µ, t) − [t]−| = φ−i(µ, 0), i = 1, 2, 3.

Since φ−1(µ, 0) = (ln 2)µ ≈ 0.7µ, φ−2(µ, 0) = µ/8 and φ−3(µ, 0) = µ, we obtain

‖φ−1(µ, t) − [t]−‖∞ = (ln 2)µ, ‖φ−2(µ, t) − [t]−‖∞ = µ/8, ‖φ−3(µ, t) − [t]−‖∞ = µ.

On the other hand, it is obvious that max_{t∈IR} |φ−4(µ, t) − [t]−| = µ/2, which says ‖φ−4(µ, t) − [t]−‖∞ = µ/2.


Figure 2: Graphs of φ−i(µ, t), i = 1, 2, 3, 4, with different µ.

In summary, we have

‖φ−3(µ, t) − [t]−‖∞ > ‖φ−1(µ, t) − [t]−‖∞ > ‖φ−4(µ, t) − [t]−‖∞ > ‖φ−2(µ, t) − [t]−‖∞. (16)

The orderings (15) and (16) indicate the behavior of φ−i(µ, t), i = 1, 2, 3, 4, for fixed µ > 0. When taking µ → 0+, we know lim_{µ→0+} φ−i(µ, t) = [t]−, i = 1, 2, 3, 4, and φ−2(µ, t) is the closest to [t]−, which can be verified by the geometric views depicted in Figure 2.

Remark 3.1. For any µ > 0, σ > 0 and the continuously differentiable φ−(µ, t) defined by (10), it can easily be seen that φ−(µ, t)^σ is a continuous function of t, but may not be differentiable. For example, φ−1(µ, t)^σ and φ−3(µ, t)^σ are continuously differentiable, but φ−2(µ, t)^σ and φ−4(µ, t)^σ are not continuously differentiable for σ = 1/2, since the non-differentiable points are t = µ/2 and t = 0, respectively. Their geometric views are depicted in Figure 3.

With the aforementioned discussions, for any x = (x1, . . . , xr) ∈ IRn1 × · · · × IRnr, we are ready to show how to construct smoothing functions for the vectors [x]+ and [x]− associated with K = Kn1 × · · · × Knr. We start by constructing smoothing functions for the vectors [xi]+, [xi]− on a single SOC Kni, i = 1, . . . , r, since [x]+ and [x]− are given by (7). First, given the smoothing functions φ+, φ− in (9), (10) and xi ∈ IRni, i = 1, . . . , r, we define vector-valued functions Φ+i, Φ−i : IR++ × IRni → IRni, i = 1, . . . , r, as

Φ+i(µ, xi) := φ+(µ, λ1(xi)) u(1)xi + φ+(µ, λ2(xi)) u(2)xi, (17)


Figure 3: Graphs of φ−i(µ, t)^σ, i = 1, 2, 3, 4, with different µ and σ = 1/2.

Φ−i(µ, xi) := φ−(µ, λ1(xi)) u(1)xi + φ−(µ, λ2(xi)) u(2)xi, (18)

where µ ∈ IR++ is a parameter, λ1(xi), λ2(xi) are the spectral values, and u(1)xi, u(2)xi are the spectral vectors of xi. Consequently, Φ+i(µ, xi), Φ−i(µ, xi) are also smooth on IR++ × IRni [8]. Moreover, it is easy to assert that

lim_{µ→0+} Φ+i(µ, xi) = [λ1(xi)]+ u(1)xi + [λ2(xi)]+ u(2)xi = [xi]+, (19)

lim_{µ→0+} Φ−i(µ, xi) = [λ1(xi)]− u(1)xi + [λ2(xi)]− u(2)xi = [xi]−, (20)

which means each function Φ+i(µ, xi), Φ−i(µ, xi) serves as a smoothing function of [xi]+, [xi]− associated with the single SOC Kni, i = 1, . . . , r, respectively. Due to Lemma 3.1, Remark 3.1 and the definitions of Φ+i(µ, xi), Φ−i(µ, xi) in (17), (18), it is not difficult to verify that, for any σ > 0, the two functions below

Φ+i(µ, xi)^σ := φ+(µ, λ1(xi))^σ u(1)xi + φ+(µ, λ2(xi))^σ u(2)xi, (21)

Φ−i(µ, xi)^σ := φ−(µ, λ1(xi))^σ u(1)xi + φ−(µ, λ2(xi))^σ u(2)xi (22)

are continuous functions approximating [xi]+^σ and [xi]−^σ, respectively. In other words,

lim_{µ→0+} Φ+i(µ, xi)^σ = [λ1(xi)]+^σ u(1)xi + [λ2(xi)]+^σ u(2)xi = [xi]+^σ,

lim_{µ→0+} Φ−i(µ, xi)^σ = [λ1(xi)]−^σ u(1)xi + [λ2(xi)]−^σ u(2)xi = [xi]−^σ.


Now we construct smoothing functions for the vectors [x]+ and [x]− associated with the general cone (2). To this end, we define vector-valued functions Φ+, Φ− : IR++ × IRn → IRn as

Φ+(µ, x) := (Φ+1(µ, x1), . . . , Φ+r(µ, xr)), (23)

Φ−(µ, x) := (Φ−1(µ, x1), . . . , Φ−r(µ, xr)), (24)

where Φ+i(µ, xi), Φ−i(µ, xi), i = 1, . . . , r, are defined by (17), (18), respectively. Therefore, from (19), (20) and (7), Φ+(µ, x), Φ−(µ, x) serve as smoothing functions for [x]+, [x]− associated with K = Kn1 × · · · × Knr, respectively. At the same time, from (21), (22),

Φ+(µ, x)^σ := (Φ+1(µ, x1)^σ, . . . , Φ+r(µ, xr)^σ), (25)

Φ−(µ, x)^σ := (Φ−1(µ, x1)^σ, . . . , Φ−r(µ, xr)^σ) (26)

are continuous functions approximating [x]+^σ and [x]−^σ, respectively.

In light of this idea, we establish approximating lower order penalty equations for solving the SOCLCP (1), which will be described in the next section. To end this section, we present a technical lemma for subsequent needs.

Lemma 3.2. Suppose that Φ+(µ, x) and Φ−(µ, x) are defined by (23), (24), respectively, and that Φ+(µ, x)^σ and Φ−(µ, x)^σ are defined for any σ > 0 as in (25), (26), respectively. Then, the following results hold.

(a) Both Φ+(µ, x) and Φ−(µ, x) belong to K.

(b) Both Φ+(µ, x)^σ and Φ−(µ, x)^σ belong to K.

Proof. (a) For any xi ∈ IRni, i = 1, . . . , r, since φ+(µ, λk(xi)) ≥ 0 and φ−(µ, λk(xi)) ≥ 0 for k = 1, 2 from (9), (10), we have Φ+i(µ, xi), Φ−i(µ, xi) ∈ Kni according to the definitions (17), (18). Therefore, the conclusion holds due to the definitions (23), (24) and (2).

(b) From part (a) and σ > 0, we have φ+(µ, λk(xi))^σ ≥ 0, φ−(µ, λk(xi))^σ ≥ 0, k = 1, 2. Applying (25) and (26), the desired result follows. 2
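To make the block-wise construction (22), (26) concrete, here is a small sketch (helper names are ours) that assembles Φ−(µ, x)^σ on K = K² × K³ using the smoothing function φ2 from (12). For µ far below the magnitudes of the spectral values it reproduces [x]−^σ exactly, since φ2(µ, t) agrees with [t]− outside (−µ/2, µ/2):

```python
import numpy as np

def phi2(mu, t):
    # smoothing function (12) for [t]_- = max(0, -t)
    if t >= mu / 2:
        return 0.0
    if t <= -mu / 2:
        return -t
    return (mu / 2 - t) ** 2 / (2 * mu)

def Phi_minus_sigma(mu, x, blocks, sigma):
    """Block-wise construction (26): on each SOC block apply (22),
    i.e. phi_-(mu, lam_k)^sigma along the spectral vectors."""
    out, start = [], 0
    for n in blocks:
        xi = x[start:start + n]; start += n
        x1, x2 = xi[0], xi[1:]
        nx2 = np.linalg.norm(x2)
        w = x2 / nx2 if nx2 > 0 else np.zeros_like(x2)
        u1 = 0.5 * np.concatenate(([1.0], -w))
        u2 = 0.5 * np.concatenate(([1.0], w))
        out.append(phi2(mu, x1 - nx2) ** sigma * u1
                 + phi2(mu, x1 + nx2) ** sigma * u2)
    return np.concatenate(out)

# K = K^2 x K^3; spectral values per block: (-1, 3) and (-4, 6)
x = np.array([1.0, 2.0,   1.0, 3.0, -4.0])
y = Phi_minus_sigma(1e-8, x, blocks=[2, 3], sigma=0.5)
# exact [x]_-^{1/2}: 1^{1/2} * u1 on block 1 and 4^{1/2} * u1 on block 2
assert np.allclose(y, [0.5, -0.5,   1.0, -0.6, 0.8])
```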

4 Approximate lower order penalty approach and convergence analysis

In this section, we propose an approximate lower order penalty approach for solving the SOCLCP (1). To this end, we consider the approximate lower order penalty equations (LOPEs):

Ax − αΦ−(µ, x)^σ = b, (27)

where σ ∈ (0, 1] is a given power parameter, α ≥ 1 is a penalty parameter, and Φ−(µ, x)^σ is defined as in (26). Throughout this section, xµ,α denotes the solution of (27), and, corresponding to the structure of (2), we denote

xµ,α = ((xµ,α)1, . . . , (xµ,α)r) ∈ IRn1 × · · · × IRnr. (28)

For simplicity and without causing confusion, we always denote the spectral values and spectral vectors of (xµ,α)i, i = 1, . . . , r, by λk := λk((xµ,α)i) and u(k) := u(k)(xµ,α)i for k = 1, 2. Accordingly, [λk]− := [λk((xµ,α)i)]− and φ−(µ, λk) := φ−(µ, λk((xµ,α)i)), k = 1, 2, for instance. Note that for the special case σ = 1, the nonlinear function in (27) is always smooth.

Note that the equations (27) are penalized equations corresponding to the SOCLCP (1), because the penalty term αΦ−(µ, x)^σ penalizes the "negative part" of x as µ → 0+. By Lemma 3.2 and from equations (27), it is easy to see that Axµ,α − b ∈ K (noting αΦ−(µ, xµ,α)^σ ∈ K). Our goal is to show that the solution sequence {xµ,α} converges to the solution of the SOCLCP (1) as α → +∞ and µ → 0+. To achieve this, we need the following assumption on the matrix A.

Assumption 4.1. The matrix A is positive definite, but not necessarily symmetric; i.e., there exists a constant a0 > 0 such that

yTAy ≥ a0 ‖y‖², ∀y ∈ IRn. (29)

This assumption just means that Ā = (A + AT)/2 is symmetric positive definite with a0 = λmin(Ā), since yTAy = yTĀy. Here is an example of such an A. Let

A = [ 2 −1 ]
    [ 3  1 ].

It is easy to see that A is positive definite satisfying (29), but not symmetric. Under Assumption 4.1, the SOCLCP (1) has a unique solution and the LOPEs (27) also have a unique solution; see [17,21] for more details.
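The positive definiteness in Assumption 4.1 is cheap to check numerically: a0 can be taken as λmin(Ā). A sketch for the 2 × 2 example above (NumPy, variable names ours):

```python
import numpy as np

A = np.array([[2.0, -1.0],
              [3.0,  1.0]])
A_bar = (A + A.T) / 2                      # symmetric part
a0 = np.linalg.eigvalsh(A_bar).min()       # a0 = lambda_min(A_bar)
assert a0 > 0                              # A is positive definite, cf. (29)
assert not np.allclose(A, A.T)             # ... but not symmetric
# spot-check y^T A y >= a0 ||y||^2 on random vectors
rng = np.random.default_rng(0)
for _ in range(100):
    y = rng.standard_normal(2)
    assert y @ A @ y >= a0 * (y @ y) - 1e-9
```

Here λmin(Ā) = (3 − √5)/2 ≈ 0.382, so (29) holds with a0 ≈ 0.382.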

Proposition 4.1. For any α ≥ 1, σ ∈ (0, 1] and sufficiently small µ, the solution of the LOPEs (27) is bounded; i.e., there exists a positive constant M, independent of xµ,α, µ, α and σ, such that ‖xµ,α‖ ≤ M.

Proof. Multiplying both sides of (27) by xµ,αT, we observe that

xµ,αT A xµ,α = xµ,αT b + α xµ,αT Φ−(µ, xµ,α)^σ = Σ_{i=1}^{r} [ (xµ,α)iT bi + α (xµ,α)iT Φ−i(µ, (xµ,α)i)^σ ] (30)

by (26), (28), and by denoting b = (b1, . . . , br) ∈ IRn1 × · · · × IRnr. To proceed, for each (xµ,α)i, i = 1, . . . , r, we consider three cases to show that the term

Ξi := (xµ,α)iT bi + α (xµ,α)iT Φ−i(µ, (xµ,α)i)^σ

satisfies

Ξi ≤ ‖xµ,α‖ (‖b‖ + 1). (31)


Case 1: (xµ,α)i ∈ Kni. From the Cauchy–Schwarz inequality, the spectral decomposition of (xµ,α)i, and the fact that the norm of a block component is no greater than that of the whole vector, we have

Ξi ≤ ‖(xµ,α)i‖ (‖bi‖ + α‖Φ−i(µ, (xµ,α)i)^σ‖)
≤ ‖xµ,α‖ (‖b‖ + α‖φ−(µ, λ1)^σ u(1) + φ−(µ, λ2)^σ u(2)‖)
≤ ‖xµ,α‖ (‖b‖ + √2 α φ−(µ, 0)^σ), (32)

where the second inequality holds by the definition of Φ−i(µ, (xµ,α)i)^σ in (22), and the last inequality holds by the triangle inequality, the nonnegativity of φ−(µ, 0)^σ from (10), and the fact that φ−(µ, t) is monotonically decreasing in t, since 0 ≤ λ1 ≤ λ2 in this case.

Now, applying Lemma 3.1, we have lim_{µ→0+} φ−(µ, 0)^σ = 0. This means that, for any penalty parameter α, there exists a positive real number ν such that √2 α φ−(µ, 0)^σ ≤ 1 for all µ ∈ (0, ν]. Therefore, from (32), we obtain the conclusion (31).

Case 2: (xµ,α)i ∈ −Kni. In light of Lemma 3.2, we know Φ−i(µ, (xµ,α)i)^σ ∈ Kni, and hence

(xµ,α)iT Φ−i(µ, (xµ,α)i)^σ ≤ 0.

Thus, we have Ξi ≤ (xµ,α)iT bi ≤ ‖(xµ,α)i‖ ‖bi‖ ≤ ‖xµ,α‖ (‖b‖ + 1), which says conclusion (31) holds.

Case 3: (xµ,α)i ∉ Kni ∪ −Kni. In this case, we know that λ1 < 0 < λ2 and [(xµ,α)i]+ = λ2 u(2). From the definition of Φ−i(µ, (xµ,α)i)^σ in (22) and Proposition 2.1, we have

(xµ,α)iT Φ−i(µ, (xµ,α)i)^σ = (λ1 u(1) + λ2 u(2))T (φ−(µ, λ1)^σ u(1) + φ−(µ, λ2)^σ u(2))
= (1/2)(λ1 φ−(µ, λ1)^σ + λ2 φ−(µ, λ2)^σ)
≤ (1/2) λ2 φ−(µ, λ2)^σ
≤ (√2/2) ‖xµ,α‖ φ−(µ, λ2)^σ, (33)

where the first inequality holds due to λ1 φ−(µ, λ1)^σ < 0 < λ2 φ−(µ, λ2)^σ, and the second inequality holds due to (√2/2) λ2 = ‖[(xµ,α)i]+‖ ≤ ‖(xµ,α)i‖ ≤ ‖xµ,α‖. Substituting (33) into Ξi and using the Cauchy–Schwarz inequality, we obtain

Ξi ≤ ‖(xµ,α)i‖ ‖bi‖ + (√2/2) α ‖xµ,α‖ φ−(µ, λ2)^σ
≤ ‖xµ,α‖ ‖b‖ + (√2/2) α ‖xµ,α‖ φ−(µ, λ2)^σ
≤ ‖xµ,α‖ (‖b‖ + (√2/2) α φ−(µ, 0)^σ), (34)

where the third inequality holds by the monotone decrease of φ−(µ, t) in t. Similar to Case 1, for any penalty parameter α, there exists a positive real number ν such that (√2/2) α φ−(µ, 0)^σ ≤ 1 for all µ ∈ (0, ν]. Hence, we reach the conclusion (31) by (34).

From the above three cases, the conclusion (31) holds, which gives an estimate of Ξi. Thus, from (30) and Assumption 4.1, there exists a constant a0 > 0 such that

a0 ‖xµ,α‖² ≤ xµ,αT A xµ,α = Σ_{i=1}^{r} Ξi ≤ r ‖xµ,α‖ (‖b‖ + 1).

This implies ‖xµ,α‖ · (a0 ‖xµ,α‖ − r(‖b‖ + 1)) ≤ 0, and hence ‖xµ,α‖ ≤ (r/a0)(‖b‖ + 1). Taking M = (r/a0)(‖b‖ + 1) completes the proof. 2

It is well known that the affine function g(x) := Ax − b is continuous, and by Proposition 4.1, ‖g(xµ,α)‖ is bounded for any α ≥ 1, σ ∈ (0, 1] and sufficiently small µ. We are able to establish an upper bound for ‖Φ−(µ, xµ,α)‖ in the next proposition. The upper bound also applies to ‖[xµ,α]−‖ (see Remark 4.1), which plays an important role in the convergence analysis. The detailed proof is based on the definition of Φ−i(µ, (xµ,α)i) stated in (18) and uses the same techniques as in [21, Proposition 3.2], by left-multiplying both sides of the i-th block of (27) by Φ−i(µ, (xµ,α)i):

(Axµ,α)i − αΦ−i(µ, (xµ,α)i)^σ = bi.

Therefore, we omit it and only present the result: there exists a positive constant Ci, independent of xµ,α, µ and α, such that

‖Φ−i(µ, (xµ,α)i)‖ ≤ Ci / α^{1/σ} (35)

holds for any α ≥ 1, σ ∈ (0, 1] and sufficiently small µ. By the definition of Φ−(µ, xµ,α) in (24) and setting C = C1 + · · · + Cr, we obtain the following proposition.

Proposition 4.2. For any α ≥ 1, σ ∈ (0, 1] and sufficiently small µ, there exists a positive constant C, independent of xµ,α, µ and α, such that

‖Φ−(µ, xµ,α)‖ ≤ C / α^{1/σ}. (36)

Remark 4.1. For any α ≥ 1, σ ∈ (0, 1] and sufficiently small µ, the i-th (i = 1, . . . , r) component vector (xµ,α)i is fixed, since xµ,α in (28) denotes the solution of (27). For the fixed (xµ,α)i with spectral decomposition (xµ,α)i = λ1 u(1) + λ2 u(2) and the expression Φ−i(µ, (xµ,α)i) = φ−(µ, λ1) u(1) + φ−(µ, λ2) u(2), by taking µ → 0+ in φ−(µ, λ1) and φ−(µ, λ2), we obtain ‖[λ1]− u(1) + [λ2]− u(2)‖ ≤ Ci / α^{1/σ} from (35), which yields

‖[(xµ,α)i]−‖ ≤ Ci / α^{1/σ}. (37)

Also, by setting C = C1 + · · · + Cr, we obtain

‖[xµ,α]−‖ ≤ C / α^{1/σ}. (38)

By using Propositions 4.1, 4.2 and Remark 4.1, we are able to obtain the desired convergence result showing how well the SOCLCP (1) is approximated by the LOPEs (27).


Theorem 4.1. For any α ≥ 1, σ ∈ (0, 1] and sufficiently small µ, let xµ,α be the solution of the LOPEs (27), and let x∗ be the solution of the SOCLCP (1). Then, there exists a positive constant C, independent of x∗, xµ,α, µ and α, such that

‖x∗ − xµ,α‖ ≤ C / α^{1/σ}. (39)

Proof. From (28) and the definition (7), we get xµ,α = [xµ,α]+ − [xµ,α]−, where

[xµ,α]+ = ([(xµ,α)1]+, . . . , [(xµ,α)r]+), [xµ,α]− = ([(xµ,α)1]−, . . . , [(xµ,α)r]−)

respectively denote the projection of xµ,α onto K and the projection of −xµ,α onto K. Therefore, the vector x∗ − xµ,α can be decomposed as

x∗ − xµ,α = x∗ − [xµ,α]+ + [xµ,α]− = rµ,α + [xµ,α]−, (40)

where

rµ,α = x∗ − [xµ,α]+. (41)

Let us consider the estimation of rµ,α. If rµ,α = 0, the inequality (39) is satisfied due to (38) and (40). Therefore, in the following, we only consider rµ,α ≠ 0. Note that the SOCLCP (1) is equivalent to the variational inequality problem (see [17, Proposition 1.1.3]): find x∗ ∈ K such that

(y − x∗)T A x∗ ≥ (y − x∗)T b, ∀y ∈ K. (42)

It is known that [xµ,α]+ ∈ K; by (41), substituting [xµ,α]+ for y in (42) yields

−rµ,αT A x∗ ≥ −rµ,αT b. (43)

Then, multiplying both sides of (27) by rµ,αT yields

rµ,αT A xµ,α − α rµ,αT Φ−(µ, xµ,α)^σ = rµ,αT b. (44)

Adding up (43) and (44) leads to

rµ,αT A(xµ,α − x∗) − α rµ,αT Φ−(µ, xµ,α)^σ ≥ 0. (45)

Applying (41) again, we have

rµ,αT Φ−(µ, xµ,α)^σ = (x∗ − [xµ,α]+)T Φ−(µ, xµ,α)^σ. (46)

Combining (45) and (46), we achieve rµ,αT A(xµ,α − x∗) ≥ α(x∗ − [xµ,α]+)T Φ−(µ, xµ,α)^σ, which says

rµ,αT A(x∗ − xµ,α) ≤ α([xµ,α]+ − x∗)T Φ−(µ, xµ,α)^σ. (47)

Now, using (40) and (47) further gives

(x∗ − xµ,α − [xµ,α]−)T A(x∗ − xµ,α) ≤ α([xµ,α]+ − x∗)T Φ−(µ, xµ,α)^σ,

which implies

(x∗ − xµ,α)T A(x∗ − xµ,α) ≤ [xµ,α]−T A(x∗ − xµ,α) + α([xµ,α]+ − x∗)T Φ−(µ, xµ,α)^σ. (48)

Following (26), (28) and the definition (7), and denoting

Ξi := [(xµ,α)i]−T (A(x∗ − xµ,α))i + α([(xµ,α)i]+ − x∗i)T Φ−i(µ, (xµ,α)i)^σ, (49)

the inequality (48) reduces to

(x∗ − xµ,α)T A(x∗ − xµ,α) ≤ Σ_{i=1}^{r} Ξi. (50)

To proceed, we discuss three cases of (xµ,α)i to prove that the term (49) satisfies

Ξi ≤ (1/α^{1/σ}) (C0 ‖x∗ − xµ,α‖ + c), (51)

where C0 is a positive constant, independent of xµ,α, µ, α, and c ∈ IR++ is to be determined.

Case 1: (xµ,α)i ∈ Kni. In this case, we see that [(xµ,α)i]− = 0 and [(xµ,α)i]+ = (xµ,α)i. Using (49) and the Cauchy–Schwarz inequality, we have

Ξi = α((xµ,α)i − x∗i)T Φ−i(µ, (xµ,α)i)^σ
≤ α ‖(xµ,α)i − x∗i‖ · ‖Φ−i(µ, (xµ,α)i)^σ‖
≤ α ‖xµ,α − x∗‖ · ‖Φ−i(µ, (xµ,α)i)^σ‖
= α ‖x∗ − xµ,α‖ ‖φ−(µ, λ1)^σ u(1) + φ−(µ, λ2)^σ u(2)‖
≤ ‖x∗ − xµ,α‖ √2 α φ−(µ, 0)^σ, (52)

where the second inequality holds by the fact that the norm of a block component is no greater than that of the whole vector, the second equality holds by the definition (22), and the last inequality holds by Proposition 2.1, the triangle inequality, the nonnegativity of φ−(µ, 0)^σ from (10), and the monotone decrease of φ−(µ, t) in t, since 0 ≤ λ1 ≤ λ2 in this case. By Lemma 3.1, we know lim_{µ→0+} φ−(µ, 0)^σ = 0. Therefore, for any α ≥ 1 and σ ∈ (0, 1], there exists a positive real number ν such that √2 α φ−(µ, 0)^σ ≤ 1/α^{1/σ} for all µ ∈ (0, ν]. Thus, we achieve the conclusion (51) by setting C0 = 1.

Case 2: (xµ,α)i ∈ −Kni. In this case, it is clear that [(xµ,α)i]+ = 0, and we have (x∗i)T Φ−i(µ, (xµ,α)i)^σ ≥ 0 since Φ−i(µ, (xµ,α)i)^σ ∈ Kni and x∗i ∈ Kni. Thus, it follows from (49) and the Cauchy–Schwarz inequality that

Ξi ≤ [(xµ,α)i]−T (A(x∗ − xµ,α))i
≤ ‖[(xµ,α)i]−‖ · ‖(A(x∗ − xµ,α))i‖
≤ ‖[(xµ,α)i]−‖ · ‖A(x∗ − xµ,α)‖
≤ (Ci / α^{1/σ}) ‖A‖ ‖x∗ − xµ,α‖, (53)
