
to appear in Journal of Applied Mathematics and Computing, 2016

Numerical comparisons based on four smoothing functions for absolute value equation

B. Saheya1

College of Mathematical Science, Inner Mongolia Normal University, Hohhot 010022, Inner Mongolia, P. R. China

E-mail: saheya@imnu.edu.cn

Cheng-He Yu

Department of Mathematics, National Taiwan Normal University

Taipei 11677, Taiwan. E-mail: 60240031S@ntnu.edu.tw

Jein-Shan Chen 2

Department of Mathematics, National Taiwan Normal University

Taipei 11677, Taiwan. E-mail: jschen@math.ntnu.edu.tw

March 25, 2016

(revised on August 18, 2016)

Abstract. The system of absolute value equations, denoted by AVE, is a non-differentiable NP-hard problem. Many approaches have been proposed during the past decade, and most of them focus on reformulating the AVE as a complementarity problem and then solving it accordingly. Another approach is to recast the AVE as a system of nonsmooth equations and then tackle the nonsmooth equations. In this paper, we follow this path. In particular, we rewrite the AVE as a system of smooth equations and propose four new smoothing functions along with a smoothing-type algorithm to solve the system of equations. The main contribution of this paper focuses on numerical comparisons, which suggest a better choice of smoothing function to work with the smoothing-type algorithm.

1 The author’s work is supported by the Natural Science Foundation of Inner Mongolia (Award Number: 2014MS0119).

2Corresponding author. The author’s work is supported by Ministry of Science and Technology, Taiwan.


Key words. Smoothing function, smoothing algorithm, singular value, convergence.

1 Introduction

The absolute value equation (AVE) is of the form

Ax + B|x| = b, (1)

where A ∈ R^{n×n}, B ∈ R^{n×n}, B ≠ 0, and b ∈ R^n. Here |x| means the componentwise absolute value of the vector x ∈ R^n. When B = −I, where I is the identity matrix, the AVE (1) reduces to the special form:

Ax − |x| = b. (2)

It is known that the AVE (1) was first introduced by Rohn in [18] and recently has been investigated by many researchers, for example, Hu and Huang [5], Jiang and Zhang [7], Ketabchi and Moosaei [8], Mangasarian [9–13], Mangasarian and Meyer [14], Prokopyev [15], and Rohn [20].

In particular, Mangasarian and Meyer [14] show that the AVE (1) is equivalent to the bilinear program, the generalized LCP (linear complementarity problem), and the standard LCP provided 1 is not an eigenvalue of A. With these equivalent reformulations, they also show that the AVE (1) is NP-hard in its general form and provide existence results. Prokopyev [15] further improves the above equivalence, showing that the AVE (1) can be equivalently recast as an LCP without any assumption on A and B, and also provides a relationship with mixed integer programming. In general, if solvable, the AVE (1) can have either a unique solution or multiple (e.g., exponentially many) solutions.

Indeed, various sufficient conditions on solvability and non-solvability of the AVE (1) with unique and multiple solutions are discussed in [14, 15, 19]. Some variants of the AVE, like the absolute value equation associated with the second-order cone and the absolute value programs, are investigated in [3] and [21], respectively.

As for its numerical solvers, many numerical methods for solving the AVEs (1)–(2) have been proposed. A parametric successive linearization algorithm for the AVE (1) that terminates at a point satisfying necessary optimality conditions is studied in [10].

The generalized Newton algorithm for the AVE (2) is investigated in [11], in which it was proved that this algorithm converges linearly from any starting point to the unique solution of the AVE (2) under the condition that ‖A^{-1}‖ < 1/4. The generalized Newton algorithm with semismooth and smoothing Newton steps combined into the algorithm is considered in [23]. The smoothing-type algorithms for solving the AVEs (1)–(2) are studied in [1, 5, 7]. A branch and bound method for the absolute value programs (AVP), which is an extension of the AVE, is studied in [21].

Among the aforementioned approaches, many focus on reformulating the AVE as a complementarity problem and then solving it accordingly. An alternative approach is to recast the AVE as a system of nonsmooth equations and then tackle the nonsmooth equations by applying the nonsmooth Newton algorithm [16] or the smoothing Newton algorithm [17]. In this paper, we follow the latter path. More specifically, we rewrite the AVE as a system of smooth equations and propose four new smoothing functions along with a smoothing-type algorithm to solve the system of equations. To this end, motivated by the approach in [1, 7], we define Hi : R^{n+1} → R^{n+1} as

\[
H_i(\mu, x) = \begin{bmatrix} \mu \\ Ax + B\,\Phi_i(\mu, x) - b \end{bmatrix}
\qquad \text{for } \mu \in \mathbb{R} \text{ and } x \in \mathbb{R}^n, \tag{3}
\]

where Φi : R^{n+1} → R^n is given by

\[
\Phi_i(\mu, x) := \begin{bmatrix} \phi_i(\mu, x_1) \\ \phi_i(\mu, x_2) \\ \vdots \\ \phi_i(\mu, x_n) \end{bmatrix}
\qquad \text{for } \mu \in \mathbb{R} \text{ and } x \in \mathbb{R}^n, \tag{4}
\]

with four different smoothing functions φi : R^2 → R that will be introduced later. The role of φi looks similar to that of the function φp used in [7]. However, they are substantially different. More specifically, the function φp employed in [7] is strongly semismooth on R^2, whereas each φi proposed in this paper is continuously differentiable on R^2. Now, we present the exact form of each function φi, defined as below:

\[
\phi_1(\mu, t) = \mu \left[ \ln\left(1 + e^{t/\mu}\right) + \ln\left(1 + e^{-t/\mu}\right) \right] \tag{5}
\]

\[
\phi_2(\mu, t) =
\begin{cases}
t & \text{if } t \ge \frac{\mu}{2}, \\[2pt]
\frac{t^2}{\mu} + \frac{\mu}{4} & \text{if } -\frac{\mu}{2} < t < \frac{\mu}{2}, \\[2pt]
-t & \text{if } t \le -\frac{\mu}{2}.
\end{cases} \tag{6}
\]

\[
\phi_3(\mu, t) = \sqrt{4\mu^2 + t^2} \tag{7}
\]

\[
\phi_4(\mu, t) =
\begin{cases}
\frac{t^2}{2\mu} & \text{if } |t| \le \mu, \\[2pt]
|t| - \frac{\mu}{2} & \text{if } |t| > \mu.
\end{cases} \tag{8}
\]
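For readers who wish to experiment with these functions, the following NumPy transcription of (5)–(8) is an illustrative sketch (not the authors' Mathematica code; the function names are chosen here for convenience). It also checks numerically that each φi(µ, t) approaches |t| as µ decreases, anticipating Proposition 2.1(b) below.

```python
import numpy as np

# Plain NumPy transcriptions of the smoothing functions (5)-(8).
def phi1(mu, t):
    return mu * (np.log1p(np.exp(t / mu)) + np.log1p(np.exp(-t / mu)))

def phi2(mu, t):
    return np.where(t >= mu / 2, t,
                    np.where(t <= -mu / 2, -t, t**2 / mu + mu / 4))

def phi3(mu, t):
    return np.sqrt(4 * mu**2 + t**2)

def phi4(mu, t):
    return np.where(np.abs(t) <= mu, t**2 / (2 * mu), np.abs(t) - mu / 2)

# As mu decreases, each phi_i approaches |t| (cf. Proposition 2.1(b)).
t = np.linspace(-1.0, 1.0, 201)
for mu in (0.5, 0.1, 0.01):
    gaps = [float(np.max(np.abs(f(mu, t) - np.abs(t))))
            for f in (phi1, phi2, phi3, phi4)]
    print(mu, gaps)
```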

Some of these smoothing functions have appeared in other contexts for other optimization problems, but they are all novel for dealing with the AVE (1). The main idea of this paper is to show that the AVE (1) has a solution if and only if Hi(µ, x) = 0, that each φi is continuously differentiable at any (µ, t) ∈ R_{++} × R, and that lim_{µ↓0} Φi(µ, x) = |x|. Then, with these four new smoothing functions, we consider the smoothing-type algorithm studied in [6, 22] to solve the AVE (1). In other words, we reformulate the AVE (1) as parameterized smooth equations and then employ a smoothing-type algorithm to solve it. In addition, we show that the algorithm is well-defined under the assumption that the minimal singular value of the matrix A is strictly greater than the maximal singular value of the matrix B. We also show that the proposed algorithm is globally and locally quadratically convergent no matter which smoothing function φi is used. Numerical implementations and comparisons based on these four different φi are reported as well.

From the numerical results, we conclude that φ2 is the best choice of smoothing function when applying the proposed smoothing-type algorithm. More detailed reports are given in Section 4.

2 Smoothing Reformulation

In this section, we depict the graphs of φi for i = 1, 2, 3, 4 and investigate their properties.

Then, we show the equivalent reformulation that Hi(µ, x) = 0 if and only if x solves the AVE (1), and discuss the condition that guarantees the unique solvability of the AVE (1). We begin by showing the graphs of φi for i = 1, 2, 3, 4; see Figure 1.


Figure 1: Graphs of |t| and all four φi(µ, t) with µ = 0.1.

From Figure 1, we see that φ2 is the one which best approximates the function |t|, in the sense that it is closest to |t| among all φi for i = 1, 2, 3, 4. To see this, we adopt the max norm to measure the distance between two real-valued functions. In other words, for two given real-valued functions f and g, the distance between them is defined as

‖f − g‖ = max_{t∈R} |f(t) − g(t)|.

Now, for any fixed µ > 0, we know that

lim_{|t|→∞} (φi(µ, t) − |t|) = 0, for i = 1, 2, 3.


This implies that

max_{t∈R} |φi(µ, t) − |t|| = |φi(µ, 0)|, for i = 1, 2, 3.

Since φ1(µ, 0) = (2 ln 2)µ ≈ 1.4µ, φ2(µ, 0) = µ/4, and φ3(µ, 0) = 2µ, we obtain

‖φ1(µ, t) − |t|‖ = (2 ln 2)µ ≈ 1.4µ,   ‖φ2(µ, t) − |t|‖ = µ/4,   ‖φ3(µ, t) − |t|‖ = 2µ.

On the other hand, we see that

lim_{t→∞} |φ4(µ, t) − |t|| = µ/2 and φ4(µ, 0) = 0,

which says

max_{t∈R} |φ4(µ, t) − |t|| = µ/2.

Hence, we obtain

‖φ4(µ, t) − |t|‖ = µ/2.

From all the above, we conclude that

‖φ3(µ, t) − |t|‖ > ‖φ1(µ, t) − |t|‖ > ‖φ4(µ, t) − |t|‖ > ‖φ2(µ, t) − |t|‖.   (9)

This shows that φ2 is the function among φi, i = 1, 2, 3, 4, which best approximates the function |t|. In fact, for fixed µ > 0, there holds the local behavior

φ3(µ, t) > φ1(µ, t) > φ2(µ, t) > |t| > φ4(µ, t).   (10)

A natural question arises here: does the smoothing algorithm based on φ2 perform best among φ1, φ2, φ3, φ4? This will be answered in Section 4.
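As a quick check of the constants used above: on the middle piece of φ2,

\[
\phi_2(\mu, t) - |t| = \frac{t^2}{\mu} + \frac{\mu}{4} - |t| = \frac{\big(|t| - \frac{\mu}{2}\big)^2}{\mu},
\qquad |t| < \frac{\mu}{2},
\]

which is largest at t = 0 with value µ/4, while φ2(µ, t) = |t| outside this interval. Similarly, for φ4 the gap |t| − φ4(µ, t) equals µ/2 − (µ − |t|)²/(2µ) ≤ µ/2 when |t| ≤ µ and equals exactly µ/2 when |t| > µ, which explains the two smallest distances appearing in (9).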

Proposition 2.1. Let φi : R^2 → R for i = 1, 2, 3, 4 be defined as in (5), (6), (7) and (8), respectively. Then, we have

(a) φi is continuously differentiable at any (µ, t) ∈ R_{++} × R;

(b) lim_{µ↓0} φi(µ, t) = |t| for any t ∈ R.

Proof. (a) In order to prove the continuous differentiability of φi, we need to write out the expressions of ∂φi(µ, t)/∂t and ∂φi(µ, t)/∂µ, and then show the continuity of ∂φi(µ, t)/∂t and ∂φi(µ, t)/∂µ.

(i) For i = 1, we compute that

∂φ1(µ, t)/∂t = 1/(1 + e^{−t/µ}) − 1/(1 + e^{t/µ}),

∂φ1(µ, t)/∂µ = [ln(1 + e^{t/µ}) + ln(1 + e^{−t/µ})] + (t/µ) [ −1/(1 + e^{−t/µ}) + 1/(1 + e^{t/µ}) ].


Then, it is clear to see that ∂φ1(µ, t)/∂t and ∂φ1(µ, t)/∂µ are continuous. Hence, φ1 is continuously differentiable.

(ii) For i = 2, we compute that

\[
\frac{\partial \phi_2(\mu, t)}{\partial t} =
\begin{cases}
1 & \text{if } t \ge \frac{\mu}{2}, \\
\frac{2t}{\mu} & \text{if } -\frac{\mu}{2} < t < \frac{\mu}{2}, \\
-1 & \text{if } t \le -\frac{\mu}{2},
\end{cases}
\qquad
\frac{\partial \phi_2(\mu, t)}{\partial \mu} =
\begin{cases}
0 & \text{if } t \ge \frac{\mu}{2}, \\
-\left(\frac{t}{\mu}\right)^2 + \frac{1}{4} & \text{if } -\frac{\mu}{2} < t < \frac{\mu}{2}, \\
0 & \text{if } t \le -\frac{\mu}{2}.
\end{cases}
\]

Then, it can be verified that ∂φ2(µ, t)/∂t and ∂φ2(µ, t)/∂µ are continuous because

lim_{t→µ/2} ∂φ2(µ, t)/∂t = lim_{t→µ/2} 2t/µ = 1,   lim_{t→−µ/2} ∂φ2(µ, t)/∂t = lim_{t→−µ/2} 2t/µ = −1,

and

lim_{t→µ/2} ∂φ2(µ, t)/∂µ = lim_{t→µ/2} [−(t/µ)² + 1/4] = 0,   lim_{t→−µ/2} ∂φ2(µ, t)/∂µ = lim_{t→−µ/2} [−(t/µ)² + 1/4] = 0.

Hence, φ2 is continuously differentiable.

(iii) For i = 3, we compute that

∂φ3(µ, t)/∂t = t/√(4µ² + t²),   ∂φ3(µ, t)/∂µ = 4µ/√(4µ² + t²).

Again, it is clear to see that ∂φ3(µ, t)/∂t and ∂φ3(µ, t)/∂µ are continuous. Hence, φ3 is continuously differentiable.

(iv) For i = 4, we compute that

\[
\frac{\partial \phi_4(\mu, t)}{\partial t} =
\begin{cases}
1 & \text{if } t > \mu, \\
\frac{t}{\mu} & \text{if } -\mu \le t \le \mu, \\
-1 & \text{if } t < -\mu,
\end{cases}
\qquad
\frac{\partial \phi_4(\mu, t)}{\partial \mu} =
\begin{cases}
-\frac{1}{2} & \text{if } t > \mu, \\
-\frac{1}{2}\left(\frac{t}{\mu}\right)^2 & \text{if } -\mu \le t \le \mu, \\
-\frac{1}{2} & \text{if } t < -\mu.
\end{cases}
\]

Then, we conclude that ∂φ4(µ, t)/∂t and ∂φ4(µ, t)/∂µ are continuous by checking

lim_{t→µ} ∂φ4(µ, t)/∂t = lim_{t→µ} t/µ = 1,   lim_{t→−µ} ∂φ4(µ, t)/∂t = lim_{t→−µ} t/µ = −1,

and

lim_{t→µ} ∂φ4(µ, t)/∂µ = lim_{t→µ} [−(1/2)(t/µ)²] = −1/2,   lim_{t→−µ} ∂φ4(µ, t)/∂µ = lim_{t→−µ} [−(1/2)(t/µ)²] = −1/2.

Hence, φ4 is continuously differentiable.

From all the above, we have proved that each φi is continuously differentiable at any (µ, t) ∈ R_{++} × R.

(b) For i = 1, 2, 3, 4, we always have the following:

lim_{µ→0} ∂φi(µ, t)/∂t = 1 if t > 0, and −1 if t < 0,

which verifies part (b). □

For subsequent needs in convergence analysis and numerical implementations, we summarize the gradient of each φi as below.

\[
\nabla \phi_1(\mu, t) =
\begin{bmatrix}
\left[ \ln\left(1 + e^{t/\mu}\right) + \ln\left(1 + e^{-t/\mu}\right) \right] + \dfrac{t}{\mu}\left( \dfrac{-1}{1 + e^{-t/\mu}} + \dfrac{1}{1 + e^{t/\mu}} \right) \\[12pt]
\dfrac{1}{1 + e^{-t/\mu}} - \dfrac{1}{1 + e^{t/\mu}}
\end{bmatrix}.
\]

\[
\nabla \phi_2(\mu, t) = \begin{bmatrix} \xi_1 \\ \xi_2 \end{bmatrix},
\quad \text{where} \quad
\xi_1 =
\begin{cases}
0 & \text{if } t \ge \frac{\mu}{2}, \\
-\left(\frac{t}{\mu}\right)^2 + \frac{1}{4} & \text{if } -\frac{\mu}{2} < t < \frac{\mu}{2}, \\
0 & \text{if } t \le -\frac{\mu}{2},
\end{cases}
\qquad
\xi_2 =
\begin{cases}
1 & \text{if } t \ge \frac{\mu}{2}, \\
\frac{2t}{\mu} & \text{if } -\frac{\mu}{2} < t < \frac{\mu}{2}, \\
-1 & \text{if } t \le -\frac{\mu}{2}.
\end{cases}
\]

\[
\nabla \phi_3(\mu, t) =
\begin{bmatrix}
\dfrac{4\mu}{\sqrt{4\mu^2 + t^2}} \\[12pt]
\dfrac{t}{\sqrt{4\mu^2 + t^2}}
\end{bmatrix}.
\]

\[
\nabla \phi_4(\mu, t) = \begin{bmatrix} v_1 \\ v_2 \end{bmatrix},
\quad \text{where} \quad
v_1 =
\begin{cases}
-\frac{1}{2} & \text{if } t > \mu, \\
-\frac{1}{2}\left(\frac{t}{\mu}\right)^2 & \text{if } -\mu \le t \le \mu, \\
-\frac{1}{2} & \text{if } t < -\mu,
\end{cases}
\qquad
v_2 =
\begin{cases}
1 & \text{if } t > \mu, \\
\frac{t}{\mu} & \text{if } -\mu \le t \le \mu, \\
-1 & \text{if } t < -\mu.
\end{cases}
\]

Here the first component of each gradient is ∂φi(µ, t)/∂µ and the second is ∂φi(µ, t)/∂t.
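Formulas like these are easy to mistype, so it can be worth verifying them numerically; the short check below (an illustrative sketch, using φ3 as the example) compares the stated gradient against central finite differences.

```python
import numpy as np

# Central finite-difference check of the gradient of phi_3 (illustrative only).
def phi3(mu, t):
    return np.sqrt(4 * mu**2 + t**2)

def grad_phi3(mu, t):
    # (d phi_3 / d mu, d phi_3 / d t), in the same order as above
    return np.array([4 * mu, t]) / np.sqrt(4 * mu**2 + t**2)

mu, t, h = 0.3, 0.7, 1e-6
fd = np.array([(phi3(mu + h, t) - phi3(mu - h, t)) / (2 * h),
               (phi3(mu, t + h) - phi3(mu, t - h)) / (2 * h)])
print(np.allclose(fd, grad_phi3(mu, t), atol=1e-8))  # expected: True
```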


In fact, Proposition 2.1 can also be illustrated geometrically. In particular, from Figures 2-5, we see that as µ ↓ 0, φi gets closer to |t|, which verifies Proposition 2.1(b).


Figure 2: Graphs of φ1(µ, t) with µ = 0.01, 0.1, 0.3, 0.5.



Figure 3: Graphs of φ2(µ, t) with µ = 0.01, 0.1, 0.3, 0.5.


Figure 4: Graphs of φ3(µ, t) with µ = 0.01, 0.1, 0.3, 0.5.


Figure 5: Graphs of φ4(µ, t) with µ = 0.01, 0.1, 0.3, 0.5.

Now, in light of Proposition 2.1, we obtain the equivalent reformulation Hi(µ, x) = 0 for the AVE (1).

Proposition 2.2. Let Φi(µ, x) for i = 1, 2, 3, 4 be defined as in (4). Then, we have


(a) Hi(µ, x) = 0 if and only if x solves the AVE (1);

(b) Hi is continuously differentiable on R^{n+1} \ {0} with the Jacobian matrix given by

\[
\nabla H_i(\mu, x) :=
\begin{bmatrix}
1 & 0 \\
B\,\nabla_1 \Phi_i(\mu, x) & A + B\,\nabla_2 \Phi_i(\mu, x)
\end{bmatrix}, \tag{11}
\]

where

\[
\nabla_1 \Phi_i(\mu, x) :=
\begin{bmatrix}
\frac{\partial \phi_i(\mu, x_1)}{\partial \mu} \\
\frac{\partial \phi_i(\mu, x_2)}{\partial \mu} \\
\vdots \\
\frac{\partial \phi_i(\mu, x_n)}{\partial \mu}
\end{bmatrix},
\qquad
\nabla_2 \Phi_i(\mu, x) :=
\begin{bmatrix}
\frac{\partial \phi_i(\mu, x_1)}{\partial x_1} & 0 & \cdots & 0 \\
0 & \frac{\partial \phi_i(\mu, x_2)}{\partial x_2} & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & \cdots & 0 & \frac{\partial \phi_i(\mu, x_n)}{\partial x_n}
\end{bmatrix}.
\]

Proof. This result follows immediately from Proposition 2.1, and the computation of the Jacobian matrix is straightforward. □

For completeness, we also discuss the unique solvability of the AVE (1), which is presumed in our numerical implementations. The following assumption and proposition are both taken from [7]. Assumption 2.3 will also be used to guarantee that ∇Hi(µ, x) is invertible at any (µ, x) ∈ R_{++} × R^n; see Proposition 3.3 in Section 3.

Assumption 2.3. The minimal singular value of the matrix A is strictly greater than the maximal singular value of the matrix B.

Proposition 2.4. ([7, Prop. 2.3]) The AVE (1) is uniquely solvable for any b ∈ Rn if Assumption 2.3 is satisfied.

3 A smoothing-type algorithm

From Proposition 2.2, we know that the AVE (1) is equivalent to Hi(µ, x) = 0. Accordingly, in this section, we consider the smoothing-type algorithm as in [1, 7] to solve Hi(µ, x) = 0. In fact, this type of algorithm has also been proposed for solving other kinds of problems; see [2, 6, 22] and references therein.

Algorithm 3.1. (A Smoothing-Type Algorithm)


Step 0. Choose δ, σ ∈ (0, 1), µ_0 > 0, and x_0 ∈ R^n. Set z_0 := (µ_0, x_0). Denote e_0 := (1, 0) ∈ R × R^n. Choose β > 1 such that (min{1, ‖Hi(z_0)‖})² ≤ βµ_0. Set k := 0.

Step 1. If ‖Hi(z_k)‖ = 0, stop.

Step 2. Set τ_k := min{1, ‖Hi(z_k)‖}, and compute Δz_k := (Δµ_k, Δx_k) ∈ R × R^n by solving

∇Hi(z_k) Δz_k = −Hi(z_k) + (1/β) τ_k² e_0,   (12)

where ∇Hi(·) is defined as in (11).

Step 3. Let α_k be the maximum of the values 1, δ, δ², ... such that

‖Hi(z_k + α_k Δz_k)‖ ≤ [1 − σ(1 − 1/β) α_k] ‖Hi(z_k)‖.   (13)

Step 4. Set z_{k+1} := z_k + α_k Δz_k and k := k + 1. Go back to Step 1.
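As a concrete illustration of how Algorithm 3.1 can be implemented, the following is a rough NumPy sketch specialized to φ2 (the function and variable names are illustrative, the choice β = max{1, 1.01 τ_0²/µ_0} is borrowed from Section 4, and this is not the authors' Mathematica implementation).

```python
import numpy as np

def phi2(mu, t):
    """Smoothing function phi_2 from (6), applied componentwise."""
    return np.where(t >= mu / 2, t,
                    np.where(t <= -mu / 2, -t, t**2 / mu + mu / 4))

def grad_phi2(mu, t):
    """Partial derivatives of phi_2 with respect to mu and t."""
    mid = np.abs(t) < mu / 2
    d_mu = np.where(mid, -(t / mu)**2 + 0.25, 0.0)
    d_t = np.where(mid, 2 * t / mu, np.sign(t))
    return d_mu, d_t

def smoothing_newton(A, B, b, x0, mu0=0.1, delta=0.5, sigma=1e-4,
                     tol=1e-6, max_iter=100):
    """Sketch of Algorithm 3.1 with phi_2; assumes Assumption 2.3 holds."""
    n = len(b)
    H = lambda mu, x: np.concatenate(([mu], A @ x + B @ phi2(mu, x) - b))
    mu, x = mu0, np.asarray(x0, dtype=float)
    beta = max(1.0, 1.01 * min(1.0, np.linalg.norm(H(mu, x)))**2 / mu0)
    e0 = np.zeros(n + 1); e0[0] = 1.0
    for _ in range(max_iter):
        Hk = H(mu, x)
        normHk = np.linalg.norm(Hk)
        if normHk <= tol:
            break
        tau = min(1.0, normHk)
        d_mu, d_t = grad_phi2(mu, x)
        # Jacobian (11): first row (1, 0); below it [B*grad_mu | A + B*diag(grad_t)]
        J = np.zeros((n + 1, n + 1))
        J[0, 0] = 1.0
        J[1:, 0] = B @ d_mu
        J[1:, 1:] = A + B @ np.diag(d_t)
        dz = np.linalg.solve(J, -Hk + (tau**2 / beta) * e0)  # Newton equation (12)
        alpha = 1.0
        for _ in range(60):                                   # backtracking line search (13)
            if np.linalg.norm(H(mu + alpha * dz[0], x + alpha * dz[1:])) <= \
                    (1.0 - sigma * (1.0 - 1.0 / beta) * alpha) * normHk:
                break
            alpha *= delta
        mu, x = mu + alpha * dz[0], x + alpha * dz[1:]
    return x
```

With B = −I, a call like smoothing_newton(A, -np.eye(n), b, x0) would correspond to the special AVE (2); the parameter defaults mirror those listed in Section 4.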

Following the same arguments as in [4, 6], the line search (13) in the above scheme is well-defined. In other words, Algorithm 3.1 is well-defined and possesses some nice properties.

Proposition 3.2. (a) Suppose that Assumption 2.3 holds. Then, Algorithm 3.1 is well-defined.

(b) Let the sequence {z_k} be generated by Algorithm 3.1. Then,

(i) both {‖Hi(z_k)‖} and {τ_k} are monotonically decreasing;

(ii) τ_k² ≤ βµ_k holds for all k;

(iii) the sequence {µ_k} is monotonically decreasing, and µ_k > 0 for all k.

Proof. Please refer to [6, Remark 2.1] or [7, Prop. 3.1]. □

The key point in the above scheme is the solvability of the Newton equation (12) in Step 2. The following result addresses this issue. Since the function Φi plays almost the same role as the function Φp used in [7], Proposition 3.3 below can be obtained by mimicking the arguments in [7, Theorem 3.2]. We omit its proof and only state the result.

Proposition 3.3. Let Hi and ∇Hi be given as in (3) and (11), respectively. Suppose that Assumption 2.3 holds. Then, ∇Hi(µ, x) is invertible at any (µ, x) ∈ R_{++} × R^n.

Next, we discuss the global and local convergence. Again, although the function Φi here is continuously differentiable while the function Φp used in [7] is only semismooth, their roles in the proof are almost the same. Consequently, the arguments for the convergence analysis are almost the same. Hence, we also omit the detailed proof and only present the convergence result.


Proposition 3.4. Suppose that Assumption 2.3 holds and that the sequence {z_k} is generated by Algorithm 3.1. Then,

(a) {z_k} is bounded;

(b) any accumulation point of {z_k} is a solution of the AVE (1);

(c) the whole sequence {z_k} converges to z* with ‖z_{k+1} − z*‖ = o(‖z_k − z*‖) and µ_{k+1} = O(µ_k²).

4 Numerical implementations

In this section, we report the numerical results of Algorithm 3.1 for solving the AVEs (1) and (2). All numerical experiments are carried out in Mathematica 10.0 running on a PC with a 3.00 GHz Intel i5 CPU, 4.00 GB of memory, and the 32-bit Windows 7 operating system.

In our numerical experiments, the stopping criterion for Algorithm 3.1 is ‖Hi(z_k)‖ ≤ 10^{-6}. We also stop the program when the total number of iterations exceeds 100. Throughout the computational experiments, the following parameters are used:

δ = 0.5,   σ = 0.0001,   µ_0 = 0.1,   β = max{1, 1.01 τ_0²/µ_0}.

4.1 Experiments on the AVE Ax − |x| = b

In this subsection, we consider the simplified form of the AVE (2). Consider the ordinary differential equation [24, Example 4.2]:

d²x/dt² − |x| = (1 − t²),   x(0) = −1,   x(1) = 0,   t ∈ [0, 1].   (14)

As explained in [24, Example 4.2], after discretization (by using the finite difference method), the above ODE can be recast as an AVE of the form

Ax − |x| = b,   (15)

where the matrix A is given by

\[
a_{i,j} =
\begin{cases}
-242, & i = j, \\
121, & |i - j| = 1, \\
0, & \text{otherwise}.
\end{cases} \tag{16}
\]
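For illustration, the coefficient matrix in (16) can be assembled in a few lines of NumPy (the right-hand side b, which comes from discretizing the data and boundary values of (14), is not spelled out in the text and is therefore omitted here):

```python
import numpy as np

def build_A(n):
    """Tridiagonal matrix from (16): -242 on the diagonal, 121 off-diagonal."""
    A = -242.0 * np.eye(n)
    off = 121.0 * np.eye(n, k=1)
    return A + off + off.T
```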

We implement the above problems by using φi, i = 1, 2, 3, 4, and n = 2, 5, 10, 20, . . . , 100, respectively. Every starting point x_0 is randomly generated 10 times from a uniform distribution on [−2, 2]. The results are put together in Table 1, where


Table 1: The numerical results for the ordinary differential equation (14)

Dim   Nφ1   Tφ1     Arφ1       Nφ2   Tφ2     Arφ2       Nφ3   Tφ3     Arφ3       Nφ4   Tφ4     Arφ4
  2   5.1   0.0967  3.30E-07   3.9   0.0015  6.92E-08   5.1   0.0016  5.93E-08   4     0.0062  5.99E-08
  5   5.9   0.3697  2.23E-07   4.1   0.0031  7.47E-08   5.6   0.0062  2.21E-08   4.2   0.0016  6.54E-08
 10   6.4   0.4851  2.98E-07   4.3   0.0094  2.10E-07   5.9   0.0031  1.05E-07   4.5   0.0031  4.67E-08
 20   5.2   0.4290  2.41E-07   4.9   0.0078  1.10E-08   6.3   0.0078  2.13E-09   5     0.0094  2.46E-09
 40   8.8   4.4117  4.66E-07   6.1   0.5210  5.28E-08   7.3   0.0172  6.59E-08   6.3   0.0156  1.88E-07
 60   9.1   2.4289  2.31E-07   6.8   0.0281  4.49E-08   9     0.0312  1.20E-08   7.7   0.0312  1.31E-07
 80   9.8   2.0514  3.61E-07   7.4   0.0374  3.21E-10   9.3   0.0452  3.21E-08   9.2   0.0593  3.15E-08
100   9.8   8.2306  4.44E-07   7.8   0.0577  8.78E-08  10     0.0671  2.26E-07   9.5   0.0827  2.83E-08

Dim denotes the size of the problem,

Nφi denotes the average number of iterations,

Tφi denotes the average CPU time in seconds,

Arφi denotes the average value of ‖Hi(z_k)‖ when Algorithm 3.1 stops.

From Table 1, in terms of the average number of iterations, the efficiency of φ2(µ, t) is best, followed by φ4(µ, t), φ3(µ, t), and φ1(µ, t). This is especially true for the high-dimensional cases of the ordinary differential equation (14). In terms of time efficiency, φ2(µ, t) is also better than the other functions. In other words, for the AVE (2) arising from the ODE (14), we have

φ2(µ, t) > φ4(µ, t) > φ3(µ, t) > φ1(µ, t),

where “>” means “better performance”.

To compare the performance of the smoothing functions φi(µ, t), i = 1, 2, 3, 4, we adopt the performance profile introduced in [25] as a means. In other words, we regard Algorithm 3.1 corresponding to a smoothing function φi(µ, t), i = 1, 2, 3, 4, as a solver, and assume that there are n_s solvers and n_p test problems from the test set P, which is generated randomly. We are interested in using the iteration number as the performance measure for Algorithm 3.1 with different φi(µ, t). For each problem p and solver s, let

f_{p,s} = iteration number required to solve problem p by solver s.

We employ the performance ratio

r_{p,s} := f_{p,s} / min{f_{p,s} : s ∈ S},

where S is the set of the four solvers. We assume that a parameter r_M with r_{p,s} ≤ r_M for all p, s is chosen, and that r_{p,s} = r_M if and only if solver s does not solve problem p. In order to obtain an overall assessment for each solver, we define

ρ_s(τ) := (1/n_p) size{p ∈ P : r_{p,s} ≤ τ},

which is called the performance profile of the number of iterations for solver s. Then, ρ_s(τ) is the probability for solver s ∈ S that the performance ratio r_{p,s} is within a factor τ ∈ R of the best possible ratio.
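A minimal NumPy sketch of this construction (a hypothetical helper, not the authors' code; the sample data are made up purely for illustration) is as follows.

```python
import numpy as np

def performance_profile(f, taus):
    """Dolan-More performance profile [25].

    f : array of shape (num_problems, num_solvers) holding f_{p,s}
        (e.g. iteration counts), with np.inf marking a failure.
    Returns rho[j, s] = fraction of problems with ratio r_{p,s} <= taus[j]."""
    r = f / f.min(axis=1, keepdims=True)          # performance ratios r_{p,s}
    return np.array([[np.mean(r[:, s] <= tau) for s in range(f.shape[1])]
                     for tau in taus])

# toy example: 3 problems, 2 solvers
f = np.array([[4.0, 6.0], [5.0, 5.0], [10.0, 7.0]])
print(performance_profile(f, taus=[1.0, 1.5, 2.0]))
```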


We then test the four functions for the ODE (14) at random starting points. In particular, starting points for each dimension are randomly chosen 20 times from a uniform distribution on [−2, 2]. In order to obtain an overall assessment for the four functions, we are interested in using the number of iterations as a performance measure for Algorithm 3.1 with φ1(µ, t), φ2(µ, t), φ3(µ, t), and φ4(µ, t), respectively. The performance plot based on the iteration number is presented in Figure 6. From this figure, we see that φ2(µ, t) working with Algorithm 3.1 has the best numerical performance, followed by φ4(µ, t). In other words, in view of “iteration numbers”, there holds

φ2(µ, t) > φ4(µ, t) > φ3(µ, t) > φ1(µ, t),

where “>” means “better performance”.


Figure 6: Performance profile of iteration numbers of Algorithm 3.1 for the ODE (14).

We are also interested in using the computing time as a performance measure for Algorithm 3.1 with different φi(µ, t), i = 1, 2, 3, 4. The performance plot based on computing time is presented in Figure 7. From this figure, we can also see that the function φ2(µ, t) has the best performance, followed by φ3(µ, t). Note that the time efficiency of φ1(µ, t) is very poor. In other words, in view of “computing time”, there holds

φ2(µ, t) > φ3(µ, t) > φ4(µ, t) > φ1(µ, t),

where “>” means “better performance”.

In summary, for the special AVE (2) arising from the ODE (14), whether the number of iterations or the computing time is taken into account, the function φ2(µ, t) is the best choice for Algorithm 3.1.



Figure 7: Performance profile of computing time of Algorithm 3.1 for the ODE (14).

4.2 Experiments on the general AVE Ax + B|x| = b

In this subsection, we consider the general AVE (1): Ax + B|x| = b. Here the matrix A (and likewise B) is generated as the difference of two random matrices with normally distributed entries, so that we can randomly generate the testing problems.

In order to ensure that Assumption 2.3 holds, we further modify the matrix A in light of the conditions below.

• If min{w_ii : i = 1, . . . , n} = 0 with {u, w, v} = SingularValueDecomposition[A], then we set A = u(w + 0.01 ∗ IdentityMatrix[n])v.

• Set A = [(λ_max(B^T B) + 0.01)/λ_min(A^T A)] A.

Then, it is clear to verify that Assumption 2.3 is satisfied for such A. Moreover, we set p = 2 RandomVariate[NormalDistribution[ ], {n, 1}] and b = Ap + B|p| so that the testing problems are solvable.
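For readers not working in Mathematica, a rough NumPy sketch of this test-problem generation is given below; the helper name is ours, and the rescaling step directly enforces σ_min(A) > σ_max(B), which is a slight variant of the formula in the second bullet above.

```python
import numpy as np

def generate_ave_problem(n, rng=np.random.default_rng(0)):
    """Random AVE test problem Ax + B|x| = b with known solution p."""
    A = rng.standard_normal((n, n)) - rng.standard_normal((n, n))
    B = rng.standard_normal((n, n)) - rng.standard_normal((n, n))
    # first bullet: lift a zero smallest singular value of A
    u, w, vt = np.linalg.svd(A)
    if w.min() == 0.0:
        A = u @ np.diag(w + 0.01) @ vt
    # rescaling (variant): make sigma_min(A) exceed sigma_max(B), cf. Assumption 2.3
    scale = (np.linalg.norm(B, 2) + 0.01) / np.linalg.svd(A, compute_uv=False).min()
    A = scale * A
    p = 2 * rng.standard_normal(n)
    b = A @ p + B @ np.abs(p)
    return A, B, b, p
```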

We implement the above problems for φi, i = 1, 2, 3, 4, and n = 2, 5, 10, 20, . . . , 100, respectively. Every case is randomly generated 10 times for testing. The numerical results are listed in Table 2. From Table 2, in terms of the number of iterations and computation time, the efficiency of φ2(µ, t) is best, followed by φ4(µ, t). The iteration number of φ1(µ, t) is less than that of φ3(µ, t), but the computing time of φ1(µ, t) is more than that of φ3(µ, t).

Figure 8 shows the performance profile of the iteration numbers of Algorithm 3.1 in the range τ ∈ [1, 15] for the four solvers on 100 test problems which are generated randomly. The four solvers correspond to Algorithm 3.1 with φ1(µ, t), φ2(µ, t), φ3(µ, t), and φ4(µ, t), respectively. From this figure, we see that φ2(µ, t) working with Algorithm 3.1 has the best numerical performance, followed by φ4(µ, t).


Table 2: The numerical results of the experiments on the general AVE (1)

Dim   Nφ1    Tφ1     Arφ1      Nφ2    Tφ2     Arφ2      Nφ3     Tφ3     Arφ3      Nφ4    Tφ4     Arφ4
  2    6.2   0.4596  5.00E-7    3.6   0.0031  8.56E-8     7.1   0.0016  1.79E-7    3.9   0       8.04E-8
  5    7.4   0.2246  6.05E-7    4.1   0.0031  8.39E-8     9.6   0.0094  4.73E-7    4.3   0.0016  7.53E-8
 10   10.2   1.0733  2.23E-7    4.3   0.0062  8.26E-8    17.2   0.0187  4.79E-7    4.7   0.0031  7.53E-8
 20   19.8   3.7830  5.00E-7    4.8   0.0062  9.95E-8    26.3   0.0499  1.86E-7    5.9   0.0094  1.06E-7
 30   28.7   5.0575  4.46E-7    5.6   0.0140  1.00E-7    43.2   0.1295  5.22E-8    9.3   0.0265  1.82E-7
 40   38.6   3.0935  6.52E-7    7.1   0.0234  5.60E-8    54.1   0.2137  1.65E-7   11.9   0.0374  9.14E-8
 50   42.7   1.9016  5.37E-7    5.3   0.0218  7.73E-8    61.5   0.3120  1.93E-8   10.4   0.0437  5.88E-8
 60   52.1   2.5272  5.61E-7    6.6   0.0359  5.90E-8    78.7   0.4976  1.05E-8   13.9   0.0718  1.15E-7
 70   60.2   3.7050  6.10E-7    9.9   0.0624  1.12E-7    94.4   0.7332  1.80E-7   18.7   0.1264  1.26E-7
 80   58.0   4.1246  4.31E-7    8.9   0.0640  6.03E-8    98.5   0.8845  3.88E-8   17.5   0.1420  5.35E-8
 90   78.2   11.170  6.28E-7   10.0   0.0905  2.23E-7   114.3   1.2745  1.46E-7   20.9   0.2028  1.46E-7
100   72.2   12.211  4.77E-7    7.5   0.0709  1.62E-7   110.8   1.6477  1.31E-7   16.9   0.1881  1.34E-7


Figure 8: Performance profile of iteration numbers of Algorithm 3.1 for general AVE.

In summary, from the viewpoint of “iteration numbers”, we conclude that

φ2(µ, t) > φ4(µ, t) > φ1(µ, t) > φ3(µ, t),

where “>” means “better performance”.

Finally, we are also interested in using the computing time as a performance measure for Algorithm 3.1 with different φi(µ, t), i = 1, 2, 3, 4. The performance plot based on computing time is presented in Figure 9. From this figure, we can also see that the function φ2(µ, t) has the best performance, followed by φ4(µ, t). Note that the time efficiency of φ1(µ, t) is very poor. Again, from the viewpoint of “computing time”, we conclude that

φ2(µ, t) > φ4(µ, t) > φ3(µ, t) > φ1(µ, t),

where “>” means “better performance”.



Figure 9: Performance profile of computing time of Algorithm 3.1 for general AVE.

5 Conclusion

In this paper, we recast the AVE (1) as a system of smooth equations. Accordingly, we have proposed four smoothing functions along with a smoothing-type algorithm studied in [1, 7] to solve it. As mentioned in Section 2, there holds the local behavior shown in (10):

φ3(µ, t) > φ1(µ, t) > φ2(µ, t) > |t| > φ4(µ, t).

and φ2(µ, t) is the one which best approximates the function |t|, as shown in (9), i.e.,

‖φ3(µ, t) − |t|‖ > ‖φ1(µ, t) − |t|‖ > ‖φ4(µ, t) − |t|‖ > ‖φ2(µ, t) − |t|‖.

Surprisingly, φ2(µ, t) is also the best choice of smoothing function no matter whether the iteration number or the computing time is taken into account. For the “iteration” aspect, the order of numerical performance from good to bad is

φ2(µ, t) > φ4(µ, t) > φ1(µ, t) > φ3(µ, t), for the AVE (1),
φ2(µ, t) > φ4(µ, t) > φ3(µ, t) > φ1(µ, t), for the AVE (2),

whereas for the “time” aspect, the order of numerical performance from good to bad is

φ2(µ, t) > φ4(µ, t) > φ3(µ, t) > φ1(µ, t), for the AVE (1),
φ2(µ, t) > φ3(µ, t) > φ4(µ, t) > φ1(µ, t), for the AVE (2).

In other words, φ2(µ, t) is the best choice of smoothing function to work with the proposed smoothing-type algorithm, while it also best approximates the function |t|. This is a very interesting discovery which may be helpful in other contexts. One future direction is to check whether such a phenomenon occurs in other types of algorithms.


References

[1] L. Caccetta, B. Qu, and G.-L. Zhou, A globally and quadratically convergent method for absolute value equations, Computational Optimization and Applications, vol. 48, pp. 45–58, 2011.

[2] J.-S. Chen, C.-H. Ko, Y.-D. Liu, and S.-P. Wang, New smoothing functions for solving a system of equalities and inequalities, Pacific Journal of Optimization, vol. 12, pp. 185–206, 2016.

[3] S.-L. Hu, Z.-H. Huang, and Q. Zhang, A generalized Newton method for absolute value equations associated with second order cones, Journal of Computational and Applied Mathematics, vol. 235, pp. 1490–1501, 2011.

[4] Z.-H. Huang, Locating a maximally complementary solution of the monotone NCP by using non-interior-point smoothing algorithms, Mathematical Methods of Operations Research, vol. 61, pp. 41–45, 2005.

[5] S.-L. Hu and Z.-H. Huang, A note on absolute value equations, Optimization Letters, vol. 4, pp. 417–424, 2010.

[6] Z.-H. Huang, Y. Zhang, and W. Wu, A smoothing-type algorithm for solving system of inequalities, Journal of Computational and Applied Mathematics, vol. 220, pp. 355–363, 2008.

[7] X. Jiang and Y. Zhang, A smoothing-type algorithm for absolute value equations, Journal of Industrial and Management Optimization, vol. 9, pp. 789–798, 2013.

[8] S. Ketabchi and H. Moosaei, Minimum norm solution to the absolute value equation in the convex case, Journal of Optimization Theory and Applications, vol. 154, pp. 1080–1087, 2012.

[9] O. L. Mangasarian, Absolute value programming, Computational Optimization and Applications, vol. 36, pp. 43–53, 2007.

[10] O. L. Mangasarian, Absolute value equation solution via concave minimization, Optimization Letters, vol. 1, pp. 3–5, 2007.

[11] O. L. Mangasarian, A generalized Newton method for absolute value equations, Optimization Letters, vol. 3, pp. 101–108, 2009.

[12] O. L. Mangasarian, Primal-dual bilinear programming solution of the absolute value equation, Optimization Letters, vol. 6, pp. 1527–1533, 2012.

[13] O. L. Mangasarian, Absolute value equation solution via dual complementarity, Optimization Letters, vol. 7, pp. 625–630, 2013.


[14] O. L. Mangasarian and R. R. Meyer, Absolute value equation, Linear Algebra and Its Applications, vol. 419, pp. 359–367, 2006.

[15] O. A. Prokopyev, On equivalent reformulations for absolute value equations, Computational Optimization and Applications, vol. 44, pp. 363–372, 2009.

[16] L. Qi, Convergence analysis of some algorithms for solving nonsmooth equations, Mathematics of Operations Research, vol. 18, pp. 227–244, 1993.

[17] L. Qi, D. Sun, and G.-L. Zhou, A new look at smoothing Newton methods for nonlinear complementarity problems and box constrained variational inequality problems, Mathematical Programming, vol. 87, pp. 1–35, 2000.

[18] J. Rohn, A theorem of the alternatives for the equation Ax + B|x| = b, Linear and Multilinear Algebra, vol. 52, pp. 421–426, 2004.

[19] J. Rohn, Solvability of systems of interval linear equations and inequalities, in Linear Optimization Problems with Inexact Data, edited by M. Fiedler, J. Nedoma, J. Ramik, J. Rohn, and K. Zimmermann, Springer, pp. 35–77, 2006.

[20] J. Rohn, An algorithm for solving the absolute value equation, Electronic Journal of Linear Algebra, vol. 18, pp. 589–599, 2009.

[21] S. Yamanaka and M. Fukushima, A branch and bound method for the absolute value programs, Optimization, vol. 63, pp. 305–319, 2014.

[22] Y. Zhang and Z.-H. Huang, A nonmonotone smoothing-type algorithm for solving a system of equalities and inequalities, Journal of Computational and Applied Mathematics, vol. 233, pp. 2312–2321, 2010.

[23] C. Zhang and Q.-J. Wei, Global and finite convergence of a generalized Newton method for absolute value equations, Journal of Optimization Theory and Applications, vol. 143, pp. 391–403, 2009.

[24] F. K. Haghani, On generalized Traub’s method for absolute value equations, Journal of Optimization Theory and Applications, vol. 166, pp. 619–625, 2015.

[25] E. D. Dolan and J. J. Moré, Benchmarking optimization software with performance profiles, Mathematical Programming, vol. 91, pp. 201–213, 2002.
