Contents lists available atScienceDirect
Journal of Mathematical Analysis and Applications
www.elsevier.com/locate/jmaa
A one-parametric class of merit functions for the symmetric cone complementarity problem
Shaohua Pan
a,1, Jein-Shan Chen
b,∗,2aSchool of Mathematical Sciences, South China University of Technology, Guangzhou 510640, China bDepartment of Mathematics, National Taiwan Normal University, Taipei, Taiwan 11677
a r t i c l e i n f o a b s t r a c t
Article history:
Received 23 June 2008 Available online 5 February 2009 Submitted by H. Frankowska
Keywords:
Symmetric cone complementarity problem Merit function
Jordan algebra Smoothness Lipschitz continuity Cartesian P -properties
In this paper, we extend the one-parametric class of merit functions proposed by Kan- zow and Kleinmichel [C. Kanzow, H. Kleinmichel, A new class of semismooth Newton- type methods for nonlinear complementarity problems, Comput. Optim. Appl. 11 (1998) 227–251] for the nonnegative orthant complementarity problem to the general symmet- ric cone complementarity problem (SCCP). We show that the class of merit functions is continuously differentiable everywhere and has a globally Lipschitz continuous gradient mapping. From this, we particularly obtain the smoothness of the Fischer–Burmeister merit function associated with symmetric cones and the Lipschitz continuity of its gradient. In addition, we also consider a regularized formulation for the class of merit functions which is actually an extension of one of the NCP function classes studied by [C. Kanzow, Y. Ya- mashita, M. Fukushima, New NCP functions and their properties, J. Optim. Theory Appl.
97 (1997) 115–135] to the SCCP. By exploiting the Cartesian P -properties for a nonlinear transformation, we show that the class of regularized merit functions provides a global error bound for the solution of the SCCP, and moreover, has bounded level sets under a rather weak condition which can be satisfied by the monotone SCCP with a strictly feasi- ble point or the SCCP with the joint Cartesian R02-property. All of these results generalize some recent important works in [J.-S. Chen, P. Tseng, An unconstrained smooth minimiza- tion reformulation of the second-order cone complementarity problem, Math. Program. 104 (2005) 293–327; C.-K. Sim, J. Sun, D. Ralph, A note on the Lipschitz continuity of the gradi- ent of the squared norm of the matrix-valued Fischer–Burmeister function, Math. Program.
107 (2006) 547–553; P. Tseng, Merit function for semidefinite complementarity problems, Math. Program. 83 (1998) 159–185] under a unified framework.
©2009 Elsevier Inc. All rights reserved.
1. Introduction
Given a Euclidean Jordan algebra
A = (V, ◦, ·,·)
whereV
is a finite-dimensional vector space over the real fieldR
endowed with the inner product·,·
and “◦
” denotes the Jordan product. LetK
be a symmetric cone inV
and G,
F: V → V
be nonlinear transformations assumed to be continuously differentiable throughout this paper. Consider the symmetric cone complementarity problem (SCCP) of findingζ ∈ V
such thatG
(ζ ) ∈ K,
F(ζ ) ∈ K,
G
(ζ ),
F(ζ )
=
0.
(1)*
Corresponding author.E-mail addresses:shhpan@scut.edu.cn(S. Pan),jschen@math.ntnu.edu.tw(J.-S. Chen).
1 Partially supported by the Doctoral Starting-up Foundation (B13B6050640) of GuangDong Province.
2 Member of Mathematics Division, National Center for Theoretical Sciences, Taipei Office. Partially supported by National Science Council of Taiwan.
0022-247X/$ – see front matter ©2009 Elsevier Inc. All rights reserved.
doi:10.1016/j.jmaa.2009.01.064
The model provides a simple, natural and unified framework for various existing complementarity problems such as the nonnegative orthant nonlinear complementarity problem (NCP), the second-order cone complementarity problem (SOCCP), and the semidefinite complementarity problem (SDCP). In addition, the model itself is closely related to the KKT optimality conditions for the convex symmetric cone program (CSCP):
minimize g
(
x),
subject to
ai,
x=
bi,
i=
1,
2, . . . ,
m,
x∈ K,
(2)where ai
∈ V
, bi∈ R
for i=
1,
2, . . . ,
m, and g: V → R
is a convex twice continuously differentiable function. Therefore, the SCCP has wide applications in engineering, economics, management science and other fields; see [1,11,20,29] and references therein.During the past several years, interior-point methods have been well used for solving the symmetric cone linear pro- gramming problem (SCLP), i.e., the CSCP with g being a linear function (see [7,8,23,24]). However, in view of the wide applications of the SCCP, it is worthwhile to explore other solution methods for the more general CSCP and SCCP. Recently, motivated by the successful applications of the merit function approach in the solution of NCPs, SOCCPs and SDCPs (see, e.g., [4,10,22,28]), some researchers started with the investigation of merit functions or complementarity functions associ- ated with symmetric cones. For example, Liu, Zhang and Wang [21] extended a class of merit functions proposed in [18] to the following special SCCP:
ζ ∈ K,
F(ζ ) ∈ K, ζ,
F(ζ )
=
0;
(3)Kong, Tuncel and Xiu [17] studied the extension of the implicit Lagrangian function proposed by Mangasarian and Solodov [22] to symmetric cones; and Kong, Sun and Xiu [16] proposed a regularized smoothing method for the SCCP (3) based on the natural residual complementarity function associated with symmetric cones. Following this line, in this paper we will consider the extension of the one-parametric class of merit functions proposed by Kanzow and Kleinmichel [14]
and a class of regularized functions based on it.
We define the one-parametric class of vector-valued functions
φ
τ: V × V → V
byφ
τ(
x,
y) :=
x2
+
y2+ ( τ −
2)(
x◦
y)
1/2− (
x+
y),
(4)where
τ ∈ (
0,
4)
is an arbitrary but fixed parameter, x2=
x◦
x, x1/2 is a vector such that(
x1/2)
2=
x, and x+
y means the usual componentwise addition of vectors. Whenτ =
2,φ
τ reduces to the vector-valued Fischer–Burmeister function given byφ
FB(
x,
y) :=
x2
+
y21/2− (
x+
y) ;
(5)whereas as
τ →
0 it will become a multiple of the vector-valued residual functionψ
NR(
x,
y) :=
x− (
x−
y)
+,
where
( ·)
+ denotes the metric projection onK
. In this sense, the one-parametric class of vector-valued functions covers the two popular complementarity functions associated with the symmetric coneK
. In fact, from Lemma 3.1 later, it follows that the functionφ
τ with anyτ ∈ (
0,
4)
is a complementarity function associated withK
, that is,φ
τ(
x,
y) =
0⇐⇒
x∈ K,
y∈ K,
x,
y=
0.
Consequently, its squared norm yields a merit function associated with
K ψ
τ(
x,
y) :=
12
φ
τ(
x,
y)
2,
(6)where is the norm induced by
·,·
, and the SCCP can be reformulated as minζ∈V fτ
(ζ ) := ψ
τG
(ζ ),
F(ζ )
.
(7)To apply the effective unconstrained optimization methods, such as the quasi-Newton method, the trust-region method and the conjugate gradient method, for solving the unconstrained minimization reformulation (7) of the SCCP, the smooth- ness of the merit function
ψ
τ and the Lipschitz continuity of its gradient will play an important role. In Sections 3 and 4, we show that the functionψ
τ defined by (6) is continuously differentiable everywhere and has a globally Lipschitz continuous gradient with the Lipschitz constant being a positive multiple of 1+ τ
−1. These results generalize some recent important works in [4,25,28] under a unified framework, as well as improve the work [21] greatly in which only the differentiability of the merit functionψ
FB was given.In addition, we also consider a class of regularized functions for fτ defined as
ˆ
fτ(ζ ) := ψ
0 G(ζ ) ◦
F(ζ ) + ψ
τ G(ζ ),
F(ζ )
,
(8)where
ψ
0: V → R
+ is continuously differentiable and satisfiesψ
0(
u) =
0∀
u∈ −K
andψ
0(
u) β (
u)
+∀
u∈ V
(9)for some constant
β >
0. Using the properties ofψ
0 in (9), it is not hard to verify thatˆ
fτ is a merit function for the SCCP.The class of functions will reduce to the one studied in [21] if
τ =
2 and G degenerates into an identity transformation. In Section 5, we show that the class of merit functions can provide a global error bound for the solution of the SCCP under the condition that G and F have the joint uniform Cartesian P -property. In Section 6, we establish the boundedness of the level sets ofˆ
fτ under a weaker condition than the one used by [21], which can be satisfied by the monotone SCCP with a strictly feasible point or the SCCP with G and F having the joint Cartesian R02-property.Throughout this paper,
I
denotes an identity operator, represents the norm induced by the inner product·,·
, and int( K)
denotes the interior of the symmetric coneK
. All vectors are column ones and write the column vector(
x1T, . . . ,
xmT)
T as(
x1, . . . ,
xm)
, where xi is a column vector from the subspaceV
i. For any x∈ V
, we denote(
x)
+ and(
x)
− by the metric projection of x ontoK
and−K
, respectively, i.e.,(
x)
+:=
arg miny∈K{
x−
y}
. For any symmetric matrix A, the notation AO means that A is positive semidefinite. For a differentiable mapping F: V → V
, the notation∇
F(
x)
denotes the transposed Jacobian operator of F at a point x. We write x=
o( α )
(respectively, x=
O( α )
) ifx/| α | →
0 (respectively, uniformly bounded) asα →
0.2. Preliminaries
In this section, we recall some concepts and materials of Euclidean Jordan algebras that will be used in the subsequent sections. More detailed expositions of Euclidean Jordan algebras can be found in the monograph by Faraut and Korányi [9].
Besides, one can find excellent summaries in the articles [2,12,24,26].
A Euclidean Jordan algebra is a triple
(V, ◦, ·,·
V)
, whereV
is a finite-dimensional inner product space over the real fieldR
and(
x,
y) →
x◦
y: V × V → V
is a bilinear mapping satisfying the following conditions:(i) x
◦
y=
y◦
x for all x,
y∈ V
,(ii) x
◦ (
x2◦
y) =
x2◦ (
x◦
y)
for all x,
y∈ V
, where x2:=
x◦
x, and (iii) x◦
y,
zV=
x,
y◦
zV for all x,
y,
z∈ V
.We call x
◦
y the Jordan product of x and y. We also assume that there is an element e∈ V
, called the unit element, such that x◦
e=
x for all x∈ V
. For x∈ V
, letζ (
x)
be the degree of the minimal polynomial of x, which can be equivalently defined asζ (
x) :=
min k:e
,
x,
x2, . . . ,
xkare linearly dependent
.
Since
ζ (
x)
dim( V)
where dim( V)
denotes the dimension ofV
, the rank ofV
is well defined by r:=
max{ζ(
x)
: x∈ V}
. In a Euclidean Jordan algebraA = (V, ◦, ·,·
V)
, we define the set of squares asK := {
x2: x∈ V}
. Then, by Theorem III.2.1 of [9],K
is a symmetric cone. This means thatK
is a self-dual closed convex cone with nonempty interior int(K)
and for any two elements x,
y∈
int(K)
, there exists an invertible linear transformationT : V → V
such thatT (K) = K
andT (
x) =
y.A Euclidean Jordan algebra is said to be simple if it is not the direct sum of two Euclidean Jordan algebras. By Propo- sition III.4.4 of [9], each Euclidean Jordan algebra is, in a unique way, a direct sum of simple Euclidean Jordan algebras.
A common simple Euclidean Jordan algebra is
( S
n, ◦, ·,·
Sn)
, whereS
n is the space of n×
n real symmetric matrices with the inner productX,
YSn:=
Tr(
X Y)
, and the Jordan product is defined by X◦
Y:= (
X Y+
Y X)/
2. Here, X Y is the usual matrix multiplication of X and Y and Tr(
X)
is the trace of X . The associate coneK
is the set of all positive semidef- inite matrices. Another one is the Lorentz algebra( R
n, ◦, ·,·
Rn)
, whereR
n is the Euclidean space of dimension n with the standard inner product x,
yRn=
xTy, and the Jordan product is defined by x◦
y:= (
x,
yRn,
x1y2+
y1x2)
for any x= (
x1,
x2),
y= (
y1,
y2) ∈ R × R
n−1. The associate cone, called the Lorentz cone or the second-order cone, isK :=
x
= (
x1,
x2) ∈ R × R
n−1: x2x1
.
Recall that an element c
∈ V
is said to be idempotent if c2=
c. Two idempotents c and d are said to be orthogonal if c◦
d=
0. One says that{
c1,
c2, . . . ,
ck}
is a complete system of orthogonal idempotents ifc2j
=
cj,
cj◦
ci=
0 if j=
i,
j,
i=
1,
2, . . . ,
k,
andk
j=1 cj
=
e.
A nonzero idempotent is said to be primitive if it cannot be written as the sum of two other nonzero idempotents. We call a complete system of orthogonal primitive idempotents a Jordan frame. Then, we have the following spectral decomposition theorem.
Theorem 2.1. (See [9, Theorem III.1.2].) Suppose that
A = (V, ◦, ·,·
V)
is a Euclidean Jordan algebra and the rank ofA
is r. Then for any x∈ V
, there exist a Jordan frame{
c1,
c2, . . . ,
cr}
and real numbersλ
1(
x), λ
2(
x), . . . , λ
r(
x)
, arranged in the decreasing orderλ
1(
x) λ
2(
x) · · · λ
r(
x)
, such that x=
rj=1
λ
j(
x)
cj.The numbers
λ
j(
x)
(counting multiplicities), which are uniquely determined by x, are called the eigenvalue, and we write the maximum eigenvalue and the minimum eigenvalue of x asλ
max(
x)
andλ
min(
x)
, respectively. The trace of x, denoted as tr(
x)
, is defined by tr(
x) :=
rj=1
λ
j(
x)
; whereas the determinant of x is defined by det(
x) :=
rj=1
λ
j(
x)
.By Proposition III.1.5 of [9], a Jordan algebra over
R
with a unit element e∈ V
is Euclidean if and only if the symmetric bilinear form tr(
x◦
y)
is positive definite. Hence, we may define an inner product·,·
onV
by x,
y:=
tr(
x◦
y), ∀
x,
y∈ V.
(10)Unless otherwise states, the inner product
·,·
appearing in this paper always means the one defined by (10). By the asso- ciativity of tr( ·)
(see [9, Proposition II.4.3]), the inner product·,·
is associative, i.e.,x,
y◦
z=
y,
x◦
zfor all x,
y,
z∈ V
. LetL(
x)
y:=
x◦
y for every y∈ V.
Then, the linear operator
L(
x)
for each x∈ V
is symmetric with respect to the inner product·,·
in the sense thatL(
x)
y,
z=
y, L(
x)
zfor any y,
z∈ V
. Let be the norm onV
induced by the inner product·,·
, namely, x:=
x,
x=
rj=1
λ
2j(
x)
1/2, ∀
x∈ V.
It is not difficult to verify that for any x
,
y∈ V
, there always holds that x,
y1 2
x2
+
y2and
x+
y22 x2+
y2.
(11)Let
ϕ : R → R
be a scalar valued function. Then, it is natural to define a vector-valued function associated with the Euclidean Jordan algebraA = (V, ◦, ·,·)
byϕ
V(
x) := ϕ λ
1(
x)
c1
+ ϕ λ
2(
x)
c2
+ · · · + ϕ λ
r(
x)
cr
,
(12)where x
∈ V
has the spectral decomposition x=
rj=1
λ
j(
x)
cj. The functionϕ
V is also called the Löwner operator [26].When
ϕ (
t)
is chosen as max{
0,
t}
and min{
0,
t}
for t∈ R
, respectively,ϕ
V becomes the metric projection operator ontoK
and−K
:(
x)
+:=
r j=1
max
0, λ
j(
x)
cj and
(
x)
−:=
r j=1
min
0, λ
j(
x)
cj
.
(13)Lemma 2.1. (See [26, Theorem 13].) For any x
=
rj=1
λ
j(
x)
cj, letϕ
Vbe given as in (12). Thenϕ
Vis (continuously) differentiable at x if and only ifϕ
is (continuously) differentiable at eachλ
j(
x)
, j=
1,
2, . . . ,
r. The derivative ofϕ
Vat x, for any h∈ V
, isϕ
V(
x)
h=
rj=1
ϕ
[1]λ(
x)
j j
cj,
hcj+
1j<lr 4
ϕ
[1]λ(
x)
jlcj
◦ (
cl◦
h),
where
ϕ
[1]λ(
x)
i j
:=
ϕ(λi(x))−ϕ(λj(x))λi(x)−λj(x) if
λ
i(
x) = λ
j(
x), ϕ
(λ
i(
x))
ifλ
i(
x) = λ
j(
x),
i
,
j=
1,
2, . . . ,
r.
In fact, the Jacobian
ϕ
V( ·)
is a linear and symmetric operator, which can be written asϕ
V(
x) =
rj=1
ϕ
λ
j(
x)
Q(
cj) +
2r i,j=1,i=j
ϕ
[1]λ(
x)
i j
L(
cj)L(
ci),
(14)where
Q(
x) :=
2L
2(
x) − L(
x2)
for any x∈ V
is called the quadratic representation ofV
.In the sequel, unless otherwise stated, we assume that
A = (V, ◦, ·,·)
is a simple Euclidean Jordan algebra of rank r and dim( V) =
n.An important part in the theory of Euclidean Jordan algebras is the Peirce decomposition. Let c be a nonzero idempotent in
A
. Then, by [9, Proposition III.1.3], c satisfies 2L
3(
c) −
3L
2(
c) + L(
c) =
0 and the distinct eigenvalues of the symmetric operatorL(
c)
are 0, 12 and 1. LetV(
c,
1), V(
c,
12)
andV(
c,
0)
be the three corresponding eigenspaces, i.e.,V(
c, α ) :=
x
∈ V
:L(
c)
x= α
x, α =
1,
1 2,
0.
Then
V
is the orthogonal direct sum ofV(
c,
1)
,V(
c,
12)
andV(
c,
0)
. The decompositionV = V(
c,
1) ⊕ V
c,
12
⊕ V(
c,
0)
is called the Peirce decomposition of
V
with respect to the nonzero idempotent c.Let
{
c1,
c2, . . . ,
cr}
be a Jordan frame ofA
. For i,
j∈ {
1, . . . ,
r}
, define the eigenspacesV
ii:= V(
ci,
1) = R
ci,
V
i j:= V
ci,
12
∩ V
cj,
12
,
i=
j.
Then, from [9, Theorem IV.2.1], it follows that the following conclusion holds.
Theorem 2.2. The space
V
is the orthogonal direct sum of subspacesV
i j(
1ijr)
, i.e.,V =
ijVi j. Furthermore,
V
i j◦ V
i j⊂ V
ii+ V
j j, V
i j◦ V
jk⊂ V
ik,
if i=
k,
V
i j◦ V
kl= {
0},
if{
i,
j} ∩ {
k,
l} = ∅.
Let x
∈ V
have the spectral decomposition x=
rj=1
λ
j(
x)
cj. For i,
j∈ {
1,
2, . . . ,
r}
, letC
i j(
x)
be the orthogonal projection operator ontoV
i j. Then,C
i j(
x) = C
i j∗(
x), C
2i j(
x) = C
i j(
x), C
i j(
x)C
kl(
x) =
0 if{
i,
j} = {
k,
l},
i,
j,
k,
l=
1, . . . ,
r,
(15) and1ijr
C
i j(
x) = I,
(16)where
C
i j∗ is the adjoint (operator) ofC
i j. In addition, by [9, Theorem IV.2.1],C
j j(
x) = Q(
cj)
andC
i j(
x) =
4L(
ci) L(
cj) =
4L(
cj) L(
ci) = C
ji(
x),
i,
j=
1,
2, . . . ,
r.
Note that the original notation in [9] for orthogonal projection operator is Pi j. However, to avoid confusion with another orthogonal projector Pi
(
cj)
ontoV(
c, α )
and orthogonal matrix P which will be used later (Sections 3 and 4), we adoptC
i j instead.With the orthogonal projection operators
{C
i j(
x)
: i,
j=
1,
2, . . . ,
r}
, we have the following spectral decomposition theo- rem forL(
x)
andL(
x2)
; see [15, Chapters VI–V].Lemma 2.2. Let x
∈ V
have the spectral decomposition x=
rj=1
λ
j(
x)
cj. Then the symmetric operatorL(
x)
has the spectral decom- positionL(
x) =
rj=1
λ
j(
x) C
j j(
x) +
1j<lr 1 2
λ
j(
x) + λ
l(
x) C
jl(
x)
with the spectrum
σ ( L(
x))
consisting of all distinct numbers in{
12(λ
j(
x) + λ
l(
x))
: j,
l=
1,
2, . . . ,
r}
, andL(
x2)
has the spectral decompositionL
x2=
rj=1
λ
2j(
x)C
j j(
x) +
1j<lr 1 2
λ
2j(
x) + λ
2l(
x)
C
jl(
x)
(17)with the spectrum
σ (L(
x2))
consisting of all distinct numbers in{
12(λ
2j(
x) + λ
2l(
x))
: j,
l=
1,
2, . . . ,
r}
. Proposition 2.1. For any x∈ V
, the operatorL(
x2) − L
2(
x)
is positive semidefinite.Proof. By Lemma 2.2 and (15), we can verify that
L
2(
x)
has the spectral decompositionL
2(
x) =
r j=1
λ
2j(
x)C
j j(
x) +
1j<lr 1 4
λ
j(
x) + λ
l(
x)
2C
jl(
x).
(18)This means that the operator
L(
x2) − L
2(
x)
has the spectral decompositionL
x2
− L
2(
x) =
1j<lr
1 2λ
2j(
x) + λ
l2(
x)
−
1 4λ
j(
x) + λ
l(
x)
2C
jl(
x).
Noting that the orthogonal projection operator is positive semidefinite on
V
andλ
2j(
x) + λ
2l(
x)
2
(λ
j(
x) + λ
l(
x))
24 for all j
,
l=
1,
2, . . . ,
r,
we readily obtain the conclusion from the spectral decomposition of
L(
x2) − L
2(
x)
.2
3. Differentiability of the functionψτIn this section, we show that
ψ
τ is a merit function associated withK
, and moreover, it is differentiable everywhere onV × V
. By the definition of Jordan product,x2
+
y2+ ( τ −
2)(
x◦
y) =
x
+ τ −
2 2 y 2+ τ (
4− τ )
4 y2=
y
+ τ −
2 2 x 2+ τ (
4− τ )
4 x2
∈ K
(19)for any x
,
y∈ V
, and consequently the functionφ
τ in (4) is well defined. The following lemma states thatφ
τ andψ
τ is respectively a complementarity function and a merit function associated withK
.Lemma 3.1. For any x
,
y∈ V
, letφ
τ andψ
τ be given by (4) and (6), respectively. Then,ψ
τ(
x,
y) =
0⇐⇒ φ
τ(
x,
y) =
0⇐⇒
x∈ K,
y∈ K,
x,
y=
0.
Proof. The first equivalence is clear by the definition of
ψ
τ , and we only need to prove the second equivalence. Suppose thatφ
τ(
x,
y) =
0. Then, x2+
y2+ ( τ −
2)(
x◦
y)
1/2= (
x+
y).
(20)Squaring the two sides of (20) yields that
x2
+
y2+ ( τ −
2)(
x◦
y) =
x2+
y2+
2(
x◦
y),
which implies x
◦
y=
0 sinceτ ∈ (
0,
4)
. Substituting x◦
y=
0 into (20), we have that x=
x2
+
y21/2−
y and y=
x2
+
y21/2−
x.
Since x2
+
y2∈ K
, x2∈ K
and y2∈ K
, from [12, Proposition 8] or [19, Corollary 9] it follows that x,
y∈ K
. Consequently, the necessity holds. For the other direction, suppose x,
y∈ K
and x◦
y=
0. Then,(
x+
y)
2=
x2+
y2. This, together with x◦
y=
0, implies that x2+
y2+ ( τ −
2)(
x◦
y)
1/2− (
x+
y) =
0.
Consequently, the sufficiency follows. The proof is thus completed.
2
In what follows, we concentrate on the differentiability of the merit function
ψ
τ . For this purpose, we need the following two crucial technical lemmas.Lemma 3.2. For any x
,
y∈ V
, let u(
x,
y) := (
x2+
y2)
1/2. Then, the function u(
x,
y)
is continuously differentiable at any point(
x,
y)
such that x2+
y2∈
int(K)
. Furthermore,∇
xu(
x,
y) = L(
x)L
−1 u(
x,
y)
and
∇
yu(
x,
y) = L(
y)L
−1 u(
x,
y)
.
(21)Proof. The first part is due to Lemma 2.1. It remains to derive the formulas in (21). From the definition of u
(
x,
y)
, it follows thatu2
(
x,
y) =
x2+
y2, ∀
x,
y∈ V.
(22)By the formula (14), it is easy to verify that
∇
x(
x2) =
2L(
x)
. Differentiating on both sides of (22) with respect to x then yields that2
∇
xu(
x,
y)L
u(
x,
y)
=
2L(
x).
This implies that
∇
xu(
x,
y) = L(
x) L
−1(
u(
x,
y))
since, by u(
x,
y) ∈
int( K)
,L(
u(
x,
y))
is positive definite onV
. Similarly, we have that∇
yu(
x,
y) = L(
y)L
−1(
u(
x,
y))
.2
To present another lemma, we first introduce some related notations. For any 0
=
z∈ K
and z∈ /
int(K)
, suppose that z has the spectral decomposition z=
rj=1
λ
j(
z)
cj, where{
c1,
c2, . . . ,
cr}
is a Jordan frame andλ
1(
z), . . . , λ
r(
z)
are the eigenvalues arranged in the decreasing orderλ
1(
z) λ
2(
z) · · · λ
r(
z) =
0. Define the indexj∗
:=
minj
λ
j(
z) =
0,
j=
1,
2, . . . ,
r(23) and let
cJ
:=
j∗−1
l=1
cl
.
(24)Clearly, j∗ and cJ are well defined since 0
=
z∈ K
and z∈ /
int( K)
. Since cJ is an idempotent and cJ=
0 (otherwise z=
0),V
can be decomposed as the orthogonal direct sum of the subspacesV(
cJ,
1)
,V(
cJ,
12)
andV(
cJ,
0)
. In the sequel, we write P1(
cJ)
, P12
(
cJ)
and P0(
cJ)
as the orthogonal projection ontoV(
cJ,
1)
,V(
cJ,
12)
andV(
cJ,
0)
, respectively. From [21], we know thatL(
z)
is positive definite onV(
cJ,
1)
and is a one-to-one mapping fromV(
cJ,
1)
toV(
cJ,
1)
. This means thatL(
z)
an inverseL
−1(
z)
onV(
cJ,
1)
, i.e., for any u∈ V(
cJ,
1)
,L
−1(
z)
u is the unique v∈ V(
cJ,
1)
such that z◦
v=
u.Lemma 3.3. For any x
,
y∈ V
, let z: V × V → V
be the mapping defined as z=
z(
x,
y) :=
x2
+
y2+ ( τ −
2)(
x◦
y)
1/2.
(25)If
(
x,
y) = (
0,
0)
such that z(
x,
y) / ∈
int( K)
, then the following results hold:(a) The vectors x
,
y,
x+
y,
x+
τ−22y and y+
τ−22x belong to the subspaceV(
cJ,
1)
.(b) For any h
∈ V
such that z2(
x,
y) +
h∈ K
, let w=
w(
x,
y) := [
z2(
x,
y) +
h]
1/2−
z(
x,
y)
. Then, P1(
cJ)
w=
12L
−1(
z(
x,
y)) × [
P1(
cJ)
h] +
o(
h)
.Proof. From (19) and the definition of z, it is clear that z
(
x,
y) ∈ K
for all x,
y∈ V
. Hence, using the similar arguments as Lemma 11 of [21] yields the desired result.2
Now by Lemmas 3.2 and 3.3, we prove the differentiability of the merit function
ψ
τ .Proposition 3.1. The function
ψ
τ defined by (6) is differentiable everywhere onV × V
. Furthermore,∇
xψ
τ(
0,
0) = ∇
yψ
τ(
0,
0) =
0, and if(
x,
y) = (
0,
0)
, then∇
xψ
τ(
x,
y) =
L
x
+ τ −
2 2 yL
−1z
(
x,
y)
− I
φ
τ(
x,
y),
∇
yψ
τ(
x,
y) =
L
y
+ τ −
2 2 xL
−1z
(
x,
y)
− I
φ
τ(
x,
y),
(26)where z
(
x,
y)
is given by (25).Proof. We prove the conclusion by the following three cases.
Case (1):
(
x,
y) = (
0,
0)
. For any u,
v∈ V
, suppose that u2+
v2+ ( τ −
2)(
u◦
v)
has the spectrum decomposition u2+
v2+ ( τ −
2)(
u◦
v) =
rj=1