Contents lists available atScienceDirect
Operations Research Letters
journal homepage:www.elsevier.com/locate/orl
Stationary point conditions for the FB merit function associated with symmetric cones
Shaohua Pan
a, Yu-Lin Chang
b, Jein-Shan Chen
b,∗aSchool of Mathematical Sciences, South China University of Technology, Guangzhou 510640, China
bDepartment of Mathematics, National Taiwan Normal University, Taipei 11677, Taiwan
a r t i c l e i n f o
Article history:
Received 28 October 2009 Accepted 13 July 2010 Available online 24 July 2010
Keywords:
Fischer–Burmeister merit function Symmetric cones
Stationary points
a b s t r a c t
For the symmetric cone complementarity problem, we show that each stationary point of the unconstrained minimization reformulation based on the Fischer–Burmeister merit function is a solution to the problem, provided that the gradient operators of the mappings involved in the problem satisfy column monotonicity or have the Cartesian P0-property. These results answer the open question proposed in the article that appeared in Journal of Mathematical Analysis and Applications 355 (2009) 195–215.
© 2010 Elsevier B.V. All rights reserved.
1. Introduction
Let A
= (
V, ◦, h·, ·i)
be an n-dimensional Euclidean Jordan algebra (see Section2for the definition) andKbe the symmetric cone in V. We consider the following symmetric cone complemen- tarity problem (SCCP) which is to find a vectorζ ∈
V such that F(ζ) ∈
K,
G(ζ ) ∈
K, h
F(ζ ),
G(ζ )i =
0,
(1) where F and G are the differentiable mappings from V to V. This class of problems provides a unified framework for the nonlinear complementarity problem (NCP) over the nonnegative orthant cone in Rn, the second-order cone complementarity problem (SOCCP) and the semidefinite complementarity problem (SDCP), and becomes one of the main research interests in the current optimization field; see, e.g., [5,10,14,16,17,19].Recently, there are active studies for merit functions (or com- plementarity functions) for the SCCP. For example, Liu, Zhang and Wang [14] extended a class of merit functions proposed in [8] for the NCP to the SCCP; Kong, Tuncel and Xiu [11] studied the implicit Lagrangian merit function for the SCCP; Kong, Sun and Xiu [10]
proposed a regularized smoothing method by use of the natu- ral residual complementarity function associated with symmet- ric cones; and Huang and Ni [6] developed a smoothing algorithm with the regularized CHKS smoothing function over symmetric cones. Along this line, we also extended the one-parametric class of merit functions in [7] to the SCCP [15]. Specifically, a function
∗Corresponding author.
E-mail addresses:[email protected](S. Pan),[email protected] (Y.-L. Chang),[email protected](J.-S. Chen).
ψ :
V×
V→
R+is called a merit function associated with the coneKifψ(
x,
y) =
0⇐⇒
x∈
K,
y∈
K, h
x,
yi =
0.
(2) With such a function, the SCCP can be reformulated as an unconstrained minimizationminζ∈VΨ
(ζ ) := ψ(
F(ζ ),
G(ζ )),
(3) in the sense thatζ
∗ solves(1) if and only if it is a solution of (3) with zero optimal value. Then, the effective unconstrained minimization methods can be applied for solving it.A popular choice for
ψ
is the Fischer–Burmeister (FB) merit functionψ
FBdefined asψ
FB(
x,
y) :=
12
k φ
FB(
x,
y)k
2∀
x,
y∈
V (4) whereφ
FB:
V×
V→
V is the FB complementarity function associated withK, given byφ
FB(
x,
y) = (
x2+
y2)
1/2− (
x+
y)
(5) with x2=
x◦
x denoting the Jordan product of x and itself, and x1/2 the unique square root of x∈
K, i.e., x1/2◦
x1/2=
x. The functionψ
FB was first proved to be differentiable in [14], and later the authors of [12,15] independently showed that it is continuously differentiable everywhere with Lipschitz continuous gradients. However, it has been an open question: under what conditions every stationary point of the minimization problemminζ∈V ΨFB
(ζ ) := ψ
FB(
F(ζ ),
G(ζ ))
(6)0167-6377/$ – see front matter©2010 Elsevier B.V. All rights reserved.
doi:10.1016/j.orl.2010.07.011
is guaranteed to be a solution of(1). The main difficulty to establish such results is described in [15]. The study for stationary point conditions is extremely important in the merit function approach since, when applying effective gradient-type methods for solving the minimization reformulation problems, one at most expects to get a stationary point due to the nonconvexity of the merit functions.
The main purpose of this paper is to settle down this open problem. By exploiting the classification of a simple Euclidean Jordan algebra and extending a weaker result than the first implication of [4, Prop. 3.4] to the setting of symmetric cones, we show that each stationary point of the minimization problem(6)is a solution to(1)if the gradient operators
∇
F and−∇
G are column monotone. If the operator∇
G is invertible, this condition can be relaxed to the one that∇
G−1∇
F has the Cartesian P0-property.2. Preliminaries
This section recalls some results on Euclidean Jordan algebras that will be used in the subsequent section. More detailed expositions of Euclidean Jordan algebras can be found in Koecher’s lecture notes [9] and the monograph by Faraut and Korányi [3].
A Euclidean Jordan algebra is a triple
(
V, ◦, h·, ·i
V)
where(
V, h·, ·i
V)
is a finite-dimensional inner product space over the real number field R and(
x,
y) 7→
x◦
y:
V×
V→
V is a bilinear mapping satisfying the following conditions:(i) x
◦
y=
y◦
x for all x,
y∈
V;(ii) x
◦ (
x2◦
y) =
x2◦ (
x◦
y)
for all x,
y∈
V, where x2=
x◦
x;(iii)
h
x◦
y,
zi
V= h
y,
x◦
zi
Vfor all x,
y,
z∈
V.Let A
= (
V, ◦, h·, ·i
V)
denote a Euclidean Jordan algebra. We assume that there is an element e∈
V (called the unit element) such that x◦
e=
x for all x∈
V. By [3, Theorem III. 2.1], the set of squaresK:=
x2
|
x∈
Vis a symmetric cone. We write x
Ky (respectively, xKy) to mean x−
y∈
K (respectively, x−
y∈
intK).For x
∈
V, let m(
x) :=
min{
k: {
e,
x,
x2, . . . ,
xk}
are linearly dependent}
and define the rank of A by r:=
max{
m(
x) :
x∈
V}
. Recall that an element c∈
V is idempotent if c2=
c, and it is a primitive idempotent if it is nonzero and cannot be written as a sum of two nonzero idempotents. One says that a finite set{
c1,
c2, . . . ,
ck}
of primitive idempotents in V is a Jordan frame if cj◦
ci=
0 if j6=
i for all j,
i=
1,
2, . . . ,
k,
andk
X
j=1
cj
=
e.
Now we may state the second version of the spectral decomposi- tion theorem.Theorem 2.1 ([3, Theorem III. 1.2]). Let A be a Euclidean Jordan algebra with rank r. Then for every x
∈
V, there exist a Jordan frame{
c1,
c2, . . . ,
cr}
and real numbersλ
1(
x), . . . , λ
r(
x)
, arranged in the decreasing orderλ
1(
x) ≥ λ
2(
x) ≥ · · · ≥ λ
r(
x)
, such thatx
= λ
1(
x)
c1+ λ
2(
x)
c2+ · · · + λ
r(
x)
cr.
The numbers
λ
j(
x)
(counting multiplicities), which are uniquely determined by x, are called the eigenvalues of x, and tr(
x) = P
rj=1
λ
j(
x)
is called the trace of x.Since, by [3, Prop. III.1.5], a Jordan algebra
(
V, ◦)
with a unit element e∈
V is Euclidean if and only if the symmetric bilinear form tr(
x◦
y)
is positive definite, we may define another inner product on V byh
x,
yi :=
tr(
x◦
y) ∀
x,
y∈
V.
(7) The inner producth· , ·i
is associative by [3, Prop. II.4.3], i.e.,h
x,
y◦
zi = h
y,
x◦
zi
for any x,
y,
z∈
V. For any given x∈
V, letL(
x)
bethe Lyapunov operator defined by L
(
x)
y:=
x◦
y∀
y∈
V.
Then,L
(
x)
is symmetric with respect to the inner producth· , ·i
in the sense thath
L(
x)
y,
zi = h
y,
L(
x)
zi ∀
y,
z∈
V.
In what follows, we let
k · k
be the norm on V induced by this inner product, i.e.,k
xk := p
h
x,
xi = p
tr(
x2) =
r
X
j=1
λ
2j(
x)
!
1/2∀
x∈
V.
(8) This definition implies that the unit element e in this paper has a length equal to√
r.
Unless otherwise stated, in the rest of this paper, we assume that A
= (
V, ◦, h·, ·i)
is a simple Euclidean Jordan algebra of rank r and dimension n. By [3, Theorem V.3.7], r≥
2.Let x
∈
V have the spectral decomposition x= P
r j=1λ
j(
x)
cj, whereλ
1(
x) ≥ λ
2(
x) ≥ · · · ≥ λ
r(
x)
are the eigenvalues of x and{
c1,
c2, . . . ,
cr}
is the corresponding Jordan frame. By [3, Lemma IV. 1.3], the operatorsL(
cj),
j=
1,
2, . . . ,
r commute and admit a simultaneous diagonalization. For all i,
j∈ {
1,
2, . . . ,
r}
, define the subspacesVii
:=
Rci= {
x∈
V|
x◦
ci=
x} ,
Vij:=
x
∈
V|
x◦
ci=
12x
=
x◦
cjwhen i
6=
j,
and letCij
(
x)
be the orthogonal projection operator onto Vij. The following lemma gives the spectral decomposition of the operator L(
x)
, whose proof can be found in [9].Lemma 2.1. Let x
∈
V have the spectral decomposition x= P
rj=1
λ
j(
x)
cj. Then the linear symmetric operator L(
x)
has the spectral decompositionL
(
x) =
r
X
j=1
λ
j(
x)
Cjj(
x) + X
1≤j<l≤r
1
2
λ
j(
x) + λ
l(
x)
Cjl(
x)
(9) with the spectrumσ(
L(
x))
consisting of all distinct 12(λ
j(
x) + λ
l(
x))
for j,
l=
1, . . . ,
r.To close this section, we recall the smoothness of FB merit function
ψ
FB defined by(4) and(5), whose proof can be found in [14, Lemma 12] and [15, Prop. 4.3].Lemma 2.2. Let
ψ
FB be defined by (4) and (5). Then,ψ
FB is continuously differentiable everywhere. Furthermore,∇
xψ
FB(
0,
0) =
∇
yψ
FB(
0,
0) =
0; and if(
x,
y) 6= (
0,
0)
,∇
xψ
FB(
x,
y) =
L(
x)
L−1(
z) −
Iφ
FB(
x,
y),
∇
yψ
FB(
x,
y) =
L(
y)
L−1(
z) −
Iφ
FB(
x,
y),
where z
= (
x2+
y2)
1/2, andIdenotes the identity operator from V to V.3. Main result
First of all, we present a new representation for the elements in V. Let Vedenote the subspace generated by the unit element e, and V⊥e the orthogonal complementarity of Ve. Note that the unit element e of A is unique. Hence, any x
∈
V can be uniquely written asλ
xe+
xe withλ
x∈
R and xe∈
V⊥e. Moreover, we have the following result.Lemma 3.1. For z
= λ
ze+
ze∈
V withλ
z∈
R and ze∈
V⊥e, the following results hold.(a) tr
(
z) =
rλ
zandk
zk
2=
rλ
2z+ k
zek
2.(b) If z
∈
K, then√
r2
−
rλ
z≥ k
zek
. If in addition z6=
0, thenλ
z>
0 andk
zek >
0.(c) When r
=
2,
tr(
L2(
ze)) = k
zek
2.Proof. (a) The result is direct by the definition of
h· , ·i
and the fact thatk
ek
2=
r.(b) Since z
∈
K, we have tr(
z) ≥ k
zk
. This by part (a) implies(
r2−
r)λ
2z≥ k
zek
2, and the first part then follows. Since r≥
2, from the inequality√
r2
−
rλ
z≥ k
zek
we obtainλ
z≥
0, andk
zek =
0 wheneverλ
z=
0. This showsλ
z>
0 andk
zek >
0 if 06=
z∈
K. (c) SinceL(
e) =
I, we haveL(
z) = λ
zI+
L(
ze)
, which together withLemma 2.1impliesL
(
ze) =
r
X
j=1
(λ
j(
z) − λ
z)
Cjj(
z)
+ X
1≤j<l≤z
1
2
λ
j(
z) + λ
l(
z) −
2λ
z Cjl(
z).
Since Cjl
(
z)
for all j,
l=
1,
2, . . . ,
r are orthogonal projection operators, we haveL2
(
ze) =
r
X
j=1
(λ
j(
z) − λ
z)
2Cjj(
z)
+ X
1≤j<l≤r
1
4
λ
j(
z) + λ
l(
z) −
2λ
z 2Cjl
(
z).
Note that when r
=
2, part (a) impliesλ
1(
z) + λ
2(
z) −
2λ
z=
0, and therefore we havetr
(
L2(
ze)) = (λ
1(
z) − λ
z)
2+ (λ
2(
z) − λ
z)
2= k
zk
2−
2tr(
z)λ
z+
2λ
2z= k
zk
2−
2λ
2z= k
zek
2where the last equality is due to part (a). Thus, the proof is complete.
To achieve the main result of this paper, the key is to establish the implication that
z2
Kx2+
y2H⇒
c [L(
z) −
L(
x)
] [L(
z) −
L(
y)
] [L(
z) −
L(
x) −
L(
y)
]2 (10) for all x,
y∈
V and zK0, where c>
0 is a constant, and for the operatorsG,
H:
V→
V,
GHmeansh
x, (
G−
H)
xi
V>
0 for any 06=
x∈
V andGHmeansh
x, (
G−
H)
xi
V≥
0 for any x∈
V.The following proposition tries to establish such an implication.
Proposition 3.1. For any x
= λ
xe+
xe,
y= λ
ye+
ye∈
V and z= λ
ze+
ze∈
intK, if rλ
2z≥ k
zek
2and tr[
L2(
ze) −
L2(
xe) −
L2(
ye)] ≥
r−1(k
zek
2− k
xek
2− k
yek
2),
thenz2
Kx2+
y2H⇒
L2(
z) −
L2(
x) −
L2(
y)
0 (11) which is equivalent to saying thatz2
Kx2+
y2H⇒
2 [L(
z) −
L(
x)
] [L(
z) −
L(
y)
] [L(
z) −
L(
x) −
L(
y)
]2.
(12) Moreover, the two implications remain true when ‘‘’’ is replaced by‘‘
’’.Proof. We adopt the proof technique of [4, Prop. 3.4]. First, consider the case where z
= (
x2+
y2+ δ
e)
1/2for someδ >
0. Fix any x,
y∈
V with x= λ
xe+
xeand y= λ
ye+
yewhereλ
x, λ
y∈
R and xe,
ye∈
V⊥e. From z2=
x2+
y2+ δ
e and z= λ
ze+
ze, we haveλ
2ze+
2λ
zze+
ze2= λ
2xe+
2λ
xxe+
x2e+ λ
2ye+
2λ
yye+
y2e+ δ
e.
Noting that z2e
,
x2e,
y2e∈
Veand xe,
ye,
ze∈
V⊥e, we obtain from the last equality thatλ
zze= λ
xxe+ λ
yye andλ
2ze+
ze2= λ
2xe+
x2e+ λ
2ye+
y2e+ δ
e.
(13) From the first equality of(13),λ
zL(
ze) − λ
xL(
xe) − λ
yL(
ye) =
0, which implies thatL2
(
z) −
L2(
x) −
L2(
y) = λ
2z− λ
2x− λ
2y L(
e) +
L2(
ze) −
L2(
xe) −
L2(
ye).
Thus, to prove(11), it suffices to prove that for any 0
6=
h= λ
he+
he∈
V,λ
2z− λ
2x− λ
2yk
hk
2+ k
ze◦
hk
2− k
xe◦
hk
2− k
ye◦
hk
2>
0,
which, by noting that z2e,
x2e,
y2e∈
Ve,
h= λ
he+
heand he∈
V⊥e, is equivalent to(λ
2z− λ
2x− λ
2y)k
hk
2+ λ
2h(k
zek
2− k
xek
2− k
yek
2)
+ (k
ze◦
hek
2− k
xe◦
hek
2− k
ye◦
hek
2) >
0.
(14) Sinceλ
z>
0 byLemma 3.1(b), from the two equalities in(13)we haver
λ
2z+ k λ
xxe+ λ
yyek
2λ
2z= k
zk
2=
rλ
2x+ k
xek
2+
rλ
2y+ k
yek
2+
rδ.
Multiplying the two sides with
λ
2zand addingλ
2yk
xek
2+ λ
2xk
yek
2 simultaneously yields(λ
2x+ λ
2y)(k
xek
2+ k
yek
2) +
rλ
4z− λ
2z(
rλ
2x+ k
xek
2+
rλ
2y+ k
yek
2)
> kλ
yxe− λ
xyek
2,
which is equivalent to
(λ
2z− λ
2x− λ
2y)(
rλ
2z−k
xek
2−k
yek
2) > kλ
yxe− λ
xyek
2. This means that bothλ
2z− λ
2x− λ
2yand rλ
2z− k
xek
2− k
yek
2 are positive or both are negative. If both are negative, we must havek
xk
2+ k
yk
2>
2rλ
2z, which by the assumption rλ
2z≥ k
zek
2yields the contradictionk
zk
2> k
xk
2+ k
yk
2>
2rλ
2z≥
rλ
2z+ k
zek
2= k
zk
2. Thus, we getλ
2z> λ
2x+ λ
2y and rλ
2z> k
xek
2+ k
yek
2.
(15) Using the first equality of(13)and the second inequality of(15), for any se∈
V⊥e, se,
L2(
ze) −
L2(
xe) −
L2(
ye)
se= k
ze◦
sek
2− k
xe◦
sek
2− k
ye◦
sek
2= k λ
xxe◦
se+ λ
yye◦
sek
2λ
2z− k
xe◦
sek
2+ k
ye◦
sek
2= (λ
2x+ λ
2y− λ
2z) k
xe◦
sek
2+ k
ye◦
sek
2λ
2z− k
se◦ (λ
xye− λ
yxe)k
2λ
2z≤
0.
This shows thatL2
(
ze) −
L2(
xe) −
L2(
ye)
is negative semidefinite on V⊥e. Therefore,k
ze◦
hek
2− k
xe◦
hek
2− k
ye◦
hek
2= h
he, [
L2(
ze) −
L2(
xe) −
L2(
ye)]
hei
≥ h
he,
tr(
L2(
ze) −
L2(
xe) −
L2(
ye))
Ihei
≥
r−1k
hek
2(k
zek
2− k
xek
2− k
yek
2),
where the last inequality is due to the given assumption. Along with(15), we have that
(λ
2z− λ
2x− λ
2y)k
hk
2+ λ
2h(k
zek
2− k
xek
2− k
yek
2) + (k
ze◦
hek
2− k
xe◦
hek
2− k
ye◦
hek
2)
≥ λ
2h(k
zk
2− k
xk
2+ k
yk
2) +
r−1k
hek
2(k
zk
2− k
xk
2− k
yk
2)
=
r−1k
hk
2(k
zk
2− k
xk
2− k
yk
2) >
0.
This shows that(14)holds, and the implication in(11)is true for any x
,
y∈
V and z= (
x2+
y2+ δ
e)
1/2. Using the same arguments of [4, Prop. 3.4] yields that(11)holds.We next prove that the implication in(11)is equivalent to that of(12). Suppose that the implication in(11)holds. Fix any 0
6=
h∈
V. By the symmetry ofL(
x)
, clearly,h
h,
[L(
z) −
L(
x)
] [L(
z) −
L(
y)
] hi
= h
h,
[L(
z) −
L(
y)
] [L(
z) −
L(
x)
] hi .
LetS
(
x,
y)
denote the symmetric part of[
L(
z) −
L(
x)][
L(
z) −
L(
y)]
. Then,h
h,
[L(
z) −
L(
x)
] [L(
z) −
L(
y)
] hi = h
h,
S(
x,
y)
hi .
Using the definition ofS
(
x,
y)
, a simple computation yields that S(
x,
y) =
12[L
(
y) −
L(
z)
] [L(
x) −
L(
z)
]+
12[L
(
x) −
L(
z)
] [L(
y) −
L(
z)
]=
12[L
(
z) −
L(
x) −
L(
y)
]2+
12
L2(
z) −
L2(
x) −
L2(
y) .
The last two equations along with Eq.(11)imply the implication in (12). Conversely, if the implication in(12)holds, from the last two equations we obtain the implication in(11). The last part follows by the continuity of the operators.
ByLemma 3.1(b)–(c), when r
=
2, the assumptions ofPropo- sition 3.1 automatically hold, and we recover the first implication of [4, Prop. 3.4], or its equivalent result as below.Corollary 3.1. Suppose that r
=
2. Then, for any x,
y∈
V and zK0, it holds thatz2
Kx2+
y2H⇒
2 [L(
z) −
L(
x)
] [L(
z) −
L(
y)
] [L(
z) −
L(
x) −
L(
y)
]2.
(16) Moreover, the implication remains true when ‘‘’’ is replaced by ‘‘’’.When r
≥
3, the assumption rλ
2z≥ k
zek
2inProposition 3.1 may not hold. Also, it is hard to verify whether tr(
L2(
ze)−
L2(
xe)−
L2
(
ye)) ≥
r−1(k
zek
2− k
xek
2− k
yek
2)
holds or not. In other words, by use ofProposition 3.1it is difficult to achieve our goal for r≥
3.However, as will be shown byProposition 3.2, an implication as in(10)can be established for r
≥
3 by extending the proof of [18, Lemma 6.3(c)] to another three classes of matrix algebras.Proposition 3.2. Suppose that r
≥
3. Then, for any x,
y∈
V and zK0,z2
Kx2+
y2H⇒
4 [L(
z) −
L(
x)
] [L(
z) −
L(
y)
] [L(
z) −
L(
x) −
L(
y)
]2.
(17) Moreover, the implication remains true when ‘‘’’ is replaced by ‘‘’’.Proof. By [3, Theorem V.3.7], it suffices to prove this result for the following algebras:
(i) The algebraSnof n
×
n real symmetric matrices;(ii) The algebraHnof all n
×
n complex Hermitian matrices;(iii) The algebraQnof all n
×
n quaternionic Hermitian matrices;(iv) The algebraO3of all 3
×
3 octonionic Hermitian matrices.Among others, the four classes of matrix algebras are equipped with the Jordan product x
◦
y:=
12
(
xy+
yx)
and the trace inner producth
x,
yi
T:= <
Tr(
xy∗),
where the notation ‘‘∗
’’ means the conjugate transpose, Tr(
xy)
denotes the trace of xy which is the multiplication of matrices x and y, and<
a means the real part of a.Let C
,
Q and O denote the complex number field, the quaternion field and the octonion field, respectively. Let W be the algebra of n×
n matrices with entries in R,
C, or Q, or the algebra of 3×
3 matrices with entries in O, equipped with the inner producth· , ·i
Tand the normk·k
Tinduced byh· , ·i
T. By [3, Propositions V.1.2, V.1.5 and V.2.1], it is not difficult to verify that for any u, v, w ∈
W,<
Tr[ (w
u)(vw)] = <
Tr[ w(
uvw)] = <
Tr[ w
uvw]
= <
Tr[ w(
uv)w],
(18)<
Tr[ w(
uv)w] = <
Tr[ w(v
u)w]
if u, v, w
are Hermitian,
(19) and<
Tr(
uv) = <
Tr(
uv
∗)
if u is Hermitian.
(20) Also, by [3, Prop. V.2.1] we may verify thath
L(
x)
y,
zi
T= h
y,
L(
x)
zi
Tfor all x,
y and z from the spaceSn, orHn, orQn, orO3. Fix any x and y fromSn, orHn, orQn, orO3. Since z2Kx2+
y2, from the Löwner–Heinz inequality in [13] it follows thatz
Kx and zKy.
(21)Fix any 0
6=
a from the same space as x and y. From the above discussions, we have4
h
a,
[L(
z) −
L(
x)
] [L(
z) −
L(
y)
] ai
T=
4h (
z−
x) ◦
a, (
z−
y) ◦
ai
T= h
a(
z−
x) + (
z−
x)
a,
a(
z−
y) + (
z−
y)
ai
T=
2<
Tr [(
a(
z−
x) + (
z−
x)
a)((
z−
y)
a)
]=
2<
Tr [a(
z−
x)(
z−
y)
a+ (
z−
x)
a(
z−
y)
a]=
2<
Tra
(
z2−
zy−
xz+
xy)
a+
2<
Tr(
z−
x)
1/2(
z−
x)
1/2a(
z−
y)
1/2(
z−
y)
1/2a> <
Tra
(
2xy−
2zx−
2zy+
z2+
x2+
y2)
a+
2<
Tr(
z−
x)
1/2a(
z−
y)
1/2(
z−
y)
1/2a(
z−
x)
1/2= <
Tr[
a(
z−
x−
y)
2a] +
2(
z−
x)
1/2a(
z−
y)
1/22 T
≥ <
Tr[
a(
z−
x−
y)
2a]
= <
Tr[ (
a(
z−
x−
y))((
z−
x−
y)
a)]
= <
Tr[
a(
z−
x−
y)(
a(
z−
x−
y))
∗]
= k (
z−
x−
y)
ak
2T,
where the first equality is by the symmetry ofL
(·)
with respect toh· , ·i
T, the third is due to(20)and the fact that a(
z−
x) + (
z−
x)
a is Hermitian, the fourth is by(18), the fifth is by(18)and(21), and the first inequality is using z2Kx2+
y2. On the other hand, a,
[L(
z) −
L(
x) −
L(
y)
]2aT
= h (
z−
x−
y) ◦
a, (
z−
x−
y) ◦
ai
T= <
Tr [((
z−
x−
y) ◦
a)((
z−
x−
y) ◦
a)
]≤ <
Tr((
z−
x−
y) ◦
a)((
z−
x−
y) ◦
a)
∗= k (
z−
x−
y) ◦
ak
2T= k (
z−
x−
y)
a+
a(
z−
x−
y)k
2T4