Circular cone convexity and some inequalities associated with circular cones

(1)

R E S E A R C H

Open Access

Circular cone convexity and some inequalities

associated with circular cones

Jinchuan Zhou

1

_{, Jein-Shan Chen}

2*

_{and Hao-Feng Hung}

2

*_{Correspondence:} [email protected] 2_{Department of Mathematics,} National Taiwan Normal University, Taipei, 11677, Taiwan

Full list of author information is available at the end of the article

Abstract

The study of this paper consists of two aspects. One is characterizing the so-called circular cone convexity of f by exploiting the second-order diﬀerentiability of fLθ_{; the}

other is introducing the concepts of determinant and trace associated with circular cone and establishing their basic inequalities. These results show the essential role played by the angle

θ

, which gives us a new insight when looking into properties about circular cone.

MSC: 26A27; 26B05; 26B35; 49J52; 90C33; 65K05 Keywords: circular cone; convexity; determinant; trace

1 Introduction

Recently, much attention has been paid to the nonsymmetric cone optimization problems, see [–] and the references therein. Unlike symmetric cones [], there is no unified struc-ture for nonsymmetric cones. Hence, how to tackle nonsymmetric cone optimization is still an issue. For symmetric cone optimization, the algebraic structure associated with symmetric cones, including second-order cone and positive semi-definite matrix cones, allows us to study them via exploiting the unified Euclidean Jordan algebra []. In gen-eral, the way to deal with nonsymmetric cone optimization depends on the feature of the associated nonsymmetric cone. In this paper, we focus on a special nonsymmetric cone, circular coneLθ. The circular cone [–] is a pointed closed convex cone having

hyper-spherical sections orthogonal to its axis of revolution about which the cone is invariant to rotation. Let its half-aperture angle be θ with θ∈ (, ◦). Then, it is mathematically expressed as Lθ:= x= (x, x)T∈ R × Rn–| x≥ x cos θ =x= (x, x)T∈ R × Rn–| x≥ x cot θ .

Real applications of a circular cone lie in some engineering problems, for example, in the formulation for optimal grasping manipulation for multi-ﬁngered robots, the grasping force of ith ﬁnger is subject to a circular cone constraint, see [, ] and references for more details.

AlthoughLθis a nonsymmetric cone, we can, due to its special structure, establish the

explicit form of orthogonal decomposition (or spectral decomposition) [] as

x= λ(x)· u()x + λ(x)· u()x , ()

©2013Zhou et al.; licensee Springer. This is an Open Access article distributed under the terms of the Creative Commons Attribu-tion License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribuAttribu-tion, and reproducAttribu-tion in any medium, provided the original work is properly cited.

(2)

where ⎧ ⎨ ⎩ λ(x) = x–x cot θ, λ(x) = x+x tan θ and ⎧ ⎪ ⎨ ⎪ ⎩ u()x =_+cot_θ    cot θ In–  –¯x = sinθ –(sin θ cos θ )¯x , u()x =_+tan_θ    tan θ In–  ¯x = cosθ (sin θ cos θ )¯x

with ¯x= x/x if x= , and ¯xbeing any vector w inRn–satisfyingw =  if x= . Clearly, x∈Lθif and only if λ(x)≥ .

The formula () allows us to deﬁne the following vector-valued function: fLθ_{(x) := f}_λ (x) u()_x + fλ(x) u()_x , ()

where f is a real-valued function from J toR with J being a subset in R. Let S be the set of all x∈ Rn_{whose spectral values λ}

i(x) for i = ,  belong to J, i.e., S :={x ∈ Rn| λi(x)∈ J, i =

, }. According to [], we know that S is open if and only if J is open. In addition, as J is an interval, then S is convex because

minλ(x), λ(y) ≤ λ βx+ ( – β)y≤ λ βx+ ( – β)y ≤ maxλ(x), λ(y) , ∀β ∈ [, ].

Throughout this paper, we always assume that J is an interval inR. Clearly, as θ = ◦, L◦reduces to the second-order cone and the above expressions () and () correspond to the spectral decomposition and the SOC-function associated with the second-order cone, respectively (see [, ] for more information regarding fsoc_).

It is well known that in dealing with symmetric cone optimization problems, such as second-order cone optimization problems and positive semi-deﬁnite optimization prob-lems, this type of vector-valued functions plays an essential role. Inspired by this, we study the properties of fLθ_{, which is crucial for circular cone optimization problems. In}

our previous works, we have studied the smooth and nonsmooth analysis of fLθ _{[, ];}

and the circular cone monotonicity and second-order diﬀerentiability of fLθ _{[]. From}

the aforementioned research, there is an interesting observation: some properties com-monly shared by fsoc_{and f}Lθ _{are independent of the angle θ ; for example, f}Lθ _is

direc-tionally differentiable, Fréchet differentiable, semi-smooth if and only if f is direcdirec-tionally differentiable, Fréchet differentiable, semi-smooth; while some properties are dependent on the angle θ ; for example, fLθ _{with f (t) = –/t for t >  is circular cone monotone as}

θ∈ [◦, ◦), but not circular cone monotone as θ∈ (, ◦).

In this paper, we further study the circular cone convexity of f . More precisely, a real-valued function f : J→ R is said to beLθ-convex of order n on S if for any x, y∈ S,

fLθ_β_x_{+ ( – β)y}

(3)

The characterization ofLθ-convexity is based on the observation that f isLθ-convex if and

only if (fLθ₎_{(x)(h, h)}∈L_θ_{for all h}∈ Rn_{. Our result shows that the circular cone convexity}

requires that the angle θ belongs in [◦, ◦). In particular, we show that f isLθ-convex

of order  if and only if θ∈ [◦, ◦) and f is convex.

On the other hand, using the spectral decomposition (), we deﬁne the determinant and traceof x in the framework of circular cone as

det(x) := λ(x)λ(x) and tr(x) := λ(x) + λ(x),

respectively. In the symmetric cone setting, the concepts of determinant and trace are the key ingredients of barrier and penalty functions which are used in barrier and penalty methods (including interior point methods) for symmetric cone optimization, see [– ]. Here we further study some basic inequalities of det(x) and tr(x) in the framework of circular cone. As seen in Section , the obtained inequalities are classiﬁed into three categories: (i) the ﬁrst class is independent of the angle (i.e., still holds in the framework of circular cone); (ii) the second class is dependent on the angle, for example, for x, y∈Lθ,

the inequality

det(e + x + y)≤ det(e + x) det(e + y),

where e = (, , . . . , )∈ Rn, fails as θ ∈ (, ◦) but holds as θ ∈ [◦, ◦); (iii) the third class always fails no matter what value of θ is chosen. These results give us a new insight into a circular cone and make us focus more on the role played by the angle θ .

The notation used in this paper is standard. For example, denote by Rn _the

n-dimensional Euclidean space and by R+ the set of all nonnegative real scalars, i.e., R+={t ∈ R | t ≥ }. For x, y ∈ Rn, the inner product is denoted by xTy. LetSn mean the spaces of all real symmetric matrices inRn×n_{, and let}_Sn

+denote the cone of positive semi-deﬁnite matrices. We write x_L_θ yto stand for x – y∈Lθ. Finally, we deﬁne _:= 

for convenience.

2 Circular cone convexity

The main purpose of this section is to provide characterizations ofLθ-convex functions.

First, we need the following technical lemma.

Lemma . Given αi∈ R for i = , . . . ,  and βi∈ R for i = , , , we deﬁne

F(β, β, β) := αβ+ αβ+ αββ+ αββ+ αββ+ αβββ. () IfF(β, β, β)≥  for all (β, β, β)∈ R, then

α≥ , α≥ , α≥ , α≥ , α≥ –√αα. Furthermore, if α_≤ ⎧ ⎨ ⎩ αα for α≥ , [α– (α/α)]α for α∈ [–√αα, ), ()

(4)

Proof If β = , thenF(β, β, β) = β[αβ+ αβ]. From F(β, β, β)≥ , we have αβ+ αβ≥ . Thus, α≥  by letting β→  and α≥  by letting β→ .

If β= , thenF(β, β, β) = β[αβ+ αβ]. FromF(β, β, β)≥ , we obtain α≥  and α≥ . If β= , then F(β, β, β) = αβ+ αβ+ αββ= ββ α β β  + α+ α β β  () whenever β=  and β= . Let t = β/β. FromF(β, β, β)≥ , equation () implies

α≥ –αt– α /t, ∀t = , i.e., α≥ max t= –αt– α /t= – min t= αt+ α /t= –√αα. Furthermore, if α≥ , then F(β, β, β)≥ αβ+ αββ+ αβββ= β_ ββ α α/ α/ α β_ ββ ≥ , where the last step is due to

α α/ α/ α S +O,

which is ensured by condition (). Similarly, if α∈ [–√αα, ) (implying α=  in this case), then F(β, β, β) = √ αβ+ α √α β_  + α– α_ α β_+ αββ+ αββ+ αβββ ≥ α– α_ α β_+ αββ+ αβββ =β_ ββ α– (α/α) α/ α/ α β  ββ ≥ , where the last step is due to

α– (α/α) α/ α/ α S +O,

which is ensured by condition () and the fact α– (α/α)≥  since –√αα≤ α< .

This completes the proof.

Lemma .[, Theorem .] Let f : J→ R and fLθ_{be deﬁned as in}_{(). Then f}Lθ_is

(5)

i= , . Moreover, for u, v∈ Rn_{, if x} = , then fLθ_{(x)(u, v)} = ⎧ ⎪ ⎨ ⎪ ⎩ f(x) uTv uv+vu , either u=  or v= , f(x)uTv

f(x)(vu+uv)+f(x)(tan θ –cot θ )(uv+¯uT¯vvu)

, otherwise. If x= , then fLθ_{(x)(u, v) =} I I , where I:= vu˜ξ + ˜ u¯xTv+ v¯xTu +ãvT_u+ (˜η – ã)¯xTv¯xTu, I:= (˜η – ã)u¯xTv+ ( –  ˜d)¯xTv¯xTu+˜vu+ (˜η – ã)v¯xTu ¯x + ˜d¯xT_uv+ vTu¯x+¯xTvu +ã(uv+ vu) with ã =f(λ(x)) – f(λ(x)) λ(x) – λ(x) , ˜ξ = f(λ(x))  + cot_θ + f(λ(x))  + tan_θ, ˜ = – cot θ  + cot_θf _λ (x) + tan θ  + tan_θf _λ (x) , ˜η = cotθ  + cot_θf _λ (x) + tan _θ  + tan_θf _λ (x) , ˜d =  x cot_θ  + cot_θf _λ (x) + tan _θ  + tan_θf _λ (x) –f(λ(x)) – f (λ(x)) λ(x) – λ(x) , = – cot _θ  + cot_θf _λ (x) + tan _θ  + tan_θf _λ (x) .

The characterization ofLθ-convexity is established below, which can be regarded as the

extension of some results given in [, –] from the second-order cone setting to the circular cone setting.

Theorem . Suppose that f : J→ R is second-order continuously diﬀerentiable. If f is Lθ-convex of order n on S, then tan θ≥ , f is convex on J, and for all τ, τ∈ J with τ≤ τ,

f(τ)δ(τ, τ)≥  (τ– τ) δ(τ, τ) () and tanθ δ(τ, τ) + tanθ– δ(τ, τ) f(τ) –  (τ– τ) δ(τ, τ) ≥ –f_(τ ) tan_θ_{– }_δ_(τ , τ)  tan_{θ δ}_(τ , τ) + tan_θ_{– }_δ_(τ , τ) . ()

(6)

Furthermore, if tanθ δ(τ, τ) + tanθ– δ(τ, τ) f(τ)≥  (τ– τ) δ(τ, τ) () and δ(τ, τ)δ(τ, τ)≤  tanθ δ(τ, τ) + tanθ– δ(τ, τ) f(τ)f(τ)(τ– τ), () or if tanθ δ(τ, τ) + tanθ– δ(τ, τ) f(τ) <  (τ– τ) δ(τ, τ) and δ(τ, τ)δ(τ, τ) tanθ– f(τ) ≤tanθ– f(τ)  tan_{θ δ}_(τ , τ) + tanθ– δ(τ, τ) δ(τ, τ) –tanθ δ(τ, τ) + tanθ– δ(τ, τ) f(τ) –  (τ– τ) δ(τ, τ)  × f_(τ )(τ– τ), ()

then f isLθ-convex. Here δ(τ , τ) := f (τ ) – f (τ) – f(τ)(τ – τ) for τ , τ∈ J.

Proof According to [, Theorem .], f isLθ-convex if and only if (fLθ)(x)(h, h)∈Lθfor

all x∈ S and h ∈ Rn_{. We proceed the proof by considering the following three cases.}

Case . For x=  and h= , it follows from Lemma . that fLθ_{(x)(h, h) = f}_(x ) h   .

Hence, (fLθ₎_{(x)(h, h)}∈L_θ_{if and only if f}_(x

)≥ .

Case . For x=  and h= , it follows from Lemma . that fLθ_{(x)(h, h) =} f(x)h f(x)hh+ f(x)(tan θ – cot θ )hh . Hence, (fLθ₎_{(x)(h, h)}∈L

θif and only if f(x)≥  and

tan θh≥h+ (tan θ – cot θ )hh,

i.e., – tan θh_+h ≤h+ (tan θ – cot θ )h h ≤ tan θ h_+h .

(7)

Dividing byhand letting t = h/h yields – tan θt+ ≤ t + tan θ – cot θ ≤ tan θt+ 

⇐⇒ max

t∈R – tan θ

t+ – t≤ tan θ – cot θ ≤ min

t∈Rtan θ

t+ – t ⇐⇒ cot θ – tan θ ≤ tan θ – cot θ ≤ tan θ – cot θ

⇐⇒ tan θ ≥ .

Case . For x= , due to the simpliﬁcation of notation, let us denote

μ:= h– cot θ¯xTh, μ:= h+ tan θ¯xTh, μ:= h– ¯xT h  . () Then ¯xT h= μ– μ

tan θ+ cot θ and h=

tan θ μ_+ cot θ μ

tan θ+ cot θ . ()

Note that μ, μ, and μcan take any value inR × R × R+by taking a suitable value of h (because the vector h has n variables). It follows from Lemma . that

fLθ_{(x)(h, h)} = ˜ξh + ˜¯xThh+ãh+ (˜η – ã)(¯xTh) [( –  ˜d)(¯xT h)+ (˜η – ã)¯xThh]¯x+ [˜h+ ˜dh]¯x+ [ãh+ ˜d¯xTh]h =:  ¯x+ h , where = ˜ξ h+ ˜¯xThh+ãh+ (˜η – ã) ¯xT h  , = ( –  ˜d) ¯xT h  + (˜η – ã)¯xT_hh+˜h+ ˜dh, =  ãh+ ˜d¯xTh . Hence, (fLθ₎_{(x)(h, h)}∈L θis equivalent to ≥  and tanθ≥ ¯x+ h. Note that =   + cot_θf _λ (x) h_– ¯xT_h hcot θ+ ¯xT h  cotθ +   + tan_θf _λ (x) h_+ ¯xT_h htan θ+ ¯xT h  tanθ +ãh– ¯xT h  =   + cot_θf _λ (x) μ_+   + tan_θf _λ (x) μ_+ãμ_. ()

(8)

We now claim that ≥  for all h ∈ Rnif and only if fλ(x) ≥ , fλ(x) ≥ , and ˜a ≥ . ()

The suﬃciency is clear. Let us show the necessity. In particular, choosing h = (– tan θ ,¯x) yields μ=  and μ= . It then follows from ≥  that f(λ(x))≥ . If we choose h= (cot θ ,¯x), then we have f(λ(x))≥ . Finally, choosing h = (, kz) with k∈ R, z =  and zT ¯x=  gives = f(λ(x))  + cot_θ + f(λ(x))  + tan_θ +˜ak _{≥ .}

Dividing by k _{both sides and taking the limits as k}_{→ ∞, we obtain ˜a ≥ . Since λ} i(x)

can take an arbitrary value in J, it is clear that () is equivalent to saying that f(τ )≥  for all τ ∈ J, i.e., f is convex on J. Indeed, the condition ˜a ≥  is ensured by the fact that

˜a =f(λ(x))–f(λ(x)) λ(x)–λ(x) = f

_(t

)≥  for some t∈ (λ(x), λ(x)). Now we calculate the values of and , respectively.

= – cot θ  + cot_θf _λ (x) μ_+ tan θ  + tan_θf _λ (x) μ_ + ˜dμ_– ˜d¯xT_h+˜ah ¯xT h = – cot θ  + cot_θf _λ (x) μ_+ tan θ  + tan_θf _λ (x) μ_+ ˜dμ_–¯xT_h . ()

Meanwhile, it follows from () that

= 

˜atan θ μ+ cot θ μ tan θ+ cot θ + ˜d μ– μ tan θ+ cot θ =  tan θ+ cot θ

μ(˜a tan θ – ˜d) + μ(˜a cot θ + ˜d) . () Note that ¯x+ h= + ¯xTh+ h = _+ ¯xTh+  μ_+¯xT_h  =+ ¯xTh  + _μ_. ()

Putting () and ()-() together, the condition 

tanθ ≥ ¯x + h can be rewritten equivalently as tanθ f(λ(x))  + cot_θμ  + f(λ(x))  + tan_θμ  +˜aμ  ≥ – cot θ  + cot_θf _λ (x) μ_+ tan θ  + tan_θf _λ (x) μ_+ ˜dμ_  +  (tan θ + cot θ )

μ(˜a tan θ – ˜d) + μ(˜a cot θ + ˜d) 

(9)

i.e.,

tanθ– fλ(x) 

μ_+ (tan θ + cot θ )˜atanθ– ˜dμ_ + (tan θ + cot θ )˜a tanθ+ ˜dfλ(x)

– (˜a tan θ – ˜d)μ_μ_ + (tan θ + cot θ )(˜a tan θ – ˜d)fλ(x)

– (˜a cot θ + ˜d)μ_μ_ + tanθ+ fλ(x) fλ(x) μ_μ_

– (˜a tan θ – ˜d)(˜a cot θ + ˜d)μμμ≥ . ()

To apply Lemma ., we need to compute each coeﬃcient in (). By calculation, we have ˜a tan θ – ˜d =f _(λ (x)) – f(λ(x)) λ(x) – λ(x) tan θ–  x cotθ  + cot_θf _λ (x) + tan _θ  + tan_θf _λ (x) +  x f(λ(x)) – f (λ(x)) λ(x) – λ(x) =f _(λ (x)) – f(λ(x)) λ(x) – λ(x) tan θ–  x cot θ tan θ+ cot θf _λ (x) + tan θ tan θ+ ctan θf _λ (x) +  x f(λ(x)) – f (λ(x)) λ(x) – λ(x) = –tan θ+ ctan θ λ(x) – λ(x) fλ(x) + tan θ+ ctan θ [λ(x) – λ(x)] fλ(x) – fλ(x) =(tan θ + cot θ )[f (λ(x)) – f (λ(x)) – f _(λ (x))(λ(x) – λ(x))] [λ(x) – λ(x)] = tan θ+ cot θ [λ(x) – λ(x)] δλ(x), λ(x) ,

where the third equation follows from the fact λ(x) – λ(x) = (tan θ + ctan θ )x. Similarly, we have ˜a tan θ + ˜d =(tan θ + cot θ )[f (λ(x)) – f (λ(x)) + (  tan θ tan θ+cot θf(λ(x)) + cot θ–tan θ tan θ+cot θf(λ(x)))(λ(x) – λ(x))] [λ(x) – λ(x)] = tan θ+ cot θ [λ(x) – λ(x)]  tan_θ tan_+δ λ(x), λ(x) +tan _θ_{– } tan_θ_{+ }δ λ(x), λ(x) , ˜a cot θ + ˜d =(tan θ + cot θ )[f (λ(x)) – f (λ(x)) – f(λ(x))(λ(x) – λ(x))]

[λ(x) – λ(x)] = tan θ+ cot θ [λ(x) – λ(x)] δλ(x), λ(x) , ˜a tan_θ_{+ ˜d}

=(tan θ + cot θ )[f (λ(x)) – f (λ(x)) – [tan

_θf_(λ (x)) + ( – tanθ)f(λ(x))](λ(x) – λ(x))] [λ(x) – λ(x)] = tan θ+ cot θ [λ(x) – λ(x)] tanθ δλ(x), λ(x) +tanθ– δλ(x), λ(x) .

(10)

Corresponding each coeﬃcient in () to (), we know ⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ α= (tanθ– )f(λ(x)), α= (tan θ +cot θ )  [λ(x)–λ(x)]δ(λ(x), λ(x))[  tan_θ tan_θ_+δ(λ(x), λ(x)) +tan _θ_– tan_θ_+δ(λ(x), λ(x))], α= (tan θ +cot θ )  [λ(x)–λ(x)]{[tan _{θ δ}_(λ (x), λ(x)) + (tanθ– )δ(λ(x), λ(x))]f(λ(x)), – δ(λ(x),λ(x)) [λ(x)–λ(x)]}, α= (tan θ +cot θ )  [λ(x)–λ(x)][δ(λ(x), λ(x))f _(λ (x)) – δ(λ(x),λ(x))  [λ(x)–λ(x)]], α= (tanθ+ )f(λ(x))f(λ(x)), α= –(tan θ +cot θ )  [λ(x)–λ(x)]δ(λ(x), λ(x))δ(λ(x), λ(x)).

In view of Lemma ., the condition α≥  means tan θ ≥ , α, α≥  is ensured by the convexity of f (see ()), α≥  corresponds to (), and α≥ –√ααcorresponds to (). In addition, condition () takes the special form () and (), respectively. Theorem . Suppose that f : J→ R is second-order continuously diﬀerentiable. Then f isLθ-convex of order on S if and only if tan θ≥  and f is convex on J.

Proof The necessity is clear from Theorem .. For suﬃciency, note that in () μ=  since¯x=± in this case. Hence, () takes the form of

tanθ– fλ(x)  μ_+ tanθ+ fλ(x) fλ(x) μ_μ_≥  for all μand μ, which is equivalent to verifying

tan θ≥  and fλ(x)

fλ(x)

≥ .

This is ensued by the conditions that tan θ≥  and f is convex on J. Thus, the proof is

complete.

If, in particular, θ = ◦, then () and () reduce to [, () in Proposition .]; () re-duces to [, () in Proposition .]. In addition, due to (), () holds automatically in this case. The above results indicate that theLθ-convexity is dependent on the properties of f

and the angle θ together.

3 Inequalities associated with circular cone

In this section, we establish some inequalities associated with circular cone, which we be-lieve will be useful for further analyzing the properties of fLθ_{and proving the convergence}

of interior point methods for optimization problems involved in circular cones.

In [], the author establishes the following results in the framework of second-order cone. More speciﬁcally, for x_L◦ and y_L◦, then

(a) det(e + x)/_{≥  + det(x)}/_, (b) det(x + y)≥ det(x) + det(y),

(c) det(αx + ( – α)y)≥ α_{det(x) + ( – α)}_det(y),_{∀α ∈ [, ],} (d) det(e + x + y)≤ det(e + x) det(e + y),

(11)

(e) If xL◦yL◦, then det(x)≥ det(y), tr(x) ≥ tr(y), and λi(x)≥ λi(y)for i = , ,

(f ) tr(x + y) = tr(x) + tr(y) and det(γ x) = γ_det(x)_{for all γ}_{∈ R.}

In the following, we show that, in the framework of circular cone, the above inequalities can be classiﬁed into three categories. The ﬁrst class holds independent of the angle, e.g., (a); the second class holds dependent on the angle, e.g., (b)-(e); the third class fails no matter what value of the angle is chosen, e.g., (f ).

Theorem . Let x= (x, x)∈ R × Rn–possess spectral factorization associated with cir-cular cone given as in(). Then

(a) [det(e + x)]/_{≥  + det(x)}/_{for all x}_∈_L θ;

(b) If x_L_θ y, then λ(x)≥ λ(y).

Proof (a) Note that det(x)≥  and det(e + x) ≥  since x, x + e ∈Lθ. Therefore,

det(e + x)/≥  + det(x)/

⇐⇒ det(e + x) ≥  +  det(x)/_{+ det(x)} ⇐⇒ λ(e + x)λ(e + x)≥  +  λ(x)λ(x) + λ(x)λ(x) ⇐⇒ x+  –x cot θ x+  +x tan θ ≥  + λ(x)λ(x) + λ(x)λ(x) ⇐⇒ λ(x) +  λ(x) +  ≥  + λ(x)λ(x) + λ(x)λ(x) ⇐⇒ λ(x)λ(x) + λ(x) + λ(x) + ≥  +  λ(x)λ(x) + λ(x)λ(x) ⇐⇒ λ(x) + λ(x)≥  λ(x)λ(x) ⇐⇒ λ(x) + λ(x)  ≥ λ(x)λ(x).

Hence, to prove the desired result, it suﬃces to show that λ(x) + λ(x)

 ≥

λ(x)λ(x),

which is clearly true by the arithmetic mean-geometric mean (AM-GM) inequality. (b) Since x – y∈Lθ, we know x– y≥ x– y cot θ ≥ x – y cot θ,

i.e., λ(x) = x–x cot θ ≥ y–y cot θ = λ(y). Theorem . Let x= (x, x)∈ R × Rn–possess spectral factorization associated with cir-cular cone given as in(). Then the following hold.

(a) For all x, y∈Lθ,

det(x + y)≥ det(x) + det(y) +x+y

cscθ–x_+ y_secθ. In particular, when θ∈ (, ◦], we have

(12)

(b) For all x, y∈Lθand α∈ [, ], detαx+ ( – α)y ≥ α_{det(x) + ( – α)}_{det(y) +}_α_x + ( – α)y cscθ –αx_+ ( – α)y_secθ.

In particular, when θ∈ (, ◦], we have

detαx+ ( – α)y≥ αdet(x) + ( – α)det(y). (c) If x, y∈Lθand θ∈ [◦, ◦), then

det(e + x + y)≤ det(e + x) det(e + y). ()

(d) If x_L_θy_L_θ and θ∈ (, ◦], then

λ(x)≥ λ(y), det(x)≥ det(y), and tr(x) ≥ tr(y). () Proof (a) Notice that

det(x + y) = λ(x + y)· λ(x + y) =x+ y–x+ y cot θ x+ y+x+ y tan θ = (x+ y)+ (x+ y)x+ y tan θ – (x+ y)x+ y cot θ – x+ y and det(x) + det(y) = λ(x)λ(x) + λ(y)λ(y) =x–x cot θ x+x tan θ +y–y cot θ y+y tan θ

= x_+ xx tan θ – xx cot θ – x+ y+ yy tan θ – yy cot θ – y = x_+ y_+ xx tan θ + yy tan θ – xx cot θ – yy cot θ – x–y. Then we have

det(x + y) – det(x) – det(y) = xy– xTy+ xx+ y + yx+ y – xx – yy tan θ –xx+ y + yx+ y – xx – yy cot θ. Using x, y∈Lθ(and hence x + y∈Lθ) gives

xtan θ≥ x, –xtan θ≤ –x, x≥ x cot θ, –x≤ –x cot θ, –(x+ y)≤ –x+ y cot θ.

(13)

Thus,

det(x + y) – det(x) – det(y)

≥ xy– xTy+xx+ y + yx+ y – xtanθ– ytanθ – x(x+ y) – y(x+ y) +xcotθ+ycotθ = xy– xTy+x+ y x + y –x_+ y_tanθ – (x+ y)+ x+y cotθ ≥ x+ y– x_+ y_tanθ– x_– y_– xT_y+ x+y cotθ =x+y– x_+ y_tanθ– x_– y_+x+y cotθ =x+y  + cotθ–x_+ y_ + tanθ =x+y cscθ–x + y secθ,

which is the desired result.

When θ ∈ (, ◦], we know tan θ ≤ cot θ. Since x, y ∈Lθ, i.e., x ≥ x cot θ and y≥ y cot θ, there exist a, b ≥  such that x=x cot θ + a and y =y cot θ + b. Hence,

det(x + y) – det(x) – det(y) = xy– xTy+ xx+ y + yx+ y – xx – yy tan θ –xx+ y + yx+ y – xx – yy cot θ =x + y x + y – x+ y cotθ +x+ y x + y – x+ y + ab + a cot θy + x – x+ y + a tan θy cotθ+x+ y – x + b cot θy + x – x+ y + b tan θx cotθ+x+ y – y ≥ ,

where the last step is due to x + y ≥ x+ y, x cotθ +x + y – y ≥ x + x+ y – y ≥ , and y cotθ+x+ y – x ≥ y + x+ y – x ≥  since cot θ≥ , due to θ ∈ (, ◦].

(b) The result follows from the fact that det(γ x) = γ_{det(x) for all γ}_{≥ .}

(c) Since θ ∈ [◦, ◦), cot θ ≤ . For x, y ∈Lθ, there exist two nonnegative scalars a, b≥  such that x=x cot θ + a and y=y cot θ + b. This implies

det(e + x) =x+  –x cot θ x+  +x tan θ = (a + )(cot θ + tan θ )x + (a + ), det(e + y) =y+  –y cot θ y+  +y tan θ = (b + )(cot θ + tan θ )y + (b + ).

(14)

Thus, we obtain det(e + x) det(e + y)

= (a + )(b + )(cot θ + tan θ )xy + (a + )(b + )(cot θ + tan θ )x

+ (a + )(b + )(cot θ + tan θ )y + (a + )(b + ). () On the other hand,

det(e + x + y) =x+ y+  –x+ y cot θ x+ y+  +x+ y tan θ =x + y – x+ y cot θ+ (a + b + ) ×x + y cot θ+x+ y tan θ + (a + b + ) =x + y – x+ y x + y cotθ +x + y – x+ y x+ y + (a + b + ) x + y – x+ y cot θ + (a + b + )x + y cot θ+ (a + b + )x+ y tan θ + (a + b + ) =  cotθxy + (a + b + ) cot θx + (a + b + ) cot θy + (a + b + )

+x+y cotθ+ – cotθx+ y x + y + (a + b + )(tan θ – cot θ )x+ y – x+ y ≤  cot_θ_x

y + (a + b + ) cot θx + (a + b + ) cot θy + (a + b + ) +x+y cotθ+ – cotθx + y  + (a + b + )(tan θ – cot θ )x + y –x+y + xy =  cotθxy + (a + b + )(cot θ + tan θ)x + (a + b + )(cot θ + tan θ)y

+ (a + b + )+ – cotθx + y 

–x+y

+ xy = xy + (a + b + )(cot θ + tan θ)x + (a + b + )(ctan θ + tan θ)y

+ (a + b + ). ()

Note that (a + )(b + )(cot θ + tan θ )_{≥ (cot θ + tan θ)}_{≥  and} (a + )(b + )≥ a + b + ,

(a + )(b + )≥ a + b + , (a + )(b + )≥ (a + b + ). Hence, comparing () and () yields

det(e + x + y)≤ det(e + x) det(e + y).

(d) For θ∈ (, ◦], since cot θ≥ tan θ and x – y ∈Lθ, we know x– y≥ x– y cot θ ≥ x– y tan θ ≥

y – x

(15)

which means

λ(x) = x+x tan θ ≥ y+y tan θ = λ(y).

This together with the fact λ(x)≥ λ(y) by Part (b) in Theorem . and λi(x), λi(y)≥  for i= ,  (due to x, y∈Lθ) further yields

det(x) = λ(x)λ(x)≥ λ(y)λ(y) = det(y). Meanwhile, we obtain

tr(x) = λ(x) + λ(x)≥ λ(y) + λ(y) = tr(y).

Here are some remarks for Theorem ..

(i) Inequality () fails when θ∈ (◦, ◦). For example, let x = (, , ), y = (, –, –), and cot θ = .. Then det(x) = det(y) = / and det(x + y) = , which says

det(x + y) =  <  = det(x) + det(y).

(ii) Inequality () fails when θ∈ (, ◦). For example, let x = (/, /), y= (/, –/), and cot θ = . Then

det(e + x + y) = (.)_{> (.)}_{= det(e + x) det(e + y).}

(iii) Inequality () fails when θ∈ (◦, ◦). For example, for x = (., ), y = (, ), and cot θ= .. Then x_L_θy, λ(x) = . <  = λ(y),

det(x) = (. – .)(. + ) = . < . = ( – .)( + ) = det(y), and tr(x) = . < . = tr(y).

Next, let us move from inequalities to equalities. In particular, we focus on two identities in the framework of second-order cone as below

tr(x + y) = tr(x) + tr(y) and det(γ x) = γdet(x), ∀γ ∈ R. () But these two identities fail to hold in the circular cone setting no matter what value of the angle is chosen. In fact, in the second-order cone case,

tr(x) = x and det(x) = x–x.

Hence, () holds trivially. For the circular cone setting, we have tr(x) = x+x(tan θ – cot θ) and det(x) =

x–x cot θ x+x tan θ . Thus, tr(x) is not linear any more, i.e., tr(x + y)= tr(x)+tr(y); e.g., for x = (, ) and y = (, –), and cot θ = / (or cot θ = ). Then

tr(x + y) = =  = tr(x) + tr(y) or tr(x + y) = = – = tr(x) + tr(y).

In addition, det(γ x) = γ_{det(x) holds as γ} _{≥  but not true as γ < ; e.g., for x = (, ),} γ = –, and cot θ = / (or cot θ = ), then

(16)

The precise relationship between tr(x + y) and tr(x) + tr(y) is provided as below. Theorem . tr(x + y) ⎧ ⎨ ⎩ ≥ tr(x) + tr(y) as θ ∈ (, ◦_], ≤ tr(x) + tr(y) as θ ∈ [◦_{, }◦_). Proof The result follows from the fact that

tr(x + y) – tr(x) – tr(y) =x + y – x+ y

(cot θ – tan θ ).

Note that tr(x) is positively homogeneous, i.e., tr(γ x) = γ tr(x) for all γ ≥ . This together with Theorem . yields the following result.

Corollary . The trace tr(x) is concave as θ∈ (, ◦] and is convex as θ∈ [◦, ◦). These results further indicate that the angle plays an essential role for a circular cone. As in symmetric cone optimization, we believe that these inequalities about det(x) and tr(x) are key ingredients in penalty and barrier functions which can be adapted in design-ing barrier and penalty algorithms (includdesign-ing interior point algorithm) for circular cone optimization. This merits our further research.

Competing interests

The authors declare that they have no competing interests. Authors’ contributions

All authors read and approved the ﬁnal manuscript. Author details

1_{Department of Mathematics, School of Science, Shandong University of Technology, Zibo, 255049, P.R. China.} 2_{Department of Mathematics, National Taiwan Normal University, Taipei, 11677, Taiwan.}

Authors’ information

The second author is a member of Mathematics Division, National Center for Theoretical Sciences, Taipei Oﬃce. Acknowledgements

We are gratefully indebted to anonymous referees for their valuable suggestions that helped us to essentially improve the original presentation of the paper. The ﬁrst author’s work is supported by the National Natural Science Foundation of China (11101248, 11271233) and Shandong Province Natural Science Foundation (ZR2010AQ026, ZR2012AM016). The second author’s work is supported by the National Science Council of Taiwan.

Received: 17 July 2013 Accepted: 13 November 2013 Published:04 Dec 2013 References

1. Matsukawa, Y, Yoshise, A: A primal barrier function Phase I algorithm for nonsymmetric conic optimization problems. Jpn. J. Ind. Appl. Math. 29, 499-517, (2012)

2. Nesterov, Y: Towards nonsymmetric conic optimization. CORE (2006)

3. Skajaa, A, Jørgensen, JB, Hansen, PC: On implementing a homogeneous interior-point algorithm for nonsymmetric conic optimization. DTU Informatics, Building 321 (2011)

4. Skajaa, A, Ye, Y-Y: A homogeneous interior-point algorithm for nonsymmetric convex conic optimization (2012, submitted manuscript)

5. Faraut, U, Korányi, A: Analysis on Symmetric Cones. Oxford Mathematical Monographs. Oxford University Press, New York (1994)

6. Dattorro, J: Convex Optimization and Euclidean Distance Geometry. Meboo Publishing, Palo Alto (2005)

7. Zhou, J-C, Chen, J-S: Properties of circular cone and spectral factorization associated with circular cone. J. Nonlinear Convex Anal. 14, 807-816 (2013)

8. Zhou, J-C, Chen, J-S: On the vector-valued functions associated with circular cones (2013, submitted manuscript) 9. Zhou, J-C, Chen, J-S: Circular cone monotonicity and second order diﬀerentiability of vector-valued functions

(17)

10. Chang, Y-L, Yang, C-Y, Chen, J-S: Smooth and nonsmooth analysis of vector-valued functions associated with circular cones. Nonlinear Anal., Theory Methods Appl. 85, 160-173 (2013)

11. Ko, C-H, Chen, J-S: Optimal grasping manipulation for multiﬁngered robots using semismooth Newton method. Math. Probl. Eng. 2013, Article ID 681710 (2013)

12. Pan, S-H, Chiang, Y, Chen, J-S: SOC-monotone and SOC-convex functions v.s. matrix-monotone and matrix-convex functions. Linear Algebra Appl. 437(5), 1264-1284 (2012)

13. Chen, J-S, Chen, X, Tseng, P: Analysis of nonsmooth vector-valued functions associated with second-order cone. Math. Program. 101(1), 95-117 (2004)

14. Chen, J-S, Pan, S-H: A survey on SOC complementarity functions and solution methods for SOCPs and SOCCPs. Pac. J. Optim. 8(1), 33-74 (2012)

15. Auslender, A: Penalty and barrier methods: a uniﬁed framework. SIAM J. Optim. 10, 211-230 (1999)

16. Auslender, A, Ramirez, H: Penalty and barrier methods for convex semideﬁnite programming. Math. Methods Oper. Res. 63, 195-219 (2006)

17. Chang, Y-L, Chen, J-S: Convexity of symmetric cone trace functions in Euclidean Jordan algebras. J. Nonlinear Convex Anal. 14(1), 53-61 (2013)

18. Chen, J-S: The convex and monotone functions associated with second-order cone. Optimization 55(4), 363-385 (2006)

19. Chen, J-S, Chen, X, Pan, S-H, Zhang, J: Some characterizations for SOC-monotone and SOC-convex functions. J. Glob. Optim. 45(2), 259-279 (2009)

20. Chen, J-S, Liao, T-K, Pan, S-H: Using Schur complement theorem to prove convexity of some SOC-functions. J. Nonlinear Convex Anal. 13, 421-431 (2012)

10.1186/1029-242X-2013-571

Cite this article as: Zhou et al.: Circular cone convexity and some inequalities associated with circular cones. Journal