2 Circular cone

(1)

Pacific Journal of Optimization, vol. 14, no. 3, pp. 399-419, 2018

From symmetric cone optimization to nonsymmetric cone optimization: Spectral decomposition, nonsmooth analysis, and

projections onto nonsymmetric cones

¹

Xin-He Miao²

Department of Mathematics Tianjin University, China

Tianjin 300072, China

Yue Lu ³

School of Mathematical Sciences Tianjin Normal University

Tianjin 300387, China

Jein-Shan Chen ⁴ Department of Mathematics National Taiwan Normal University

Taipei 11677, Taiwan October 6, 2017

(revised on February 8, 2018)

Abstract. It is well known that Euclidean Jordan algebra is an unified framework for symmetric cone programs, including positive semidefinite programs and second-order cone programs. Unlike symmetric cone programs, there is no unified analysis technique to deal with nonsymmetric cone programs. Nonetheless, there are several common concepts

1This is an extended version of “From symmetric cone optimization to nonsymmetric cone optimization: Projections onto nonsymmetric cones”, Proceedings of the Twenty-Eighth RAMP Symposium, Niigata University, pp. 25-34, October, 2016.

2E-mail: xinhemiao@tju.edu.cn. The author’s work is supported by National Natural Science Foun- dation of China (No. 11471241).

3The research is supported by National Natural Science Foundation of China (Grant Number:

11601389), Doctoral Foundation of Tianjin Normal University (Grant Number: 52XB1513) and 2017- Outstanding Young Innovation Team Cultivation Program of Tianjin Normal University (Grant Number:

135202TD1703).

4Corresponding author. The author’s work is supported by Ministry of Science and Technology, Taiwan. E-mail: jschen@math.ntnu.edu.tw

(2)

when dealing with general conic optimization. More specifically, we believe that spectral decomposition associated with cones, nonsmooth analysis regarding cone-functions, projections onto cones, and cone-convexity are the bridges between symmetric cone programs and nonsymmetric cone programs. Hence, this paper is devoted to looking into the first three items in the setting of nonsymmetric cones. The importance of cone-convexity is recognized in the literature so that it is not discussed here. All results presented in this paper are very crucial to subsequent study about the optimization problems associated with nonsymmetric cones.

Key words. Spectral decomposition, nonsmooth analysis, projection, symmetric cone, nonsymmetric cone.

Mathematics Subject Classification: 49M27, 90C25

1 Introduction

Symmetric cone optimization, including SDP (positive semidefinite programming) and SOCP (second-order cone programming) as special cases, has been a popular topic during the past two decades. In fact, for many years, there has been much attention on symmetric cone optimization, see [10, 11, 14, 20, 27, 30, 32, 35, 38] and references therein.

Recently, some researchers have paid attention to nonsymmetric cones, for example, homogeneous cone [9, 28, 40], matrix norm cone [18], p-order cone [1, 23, 41], hyperbolicity cone [24, 26, 36], circular cone [13, 15, 42] and copositive cone [16], etc.. In general, the structure of symmetric cone is quite different from the one of non-symmetric cone.

In particular, unlike the symmetric cone optimization in which the Euclidean Jordan algebra can unify the analysis, so far no unified algebra structure has been found for non-symmetric cone optimization. This motivates us to find the common bridge between them. Based on our earlier experience, we think the following four items are crucial:

• spectral decomposition associated with cones.

• smooth and nonsmooth analysis for cone-functions.

• projection onto cones.

• cone-convexity.

The role of cone-convexity had been recognized in the literature. In this paper, we focus on the other three items that are newly explored recently by the authors. Moreover, we look into several kinds of nonsymmetric cones, that is, the circular cone, the p-order cone, the geometric cone, the exponential cone and the power cone, respectively. The symmetric cone can be unified under Euclidean Jordan algebra, which will be introduced

(3)

later. Unlike the symmetric cone, there is no unified framework for dealing with nonsymmetric cones. This is the main source where the difficulty arises from. Note that the homogeneous cone can be unified under so-called T -algebra [28, 39, 40].

We begin with introducing Euclidean Jordan algebra [29] and symmetric cone [19].

Let V be an n-dimensional vector space over the real field R, endowed with a bilinear mapping (x, y) 7→ x ◦ y from V × V into V. The pair (V, ◦) is called a Jordan algebra if (i) x ◦ y = y ◦ x for all x, y ∈ V,

(ii) x ◦ (x²◦ y) = x²◦ (x ◦ y) for all x, y ∈ V.

Note that a Jordan algebra is not necessarily associative, i.e., x ◦ (y ◦ z) = (x ◦ y) ◦ z may not hold for all x, y, z ∈ V. We call an element e ∈ V the identity element if x ◦ e = e ◦ x = x for all x ∈ V. A Jordan algebra (V, ◦) with an identity element e is called a Euclidean Jordan algebra if there is an inner product, h·, ·i_V, such that

(iii) hx ◦ y, zi

V = hy, x ◦ zi

V for all x, y, z ∈ V.

Given a Euclidean Jordan algebra A = (V, ◦, h·, ·i_V), we denote the set of squares as K := x²| x ∈ V .

By [19, Theorem III.2.1], K is a symmetric cone. This means that K is a self-dual closed convex cone with nonempty interior and for any two elements x, y ∈ intK, there exists an invertible linear transformation T : V → V such that T (K) = K and T (x) = y.

Below are three well-known examples of Euclidean Jordan algebras.

Example 1.1. Consider Rⁿ with the (usual) inner product and Jordan product defined respectively as

hx, yi =

n

X

i=1

x_iy_i and x ◦ y = x ∗ y ∀x, y ∈ Rⁿ

where x_i denotes the ith component of x, etc., and x ∗ y denotes the componentwise product of vectors x and y. Then, Rⁿis a Euclidean Jordan algebra with the nonnegative orthant Rⁿ+ as its cone of squares.

Example 1.2. Let Sⁿ be the space of all n × n real symmetric matrices with the trace inner product and Jordan product, respectively, defined by

hX, Y i_T:= Tr(XY ) and X ◦ Y := 1

2(XY + Y X) ∀X, Y ∈ Sⁿ.

Then, (Sⁿ, ◦, h·, ·i_T) is a Euclidean Jordan algebra, and we write it as Sn. The cone of squares Sⁿ+ in Sn is the set of all positive semidefinite matrices.

(4)

Example 1.3. The Jordan spin algebra Ln. Consider Rⁿ (n > 1) with the inner product h·, ·i and Jordan product

x ◦ y :=

hx, yi x₀y + y¯ ₀x¯

for any x = (x₀, ¯x), y = (y₀, ¯y) ∈ R × Rⁿ⁻¹. We denote the Euclidean Jordan algebra (Rⁿ, ◦, h·, ·i) by Ln. The cone of squares, called the Lorentz cone (or second-order cone), is given by

L⁺n :=(x₀; ¯x) ∈ R × Rⁿ⁻¹| x₀ ≥ k¯xk .

For any given x ∈ A, let ζ(x) be the degree of the minimal polynomial of x, i.e., ζ(x) := mink : {e, x, x², · · · , x^k} are linearly dependent .

Then, the rank of A is defined as max{ζ(x) : x ∈ V}. In this paper, we use r to denote the rank of the underlying Euclidean Jordan algebra. Recall that an element c ∈ V is idempotent if c² = c. Two idempotents c_i and c_j are said to be orthogonal if c_i ◦ c_j = 0.

One says that {c₁, c₂, . . . , c_k} is a complete system of orthogonal idempotents if

c²_j = c_j, c_j ◦ c_i = 0 if j 6= i for all j, i = 1, 2, · · · , k, and

k

X

j=1

c_j = e.

An idempotent is primitive if it is nonzero and cannot be written as the sum of two other nonzero idempotents. We call a complete system of orthogonal primitive idempotents a Jordan frame. Now we state the second version of the spectral decomposition theorem.

Theorem 1.1. [19, Theorem III.1.2] Suppose that A is a Euclidean Jordan algebra with the rank r. Then, for any x ∈ V, there exists a Jordan frame {c1, . . . , c_r} and real numbers λ1(x), . . . , λr(x), arranged in the decreasing order λ1(x) ≥ λ2(x) ≥ · · · ≥ λr(x), such that

x = λ₁(x)c₁+ λ₂(x)c₂+ · · · + λ_r(x)c_r.

The numbers λj(x) (counting multiplicities), which are uniquely determined by x, are called the eigenvalues and tr(x) =Pr

j=1λ_j(x) the trace of x.

From [19, Prop. III.1.5], a Jordan algebra (V, ◦) with an identity element e ∈ V is Euclidean if and only if the symmetric bilinear form tr(x ◦ y) is positive definite. Then, we may define another inner product on V by hx, yi := tr(x ◦ y) for any x, y ∈ V. The inner product h·, ·i is associative by [19, Prop. II. 4.3], i.e., hx, y ◦ zi = hy, x ◦ zi for any x, y, z ∈ V. Every Euclidean Jordan algebra can be written as a direct sum of so- called simple ones, which are not themselves direct sums in a nontrivial way. In finite dimensions, the simple Euclidean Jordan algebras come from the following five basic structures.

(5)

Theorem 1.2. [19, Chapter V.3.7] Every simple Euclidean Jordan algebra is isomorphic to one of the following.

(i) The Jordan spin algebra Lⁿ.

(ii) The algebra Sⁿ of n × n real symmetric matrices.

(iii) The algebra Hⁿ of all n × n complex Hermitian matrices.

(iv) The algebra Qⁿ of all n × n quaternion Hermitian matrices.

(v) The algebra O³ of all 3 × 3 octonion Hermitian matrices.

Given an n-dimensional Euclidean Jordan algebra A = (V, h·, ·i, ◦) with K being its corresponding symmetric cone in V. For any scalar function f : R → R, we define a vector-valued function f^sc(x) (called L¨owner function) on V as

f^sc(x) = f (λ₁(x))c₁+ f (λ₂(x))c₂+ · · · + f (λ_r(x))c_r (1) where x ∈ V has the spectral decomposition

x = λ₁(x)c₁+ λ₂(x)c₂+ · · · + λ_r(x)c_r.

When V is the space Sⁿ which means n × n real symmetric matrices. The spectral decomposition reduces to the following: for any X ∈ Sⁿ,

X = P





 λ₁

. ..

λ_n





P^T,

where λ₁, λ₂, · · · , λ_n are eigenvalues of X and P is orthogonal (i.e., P^T = P⁻¹). Under this setting, for any function f : R → R, we define a corresponding matrix valued function associated with the Euclidean Jordan algebra Sⁿ := Sym(n, R), denoted by f^mat: Sⁿ → Sⁿ, as

f^mat(X) = P





 f (λ₁)

. ..

f (λn)





P^T.

For this case, Chen, Qi and Tseng in [12] show that the function f^mat inherits from f the properties of continuity, Lipschitz continuity, directional differentiability, Fr´echet differentiability, continuous differentiability, as well as semismoothness. We state them as below.

Theorem 1.3. (a) f^mat is continuous if and only if f is continuous.

(6)

(b) f^mat is directionally differentiable if and only if f is directionally differentiable.

(c) f^mat is Fr´echet-differentiable if and only if f is Fr´echet-differentiable.

(d) f^mat is continuously differentiable if and only if f is continuously differentiable.

(e) f^mat is locally Lipschitz continuous if and only if f is locally Lipschitz continuous.

(f ) f^mat is globally Lipschitz continuous with constant κ if and only if f is globally Lip- schitz continuous with constant κ.

(g) f^mat is semismooth if and only if f is semismooth.

When V is the Jordan spin algebra Lⁿ in which K corresponds to the second-order cone (SOC), which is defined as

Kⁿ := {(x₁, ¯x) ∈ R × Rⁿ⁻¹| k¯xk ≤ x₁},

the function f^sc reduces to so-called SOC-function f^soc studied in [4, 6, 7, 8]. More specifically, under such case, the spectral decomposition for any x = (x₁, ¯x) ∈ R × Rⁿ⁻¹ becomes

x = λ₁(x)u⁽¹⁾_x + λ₂(x)u⁽²⁾_x , (2) where λ₁(x), λ₂(x), u⁽¹⁾x and u⁽²⁾x with respect to Kⁿ are given by

λi(x) = x1+ (−1)ⁱk¯xk, u⁽ⁱ⁾_x =







1 2

1, (−1)ⁱ x¯ k¯xk

if ¯x 6= 0,

1 2

1, (−1)ⁱw

if ¯x = 0,

for i = 1, 2, with w being any vector in Rⁿ⁻¹ satisfying kwk = 1. If ¯x 6= 0, the decomposition (2) is unique. With this spectral decomposition, for any function f : R → R, the L¨owner function f^sc associated with Kⁿ reduces to f^soc as below:

f^soc(x) = f (λ₁(x))u⁽¹⁾_x + f (λ₂(x))u⁽²⁾_x ∀x = (x₁, ¯x) ∈ R × Rⁿ⁻¹. (3) The picture of second-order cone Kⁿ in R³ is depicted in Figure 1.

For general symmetric cone case, Baes [2] consider the convexity and differentiability properties of spectral functions. For this SOC setting, Chen, Chen and Tseng in [8] show that the function f^soc inherits from f the properties of continuity, Lipschitz continuity, directional differentiability, Fr´echet differentiability, continuous differentiability, as well as semismoothness. In other words, the following hold.

Theorem 1.4. (a) f^soc is continuous if and only if f is continuous.

(b) f^soc is directionally differentiable if and only if f is directionally differentiable.

(7)

Figure 1: The second-order cone in R³

(c) f^soc is Fr´echet-differentiable if and only if f is Fr´echet-differentiable.

(d) f^soc is continuously differentiable if and only if f is continuously differentiable.

(e) f^soc is locally Lipschitz continuous if and only if f is locally Lipschitz continuous.

(f ) f^soc is globally Lipschitz continuous with constant κ if and only if f is globally Lips- chitz continuous with constant κ.

(g) f^soc is semismooth if and only if f is semismooth.

As for general symmetric cone case, Sun and Sun [38] uses φ

V to denote f^sc defined as in (1). More specifically, for any function φ : R → R, they define a corresponding function associated with the Euclidean Jordan algebra V by

φV(x) = φ(λ₁(x))c₁+ φ(λ₂(x))c₂+ · · · + φ(λ_r(x))c_r,

where λ₁(x), λ₂(x), · · · , λ_r(x) and c₁, c₂, · · · , c_r are the spectral values and spectral vectors of x, respectively. In addition, Sun and Sun [38] extend some of the aforementioned results to more general symmetric cone case regarding f^sc (i.e., φ

V).

Theorem 1.5. Assume that the symmetric cone is simple in the Euclidean Jordan algebra V.

(a) φ

V is continuous if and only if φ is continuous.

(b) φ_V is directionally differentiable if and only if φ is directionally differentiable.

(c) φ

V is Fr´echet-differentiable if and only if φ is Fr´echet-differentiable.

(d) φ

V is continuously differentiable if and only if φ is continuously differentiable.

(8)

(e) φ

V is semismooth if and only if φ is semismooth.

With respect to matrix cones, Ding et al. [17] recently introduce a class of matrix- valued functions, which is called spectral operator of matrices. This class of functions generalizes the well known L¨owner operator and has been used in many important applications related to structured low rank matrices and other matrix optimization problems in machine learning and statistics. Similar to Theorem 1.4 and Theorem 1.5, the continuity, directional differentiability and Frechet-differentiability of spectral operator are also obtained. See [17, Theorem 3, 4 and 5] for more details.

For subsequent needs, for a closed convex cone K ⊆ Rⁿ, we also recall its dual cone, polar cone, and the projection onto itself. For any a given closed convex cone K ⊆ Rⁿ, its dual cone is defined by

K^∗ := {y ∈ Rⁿ| hy, xi ≥ 0, ∀x ∈ K},

and its polar cone is K^◦ := −K^∗. Let ΠK(z) denote the Euclidean projection of z ∈ Rⁿ onto the closed convex cone K. Then, it follows that z = ΠK(z) − ΠK^∗(−z) and

ΠK(z) = argmin_x∈K 1

2kx − zk².

2 Circular cone

The definition of the circular cone L_θ is defined as [42]:

L_θ := x = (x1, ¯x) ∈ R × Rⁿ⁻¹| kxk cos θ ≤ x₁

= x = (x₁, ¯x) ∈ R × Rⁿ⁻¹| k¯xk ≤ x₁tan θ .

From the concept of the circular cone L_θ, we know that when θ = ^π₄, the circular cone is exactly the second-order cone Kⁿ. In addition, we also see that L_θ is solid (i.e., int L_θ 6= ∅), pointed (i.e., L_θ∩ −L_θ = 0), closed convex cone, and has a revolution axis which is the ray generated by the canonical vector e₁ := (1, 0, · · · , 0)^T ∈ Rⁿ. Moreover, its dual cone is given by

L^∗_θ := {y = (y₁, ¯y) ∈ R × Rⁿ⁻¹| kyk sin θ ≤ y₁}

= {y = (y₁, ¯y) ∈ R × Rⁿ⁻¹| k¯yk ≤ y₁cot θ}

= L^π

2−θ.

The pictures of circular cone L_θ in R³ are depicted in Figure 2.

In view of the expression of the dual cone L^∗_θ, we see that the dual cone L^∗_θ is also a solid, pointed, closed convex cone. By the reference [42], the explicit formula of projection onto the circular cone L_θ can be expressed by in the following theorem.

(9)

Figure 2: Three different circular cones in R³.

Theorem 2.1. ([42]) Let x = (x₁, ¯x) ∈ R × Rⁿ⁻¹ and x₊ denote the projection of x onto the circular cone L_θ. Then x₊ is given below:

x₊=







x if x ∈ L_θ, 0 if x ∈ −L^∗_θ, u otherwise, where

u =







x₁+ k¯xk tan θ 1 + tan²θ

x₁+ k¯xk tan θ 1 + tan²θ tan θ

x¯ k¯xk





.

Zhou and Chen [42] also present the decomposition of x, which is similar to the one in the setting of second-order cone.

Theorem 2.2. ([42, Theorem 3.1]) For any x = (x₁, ¯x) ∈ R × Rⁿ⁻¹, one has x = λ₁(x)u⁽¹⁾_x + λ₂(x)u⁽²⁾_x ,

where

λ1(x) = x1 − k¯xk cot θ λ₂(x) = x₁ + k¯xk tan θ and

u⁽¹⁾_x = 1 1 + cot²θ

1 0

0 cot θ

1

−w

u⁽²⁾_x = 1 1 + tan²θ

1 0

0 tan θ

1 w

with w = _k¯^x_xk^¯ if ¯x 6= 0, and any vector in Rⁿ⁻¹ satisfying kwk = 1 if ¯x = 0.

(10)

Theorem 2.3. ([42, Theorem 3.2]) For any x = (x₁, ¯x) ∈ Rⁿ× R, we have x+= (λ1(x))+u⁽¹⁾_x + (λ2(x))+u⁽²⁾_x ,

where (a)₊ := max{0, a}, λ_i(x) and u⁽ⁱ⁾x for i = 1, 2 are given as in Theorem 2.2.

With this spectral decomposition of x, for any function f : R → R, the L¨owner function f^circ associated with L_θ is defined as below:

f^circ(x) = f (λ₁(x))u⁽¹⁾_x + f (λ₂(x))u⁽²⁾_x ∀x = (x₁, ¯x) ∈ R × Rⁿ⁻¹. (4) In [15], Chang, Yang and Chen have obtained that many properties of the function f^circ are inherited from the function f , which is represented in the following theorem.

Theorem 2.4. ([15]) For any the function f : R → R, the vector-valued function f^circ is defined by (4). Then, the following results hold.

(a) f^circ is continuous at x ∈ Rⁿ with spectral values λ₁(x), λ₂(x) if and only if f is continuous at λ1(x), λ2(x).

(b) f^circ is directionally differentiable at x ∈ Rⁿ with spectral values λ₁(x), λ₂(x) if and only if f is directionally differentiable at λ1(x), λ2(x).

(c) f^circ is differentiable at x ∈ Rⁿ with spectral values λ₁(x), λ₂(x) if and only if f is differentiable at λ1(x), λ2(x).

(d) f^circ is strictly continuous at x ∈ Rⁿ with spectral values λ₁(x), λ₂(x) if and only if f is strictly continuous at λ1(x), λ2(x).

(e) f^circ is semismooth at x ∈ Rⁿ with spectral values λ₁(x), λ₂(x) if and only if f is semismooth at λ₁(x), λ₂(x).

(f ) f^circ is continuously differentiable at x ∈ Rⁿ with spectral values λ₁(x), λ₂(x) if and only if f is continuously differentiable at λ₁(x), λ₂(x).

We point out that there is a close relation between L_θ and Kⁿ (see [34, 42]) as below Kⁿ = AL_θ where A :=tan θ 0

0 I

.

We point out a few points regarding circular cones. First, as mentioned in [43], it is possible to construct a new inner product which ensures the circular cone is self-dual.

However, it is not possible to make both L_θ and Kⁿ are self-dual under a certain inner product. Secondly, as shown in [43], the relation Kⁿ= ALθ does not guarantee that there exists a similar close relation between f^circ and f^soc. The third point is that the structure of circular cone helps on constructing complementarity functions for the circular cone complementarity problem as indicated in [34].

(11)

3 The p-order cone

The p-order cone in Rⁿ, which is a generalization of the second-order cone Kⁿ[14], is defined as

K_p :=







x ∈ Rⁿ

x₁ ≥

n

X

i=2

|x_i|^p

!_p¹





(p ≥ 1). (5)

In fact, the p-order cone K_p can be equivalently expressed as

K_p =x = (x₁, ¯x) ∈ R × Rⁿ⁻¹| x₁ ≥ k¯xk_p , (p ≥ 1),

where ¯x := (x₂, · · · , x_n)^T ∈ Rⁿ⁻¹. From (5), it is clear to see that when p = 2, K₂ is exactly the second-order cone Kⁿ. That means that the second-order cone is a special case of p-order cone. Moreover, it is known that K_p is a convex cone and its dual cone is given by

K^∗_p =







y ∈ Rⁿ

y₁ ≥

n

X

i=2

|y_i|^q

!¹_q



 or equivalently

K_p^∗ =y = (y1, ¯y) ∈ R × Rⁿ⁻¹| y₁ ≥ k¯yk_q = Kq

with ¯y := (y₂, · · · , y_n)^T ∈ Rⁿ⁻¹, where q ≥ 1 and satisfies ¹_p+¹_q = 1. From the expression of the dual cone K^∗_p, we see that the cone K^∗_p is also a convex cone. For an application of p-order cone programming, we refer the readers to [41], in which a primal-dual potential reduction algorithm for p-order cone constrained optimization problems is studied. Be- sides, in [41], a special optimization problem called sum of p-norms is transformed into an p-order cone constrained optimization problems. The pictures of three different cones K_p in R³ are depicted in Figure 3.

Figure 3: Three different p-order cones in R³

(12)

In [33], Miao, Qi and Chen explore the expression of the projection onto p-order cone and the spectral decomposition associated with p-order cone, which are shown the following theorems.

Theorem 3.1. ([33, Theorem 2.1]) For any z = (z₁, ¯z) ∈ R × Rⁿ⁻¹, then the projection of z onto K_p is given by

ΠKp(z) =







z, z ∈ K_p

0, z ∈ −K_p^∗ = −K_q

u, otherwise (i.e., −k¯zk_q < z₁ < k¯zk_p) where u = (u1, ¯u) with ¯u = (u2, u3, · · · , un)^T ∈ Rⁿ⁻¹ satisfying

u1 = k¯ukp = (|u2|^p+ |u3|^p+ · · · + |un|^p)¹^p and

u_i− z_i+ u₁− z₁

u^p−1₁ |u_i|^p−2u_i = 0, ∀i = 2, · · · , n.

Theorem 3.2. ([33, Theorem 2.2]) Let z = (z₁, ¯z) ∈ R × Rⁿ⁻¹. Then, z can be decom- posed as

z = α₁(z) · v⁽¹⁾(z) + α₂(z) · v⁽²⁾(z), where







α₁(z) = z1+ k¯zkp

2 α2(z) = z1− k¯zkp

2

and











v⁽¹⁾(z) =

1

¯ w

v⁽²⁾(z) =

1

− ¯w

with ¯w = _k¯_zk^z^¯

p if ¯z 6= 0; while ¯w being an arbitrary element satisfying k ¯wk_p = 1 if ¯z = 0.

For the projection onto p-order cone, we notice that this projection is not an explicit formula because it is hard to solve the equations which in Theorem 3.1. Moreover, the decomposition for z is not an orthogonal decomposition, which is different from the case in the second-order cone and circular cone setting. Because the decomposition for z is not an orthogonal decomposition, the corresponding nonsmooth analysis for its cone- functions is not established.

4 Geometric cone

The geometric cone is defined as bellow [22]:

Gⁿ:=

(

(x, θ) ∈ Rⁿ+× R+

n

X

i=1

e⁻^xi^θ ≤ 1 )

(13)

where x = (x₁, · · · , x_n)^T ∈ Rⁿ+ and we also use the convention e⁻^xi⁰ = 0. From the definition of the geometric cone Gⁿ, we know that Gⁿ is solid (i.e., int Gⁿ 6= ∅), pointed (i.e., Gⁿ∩ −Gⁿ= 0), closed convex cone, and its dual cone is given by

(Gⁿ)^∗ = (

(y, µ) ∈ Rⁿ+× R+

µ ≥ X

yi>0

y_iln y_i Pn

i=1y_i )

where µ ∈ R⁺ and y = (y1, · · · , yn)^T ∈ Rⁿ+. In view of the expression of the dual cone (Gⁿ)^∗, we see that the dual cone (Gⁿ)^∗ is also a solid, pointed, closed convex cone, and ((Gⁿ)^∗)^∗ = Gⁿ. When n = 1, we note that the geometric cone G¹ is just nonnegative octant cone R²+. In addition, by the expression of the geometric cone Gⁿ and its dual cone (Gⁿ)^∗, it is not hard to verify that the boundary of the geometric cone Gⁿ and its dual cone (Gⁿ)^∗ can be respectively expressed as follows:

bd Gⁿ= (

(x, θ) ∈ Rⁿ+× R+

n

X

i=1

e⁻^xi^θ = 1 )

and

bd (Gⁿ)^∗ = (

(y, µ) ∈ Rⁿ+× R+

µ = X

yi>0

y_iln y_i Pn

i=1y_i )

.

For an application of geometric cone programming, we refer the readers to [21], in which the author shows how to transform a prime-dual pair of geometric optimization problem into a constrained optimization problem related with Gⁿ and (Gⁿ)^∗. The pictures of Gⁿ and its dual cone (Gⁿ)^∗ in R³ are depicted in Figure 4.

Figure 4: The geometric cone (left) and its dual cone (right) in R³ Next, we present the projection of (x, θ) ∈ Rⁿ× R onto the geometric cone Gⁿ.

(14)

Theorem 4.1. Let x = (x, θ) ∈ Rⁿ × R. Then the projection of x onto the geometric cone Gⁿ is given by

ΠGⁿ(x) =







x, if x ∈ Gⁿ, 0, if x ∈ (Gⁿ)^◦, u, otherwise,

(6)

where u = (u, λ) ∈ Rⁿ+× R⁺ with u = (u1, u2, · · · , un)^T ∈ Rⁿ+ satisfying u_i− x_i+ λ(λ − θ)

Pn

j=1e⁻^uj^λu_j

e⁻^ui^λ = 0, i = 1, 2, · · · , n (7)

and n

X

i=1

e⁻^ui^λ = 1. (8)

Proof. From Projection Theorem [3, Prop. 2.2.1], we know that, for every x = (x, θ) ∈ Rⁿ× R, a vector u ∈ Gⁿ is equal to the projection point ΠGⁿ(x) if and only if

u ∈ Gⁿ, x − u ∈ (Gⁿ)^◦, and hx − u, ui = 0.

With this, the first two cases of (6) are obvious. Hence, we only need to consider the third case. Based on (8) and the definition of Gⁿ, it is obvious that u ∈ Gⁿ. In addition, from (7), we obtain that Pn

i=1ui(ui− xi) + λ(λ − θ) = 0, which explains that hx − u, ui = 0.

Next, we argue that x − u ∈ (Gⁿ)^◦. To see this, by (7) and (8), we have

n

X

i=1

(ui− xi) = − λ(λ − θ) Pn

j=1e⁻^uj^λu_j. Together with (7) again, it follows that ^Pn^uⁱ^−xⁱ

j=1(uj−x_j) = e⁻^ui^λ, which leads to ln^Pn^uⁱ^−xⁱ j=1(uj−x_j) =

−^u_λⁱ. Hence, we have

X

ui−xi>0

(u_i− x_i) ln u_i− x_i Pn

j=1(u_j − x_j)

= − X

ui−xi>0

(ui− xi)u_i λ

= −1 λ

X

ui−xi>0

(u_i− x_i)u_i

≤ 1

λ · λ(λ − θ) = λ − θ, where the inequality holds since Pn

i=1u_i(u_i − x_i) + λ(λ − θ) = 0. This explains that u − x ∈ (Gⁿ)^∗, i.e, x − u ∈ (Gⁿ)^◦. Then, the proof is complete. 2

(15)

For the projection onto geometric cone Gⁿ, we notice again that this projection is not an explicit formula since the equations (7) and 8 cannot be easily solved. Moreover, the decomposition associated with the geometric cone Gⁿ and the corresponding nonsmooth analysis for its cone-functions are not established.

5 The exponential cone

The exponential cone is defined as bellow [5, 37]:

Ke:= cl n

(x1, x2, x3)^T ∈ R³

x2e^x1^x2 ≤ x3, x2 > 0 o

. In fact, the exponential cone can be expressed as the union of two sets, i.e.,

Ke :=

n

(x1, x2, x3)^T ∈ R³

x2e^x1^x2 ≤ x3, x2 > 0 o

∪(x1, 0, x3)^T

x1 ≤ 0, x3 ≥ 0 . As shown in [5], the dual cone K^∗_e of the exponential cone K_e is given by

K_e^∗ = cln

(y₁, y₂, y₃)^T ∈ R³

− y₁e^y2^y1 ≤ ey₃, y₁ < 0o .

In addition, the dual cone K^∗_e is expressed as the union of the two following sets:

K^∗_e =n

(y₁, y₂, y₃)^T ∈ R³

− y₁e^y2^y1 ≤ ey₃, y₁ < 0o

∪(0, y2, y₃)^T

y₂ ≥ 0, y₃ ≥ 0 . From the expression of the exponential cone K_e and its dual cone K^∗_e, it is known that the exponential cone Ke and its dual cone K^∗_e are closed convex cone in R³. Moreover, based on the expression of K_e and K^∗_e, it is easy to verify that their boundary can be respectively expressed as follows:

bd Ke :=

n

(x1, x2, x3)^T ∈ R³

x2e^x1^x2 = x3, x2 > 0 o

∪(x1, 0, x3)^T

x1 ≤ 0, x3 ≥ 0 . and

bd K^∗_e :=

n

(y1, y2, y3)^T ∈ R³

− y1e^y2^y1 = ey3, y1 < 0 o

∪(0, y2, y3)^T

y2 ≥ 0, y3 ≥ 0 . For an application of exponential cone programming, we refer the readers to [5], in which interior-point algorithms for structured convex optimization involving exponential have been investigated. The pictures of the exponential cone Ke and its dual cone K^∗_e in R³ are depicted in Figure 5.

For the geometric cone Gⁿ and the exponential cone K_e, there exists the relationship between these two types of cones, which is described in the following proposition.

Proposition 5.1. Under the suitable conditions, there is a corresponding relationship between the geometric cone Gⁿ and exponential cone K_e.

(16)

Figure 5: The exponential cone (left) and its dual cone (right) in R³

Proof. For any (x, θ) ∈ Gⁿ with x = (x₁, x₂, · · · , x_n)^T ∈ Rⁿ+, we have Pn

i=1e⁻^xi^θ ≤ 1.

With this, it is equivalent to say

e⁻^xi^θ ≤ z_i, and

n

X

i=1

z_i = 1.

Hence, we obtain that (−x_i

θ, 1, z_i)^T ∈ K_e (i = 1, 2, · · · , n) and

n

X

i=1

z_i = 1.

For the above analysis, it is clear to see that the proof is reversible.

Besides, we give another form of transformation for the exponential cone Ke. Indeed, for any ˜x := (x₁, x₂, x₃)^T := (ˆx^T, x₃)^T ∈ K_e with ˆx := (x₁, x₂)^T, we have two cases, i.e.,

(a) x₂e^x1^x2 ≤ x₃ and x₂ > 0, or (b) x1 ≤ 0, x2 = 0, x3 ≥ 0.

For the case (a), if x₂ = x₃ and x₁ ≤ 0, it follows that e^x1^x2 ≤ 1 and x₂ > 0, which yields (−x₁, x₂)^T ∈ G¹. Under the condition x₂ = x₃, if x₁ > 0, we find that there is no relationship between K_e and G¹. For the case (b), if x₂ = x₃, then, we have x₁ ≤ 0 and x₂ = x₃ = 0. this implies that e^x1⁰ = 0. By this, we have ˆx = (−x₁, 0)^T ∈ G¹. 2

We also present the projection of x ∈ R³ onto the exponential cone K_e.

(17)

Theorem 5.1. Let x = (x₁, x₂, x₃)^T ∈ R³. Then the projection of x onto the exponential cone Ke is given by

ΠK_e(x) =







x, if x ∈ K_e,

0, if x ∈ (K_e)^◦ = −K^∗_e, v, otherwise,

(9)

where v = (v₁, v₂, v₃)^T ∈ R³ has the following form:

(a) if x1 ≤ 0 and x2 ≤ 0, then v = (x1, 0,^x³^+|x₂ ³^|)^T.

(b) otherwise, the projection ΠKe(x) = v satisfies the equations:

v₁ − x₁ + e^v1^v2

v₂e^v1^v2 − x₃

= 0, v₂(v₂− x₂) − (v₁− x₁)(v₂− v₁) = 0, v₂e^v1^v2 = v₃.

Proof. As the argument of Theorem 4.1, the first two cases of (9) are obvious. Hence, we only need to consider the third case, i.e., x /∈ Ke∪ (Ke)^◦. For convenience, we denote

A :=n

(x₁, x₂, x₃)^T

x₂e^x1^x2 ≤ x₃, x₂ > 0o

and B :=(x₁, 0, x₃)^T

x₁ ≤ 0, x₃ ≥ 0 . (a) If x1 ≤ 0 and x2 ≤ 0, since the exponential cone Ke is closed and convex, by Proposition 2.2.1 in [3], we get that v is the projection of x onto K_e if and only if

hx − v, y − vi ≤ 0, ∀y ∈ K_e. (10)

From this, we need to verify that v = (x₁, 0,^x³^+|x₂ ³^|)^T satisfies (10). For any y :=

(y₁, y₂, y₃)^T ∈ K_e, it follows that

hx − v, y − vi = x₂y₂+x3− |x3| 2

y₃− x3+ |x3| 2

= x₂y₂+ y₃x₃− |x₃|

2 .

If y ∈ A, we have y2 > 0 and y3 ≥ y2e^y1^y2 > 0, which leads to hx − v, y − vi = x₂y₂+ y₃x₃− |x₃|

2 ≤ 0.

If y ∈ B, we have y₂ = 0 and y₃ ≥ 0, which implies that hx − v, y − vi = y₃x₃− |x₃|

2 ≤ 0.

Hence, under the conditions of x₁ ≤ 0 and x₂ ≤ 0, we can obtain that ΠK_e(x) = v = (x₁, 0,^x³^+|x₂ ³^|)^T.

(18)

(b) If x belongs to other cases, we assert that the projection ΠKe(x) of x onto K_e lies in the set A. Suppose not, i.e., ΠKe(x) ∈ B. Then, for any x = (x1, x2, x3)^T ∈ R³, it follows that ΠK_e(x) = v = (min{x₁, 0}, 0,^x³^+|x₂ ³^|)^T ∈ B. By Projection Theorem [3, Prop. 2.2.1], we know that the projection v should satisfy the condition

v ∈ K_e, x − v ∈ (K_e)^◦, and hx − v, vi = 0.

However, we see that there exists x₁ > 0 or x₂ 6= 0 such that v − x = (min{x₁, 0} − x₁, −x₂,|x₃| − x₃

2 )^T ∈ K/ _e^∗,

i.e., x − v /∈ (K_e)^◦. For example, when x₁ = 1, x₂ = 0 and x₃ = 1, we have v − x = (−1, 0, 0)^T ∈ K/ ^∗_e. This contradicts with x − v ∈ (Ke)^◦. Hence, the projection ΠKe(x) ∈ A.

To obtain the expression of ΠK_e(x), we look into the following problem:

min f (x) = ¹₂kv − xk²

s.t. v ∈ A. (11)

In light of the convexity of the function f and the set A, it is easy to verify that the problem (11) is a convex optimization problem. Moreover, it follows from v ∈ A that

v₁

v₂ − ln v3+ ln v2 ≤ 0.

Thus, the KKT conditions of the problem (11) are recast as











v1− x1+_v^µ

2 = 0, v₂− x₂+ µ(−^v_v¹2

2

+ _v¹

2) = 0, v3− x3− _v^µ

3 = 0, µ ≥ 0, ^v_v¹

2 − ln v₃+ ln v₂ ≤ 0, µ(^v_v¹

2 − ln v₃+ ln v₂) = 0.

(12)

From (12), by the fact that the projection of x ∈ /∈ K_e∪ (K^∗_e)^◦ must be a point in the boundary, it is not hard to see that ^v_v¹

2 − ln v₃+ ln v₂ = 0 and µ > 0, i.e., v₃ = v₂e^v1^v2 and µ > 0. In addition, by the first and third equations in (12), we have

v₁− x₁+v₃(v₃− x₃) v2

= 0.

Combining with v3 = v2e^v1^v2, this implies that v₁ − x₁ + e^v1^v2

v₂e^v1^v2 − x₃

= 0.

On the other hand, by the first and second equations in (12), we have v₂(v₂− x₂) = (v₁− x₁)(v₂− v₁).

(19)

Therefore, we obtain that the projection ΠKe(x) = v satisfies the following equations:

v₁− x₁+ e^v1^v2

v₂e^v1^v2 − x₃

= 0, v₂(v₂− x₂) − (v₁− x₁)(v₂ − v₁) = 0, v2e^v1^v2 = v3. Then, the proof is complete. 2

Here, we say a few words about Theorem 5.1. Unfortunately, unlike second-order cone or circular cone cases, we do not obtain an explicit formula for the projection onto the exponential cone, since there are nonlinear transcendental equations in Theorem 5.1. For example, when we examine the projection onto the exponential cone K_e. Let x = (1, −2, 3). For the case in Theorem 5.1(b), using the second condition v₂(v₂− x₂) − (v₁− x₁)(v₂− v₁) = 0, we have

v₂ = v₁− 3 +p−3v₁²− 2v₁+ 9

2 .

Combining with the first condition v₁ − x₁ + e^v1^v2

v₂e^v1^v2 − x₃

= 0 in the case (b), this yields a nonlinear transcendental equations as bellow:

v₁− 1 + e

2v1 v1−3+

√

−3v21−2v1+9 v₁− 3 +p−3v₁²− 2v₁+ 9

2 e

2v1 v1−3+

√

−3v21−2v1+9 − 3

!

= 0.

From this equation, we do not have the specific expression of v₁. Hence, the explicit formula for the projection onto exponential cone cannot be obtained. Moreover, analogous to the geometric cone Gⁿ, the decomposition for x associated with the exponential cone K_e and the corresponding nonsmooth analysis for its cone-functions are not established.

6 The power cone

The high dimensional power cone is defined as bellow [25, 39]:

K^α_m,n :=

(

(x, z) ∈ R^m+ × Rⁿ kzk ≤

m

Y

i=1

x^α_iⁱ )

,

where α_i > 0, Pm

i=1α_i = 1 and x = (x₁, · · · , x_m)^T. For the power cone, when m = 2, n = 1, Truong and Tuncel [39] have discussed the homogeneity of the power cone.

However, Hien [25] states that the power cone is not homogeneous in general case, and the power cone is self-dual cone. Moreover, when m = 2 and α₁ = α₂ = ¹₂, we see that the power cone K_m,n^α is exactly the rotated second-order cone, which has a broad range

(20)

of applications. In [25], Hien provides the expression of the dual cone of the power cone K^α_m,n as below:

(K^α_m,n)^∗ = (

(s₁, · · · , s_m, ω₁, · · · , ω_n) ∈ R^m+ × Rⁿ

m

Y

i=1

s_i α_i

αi

≥ kωk )

,

where ω = (ω₁, · · · , ω_n)^T ∈ Rⁿ. For an application of power cone programming, we refer the readers to [5], in which a lot of practical applications such as location problems and geometric programming can be modelled using K^α_m,n and its limiting case K_e. The pictures of the power cone K^α_m,n and its dual cone (K_m,n^α )^∗ in R³ are depicted in Figure 6, where the parameters (m, n) = (2, 1) and (α₁, α₂) = (0.8, 0.2).

Figure 6: The power cone (left) and its dual cone (right) in R³.

The projection onto the power cone K_m,n^α is already figured out by Hien in [25], which is presented in the following theorem.

Theorem 6.1. ([25, Proposition 2.2]) Let (x, z) ∈ R^m×Rⁿ with x = (x₁, · · · , x_m)^T ∈ R^m and z = (z1, · · · , zn)^T ∈ Rⁿ. Set (ˆx, ˆz) be the projection of (x, z) onto the power cone K^α_m,n. Denote

Φ(x, z, r) = 1 2

m

Y

i=1

x_i+

q

x²_i + 4α_ir(kzk − r)

αi

− r.

(a) If (x, z) /∈ K^α_m,n∪ −(K_m,n^α )^∗ and z 6= 0, then its projection onto K_m,n^α is ( xˆ_i = ¹₂

x_i+px²_i + 4α_ir(kzk − r)

, i = 1, · · · , m, ˆ

z_l= z_l_kzk^r , l = 1, · · · , n,

(21)

where r = r(x, z) is the unique solution of the following system:

E(x, z) : Φ(x, z, r) = 0, 0 < r < kzk.

(b) If (x, z) /∈ K^α_m,n∪ −(K^α_m,n)^∗ and z = 0, then its projection onto K^α_m,n is

xˆ_i = (x_i)₊= max{0, x_i}, i = 1, · · · , m, ˆ

z_l = 0, l = 1, · · · , n.

(c) If (x, z) ∈ K^α_m,n, then its projection onto K_m,n^α is itself, i.e., (ˆx, ˆz) = (x, z).

(d) If (x, z) ∈ −(K^α_m,n)^∗, then its projection onto K_m,n^α is zero vector, i.e., (ˆx, ˆz) = 0.

Nonetheless, Hein does not obtain an explicit formula for the projection onto the power cone K_m,n^α in [25]. Accordingly, analogous to the geometric cone Gⁿ and the exponential cone K_e, the decomposition for (x, z) associated with the power cone K^α_m,n and the corresponding nonsmooth analysis for its cone-functions are not established yet.

7 Conclusion

According to the authors’ earlier experience on symmetric cone optimization, we believe that spectral decomposition associated with cones, nonsmooth analysis regarding cone- functions, projections onto cones, and cone-convexity are the bridges between symmetric cone programs and nonsymmetric cone programs. Therefore, in this paper, we survey some related results about circular cone, p-order cone, geometric cone, exponential cone, and the power cone. Although the results are not quite complete due to the difficulty of handling nonsymmetric cones, they are very crucial to subsequent study towards nonsymmetric cone optimization. Further investigations are definitely desirable. We summarize and list out some future topics as below.

1. Exploring more structures and properties for each non-symmetric cone. Also looking for more non-symmetric cones, e.g., EDM cone.

2. For geometric cone, exponential cone, and power cone, etc., figuring out their spectral decompositions, projections, and doing nonsmooth analysis for their corresponding cone-functions like f^sc, f^mat and f^circ. We point out that through appropriate transformations (for example, α-representation and extended α-representation defined in [5]), the aforementioned geometric cone, exponential cone, and power cone can be generated from the 3-dimensional power cone and the exponential cone in Figure 5 and 6. More recently, Lu et al. [31] propose two types of decomposition approaches for these cones. We believe their results yield a possibility to construct the corresponding cone-functions.

3. Designing appropriate algorithms based on the structures of non-symmetric cones.

(22)

References

[1] E.D. Andersen, C. Roos, and T. Terlaky, Notes on duality in second order and p-order cone optimization, Optimization, 51 (2002), 627-643.

[2] M. Baes, Convexity and differentiablity properties of spectral functions and spectral mappings on Euclidean Jordan algebras, Linear Algebra and its Applications, 422 (2007), 664-700.

[3] D.P. Bertsekas, A. Nedi´c and A.E. Ozdaglar, Convex analysis and optimization, Cambridge, MA: Athena Scientific, (2003).

[4] Y.-L. Chang and J.-S. Chen, The H¨older continuity of vector-valued functions associated with second-order cone, Pacific Journal of Optimization, 8 (2012), 135-141.

[5] R. Chares, Cones and interior-point algorithms for structured convex optimization involving powers and exponentials, http://hdl.handle.net/2078.1/28538, (2009).

[6] J.-S. Chen, Alternative proofs for some results of vector-valued functions associated with second-order cone, Journal of Nonlinear and Convex Analysis, 6 (2005), 297- 325.

[7] J.-S. Chen, The convex and monotone functions associated with second-order cone, Optimization, 55 (2006), 363-385.

[8] J.-S. Chen, X. Chen, and P. Tseng, Analysis of nonsmooth vector-valued functions associated with second-order cones, Mathematical Programming, 101 (2004), 95-117.

[9] C.B. Chua, A t-algebraic approach to primal-dual interior-point algorithms, SIAM Journal on Optimization, 20 (2009), 503-523.

[10] J.-S. Chen and S.-H. Pan, A entropy-like proximal algorithm and the exponential multiplier method for symmetric cone programming, Computational Optimization and Applications, 47 (2010), 477-499.

[11] J.-S. Chen, S.-H. Pan, and T.-C. Lin, A smoothing Newton method based on the gen- eralized Fischer-Burmeister function for MCPs, Nonlinear Analysis: Theory, Meth- ods and Applications, 72 (2010), 3739-3758.

[12] X. Chen, H. Qi, and P. Tseng, Analysis of nonsmooth symmetric-matrix-valued functions with applications to semidefinite complementarity problems, SIAM Journal on Optimization, 13 (2003), 960-985.

[13] A. Pinto Da Costa and A. Seeger, Numerical resolution of cone-constrained eigen- value problems, Computational and Applied Mathematics, 28 (2009), 37-61.