• 沒有找到結果。

1Introduction Propertiesofcircularconeandspectralfactorizationassociatedwithcircularcone

N/A
N/A
Protected

Academic year: 2022

Share "1Introduction Propertiesofcircularconeandspectralfactorizationassociatedwithcircularcone"

Copied!
13
0
0

加載中.... (立即查看全文)

全文

(1)

to appear in Journal of Nonlinear and Convex Analysis, 2013

Properties of circular cone and spectral factorization associated with circular cone

Jinchuan Zhou 1 Department of Mathematics

School of Science

Shandong University of Technology Zibo 255049, P.R. China E-mail: [email protected]

Jein-Shan Chen 2 Department of Mathematics National Taiwan Normal University

Taipei 11677, Taiwan E-mail: [email protected]

October 11, 2012

Abstract. Circular cone includes second-order cone as a special case when the rotation angle is 45 degree. This paper gives an insight on circular cone, in which we describe the tangent cone, normal cone, second order tangent cone, and second order regularity of circular cone. Moreover, we establish the spectral factorization associated with circular cone. These results are crucial to subsequent study regarding various analysis towards optimizations associated with circular cone.

Keywords. Circular cone, second order regular, spectral factorization.

1 Introduction

The circular cone [9] is a pointed closed convex cone having hyperspherical sections orthogonal to its axis of revolution about which the cone is invariant to rotation. Let its

1The author’s work is supported by National Natural Science Foundation of China (11101248) and Shandong Province Natural Science Foundation (ZR2010AQ026).

2Corresponding author. Member of Mathematics Division, National Center for Theoretical Sciences, Taipei Office. The author’s work is supported by National Science Council of Taiwan.

(2)

half-aperture angle be θ where θ ∈ (0,π2). Then, we denote the n-dimensional circular cone by Lθ which is expressed as

Lθ :=x = (x1, x2)T ∈ IR × IRn−1 | cos θkxk ≤ x1 . (1) Circular cone includes second-order cone (SOC), given by

Kn := {x = (x1, x2)T ∈ IR × IRn−1 | kx2k ≤ x1}, (2) as a special case when the rotation angle is 45 degree. This can be verified by

Kn = (x1, x2)T ∈ IR × IRn−1 | kx2k ≤ x1

= (x1, x2)T ∈ IR × IRn−1 | x1 ≥ 0, 2kx2k2 ≤ 2x21

= (x1, x2)T ∈ IR × IRn−1 | x1 ≥ 0, 2x21+ 2kx2k2 ≤ 4x21

=



(x1, x2)T ∈ IR × IRn−1

q

2x21 + 2kx2k2 ≤ 2x1



= (

(x1, x2)T ∈ IR × IRn−1

√2 2

q

x21+ kx2k2 ≤ x1 )

= (

(x1, x2)T ∈ IR × IRn−1

√2

2 kxk ≤ x1 )

= n

(x1, x2)T ∈ IR × IRn−1 cosπ

4kxk ≤ x1o .

Considerable attentions have been devoted to the second-order cone Kn [5, 6, 7], a special case of self-dual cone. However, the study on the circular cone Lθ, a non-self-dual (or non-symmetric cone) is rather limited. In this paper, we show that there exists a close relationship between Knand Lθ by establishing an inequality regrading distance between Kn and Lθ. This nice property plays an essential role in our subsequence analysis and give us more information and insight on Lθ. In particular, we develop the formulae of tangent cone, normal cone, and second-order tangent cone of Lθ in terms of Kn (the formula of the latter has been given by different scholars). Furthermore, we show that Lθ, as a non-self-dual and non-polytechnic cone, is also second-order regular. Note that we know the second-order cone and positive semi-definitive cone are both second order regular, but there are all symmetric. Thus this is an interesting case which indicates the second order regularity of a non-symmetric cone. Finally, we develop the spectral factorization of z in terms of Lθ by studying the projection on Lθ which will be useful in dealing with optimization associated circular cone.

(3)

In fact, it is not hard to see that

Lθ = (x1, x2)T ∈ IR × IRn−1 | x1 ≥ 0, kxk2cos2θ ≤ x21

= (x1, x2)T ∈ IR × IRn−1 | x1 ≥ 0, (x21+ kx2k2) cos2θ ≤ x21

= (x1, x2)T ∈ IR × IRn−1 | x1 ≥ 0, kx2k2 ≤ x21tan2θ

= (x1, x2)T ∈ IR × IRn−1 | kx2k ≤ x1tan θ , which yields

x1 x2



∈ Lθ ⇐⇒ tan θx1 x2



∈ Kn ⇐⇒  tan θ 0

0 1

 x1 x2



∈ Kn. (3) For simplicity, let us denote

A := tan θ 0

0 1

 . Then, the above expression (3) is equivalent to

x1 x2



∈ Lθ ⇐⇒ Ax1 x2



∈ Kn. (4)

We point out that the matrix A is positive definite whose inverse matrix is A−1 = ctanθ 0

0 1



where ctanθ := 1 tan θ.

To close this section, we say a few words about the notations. For a convex cone K, its dual cone is defined by

(K) = {v | hv, xi ≥ 0, ∀x ∈ K} , while its polar cone is given by

(K) = {v | hv, xi ≤ 0, ∀x ∈ K} .

2 Insight on circular cone

In this section, we give an insight on circular cone in which we shall study some properties of Lθ, including characterizing its tangle cone, normal cone, second-order tangent cone, etc.. To this end, we first describe the relationship between Kn and Lθ.

Theorem 2.1. Let Lθ and Kn be defined as in (1) and (2), respectively. Then, we have

(4)

(a) Lθ = A−1Kn and Kn = ALθ. (b) AKn= Lπ

2−θ and Lπ

2−θ = A2Lθ. (c) Lθ = Lπ

2−θ and (Lθ) = Lθ.

Proof. (a) This follows from equivalence (4) because Lθ = {x | x ∈ Lθ}

= {x | Ax ∈ Kn}

= x | x ∈ A−1Kn

= A−1Kn. (b) According to part(a), we have

Lπ

2−θ =

"

ctan(π

2 − θ) 0

0 1

#

Kn = tan θ 0

0 1



Kn = AKn = A(ALθ) = A2Lθ

which is the desired result.

(c) It is known that Kn is self-dual. Hence, we have

Kn = (Kn) = {v | hv, ki ≥ 0, ∀k ∈ Kn}

= {v | hv, Azi ≥ 0, ∀z ∈ Lθ}

= {v | hAv, zi ≥ 0, ∀z ∈ Lθ}

= {v | Av ∈ Lθ}

= A−1Lθ which implies Lθ = AKn = Lπ

2−θ by part(b). The remaining part is true for all closed convex cone. 2

Theorem 2.2. For any x, z ∈ IRn, we have

kAk−1 dist(Az, Kn) ≤ dist(z, Lθ) ≤ kA−1k dist(Az, Kn) (5) and

kA−1k−1 dist(A−1x, Lθ) ≤ dist(x, Kn) ≤ kAk dist(A−1x, Lθ). (6) Proof. First, we observe the following:

dist(x, Kn)

= min

k∈Knkx − kk = min

k∈ALθkx − kk = min

z∈Lθ

kx − Azk = min

z∈Lθ

kA(A−1x) − Azk

= min

z∈Lθ

kA(A−1x − z)k ≤ kAk min

z∈Lθ

kA−1x − zk = kAk dist A−1x, Lθ , (7)

(5)

dist(z, Lθ)

= min

u∈Lθ

kz − uk = min

u∈A−1Knkz − uk = min

k∈Knkz − A−1kk = min

k∈KnkA−1(Az) − A−1kk

= min

k∈KnkA−1(Az − k)k ≤ kA−1k min

k∈KnkAz − kk = kA−1k dist(Az, Kn). (8) These prove the second inequality in (5) and (6), respectively. Next, plugging z = A−1x and x = Az in (7) and (8), respectively, yields the first inequality in (5) and (6), respectively. Thus, the proof is complete. 2

Theorem 2.2 indicates that the distances of arbitrary points to Kn and Lθ are equiv- alent. This is an essential property for analyzing the tangent cone and normal cone of Lθ. Before we move on, we recall the definitions of tangent cone and normal cone. Given a subset S ⊂ IRn and x ∈ S, the contingent cone TS(x) and inner tangent cone TSi(x) of S at x are defined respectively as

TS(x) := {d ∈ IRn| ∃tn ↓ 0, dist (x + tnd, S) = o(tn)}

and

TSi(x) := {d ∈ IRn| dist(x + th, S) = o(t), t ≥ 0)}.

In general, these two cones can be different. However, when S is convex, they are equal to each other and to the closure of the radial cone, see [4, page 45]. Hence for convex sets, we simply speak of tangent cone rather than contingent or inner tangent cones.

Moreover, the Fr´echet/regular normal cone (also known as the prenormal cone), written as bNS(x), is defined as

NbS(x) := {v ∈ IRn| hv, z − xi ≤ o(kz − xk), for z ∈ S},

and the Mordukhovich/limiting normal cone (or simply normal cone) is defined as NS(x) := lim sup

zSx

NbS(z).

When S is convex, NS(x) = bNS(x) and is the polar cone of TS(x), i.e., NS(x) := {v ∈ IRn | hv, di ≤ 0, ∀d ∈ TS(x)} .

Theorem 2.3. For any z ∈ Lθ, we have (a) TLθ(z) = A−1TKn(Az),

(b) NLθ(z) = ANKn(Az).

(6)

Proof. (a) Let us first show that TLθ(z) ⊆ A−1TKn(Az). Choose d ∈ TLθ(z). Then, by definition of tangent cone, we have

dist(z + td, Lθ) = o(t). (9)

Plugging x = A(z + td) into (6) yields

kA−1k−1 dist(z + td, Lθ) ≤ dist (A(z + td), Kn) ≤ kAk dist(z + td, Lθ).

This together with (9) implies dist(Az + tAd, Kn) = o(t). Thus, Ad ∈ TKn(Az), which says d ∈ A−1TKn(Az).

Conversely, let d ∈ A−1TKn(Az). Since Ad ∈ TKn(Az), from definition of tangent cone, we know

dist(Az + tAd, Kn) = o(t). (10)

Replacing z in (5) by z + td gives

kAk−1 dist(Az + tAd, Kn) ≤ dist(z + td, Lθ) ≤ kA−1k dist(Az + tAd, Kn).

This together with (10) implies dist(z + td, Lθ) = o(t), which says d ∈ TLθ(z).

(b) The desired result follows from

NLθ(z) = {v ∈ IRn | hv, di ≤ 0, ∀d ∈ TLθ(z)}

= v ∈ IRn | hv, A−1wi ≤ 0, ∀w ∈ TKn(Az)

= v ∈ IRn | hA−1v, wi ≤ 0, ∀w ∈ TKn(Az)

= v ∈ IRn | A−1v ∈ NKn(Az)

= ANKn(Az).

2

Theorem 2.3 tells us that the explicit formula of tangent cone TLθ(z) can be estab- lished by TKn(Az), which has been given in [3].

It is well known that in the study of second order analysis for optimization problems, we need the following inner and outer second order tangent sets to describe the possible curvature of the feasible region. Below, we state their official definitions.

Definition 2.1. [4, Definition 3.28] The set limits TSi,2(x, d) :=



w ∈ IRn

dist



x + td +1 2t2w, S



= o(t2), t ≥ 0



(7)

and

TS2(x, d) =



w ∈ IRn

∃ tn↓ 0 such that dist



x + tnd +1 2t2nw, S



= o(t2n)



are called the inner and outer second order tangent sets, respectively, to the set S at x in the direction d.

Definition 2.2. [4, Definition 3.32] We say that the set S is second order directionally differentiable at a point x ∈ S in a direction d ∈ TS(x), if TSi(x) = TS(x) and TSi,2(x, d) = TS2(x, d). We simply say that S is second order directionally differentiable at a point x ∈ S if it is second order directionally differentiable in all directions d ∈ TS(x).

Theorem 2.4. Let z ∈ Lθ and d ∈ TLθ(z). Then, TLi,2

θ(z, d) = TL2

θ(z, d) = A−1TK2n(Az, Ad).

Proof. The first equality is due to the second order directionally differentiable of Kn as shown in [10, Proposition 3.1] and the second equality can be proved by the same arguments as in Theorem 2.3. 2

Definition 2.3. [4, Definition 3.85] We say that a subset S ⊂ IRn is second order regular at x if it satisfies

(i) TS2(x, d) = TSi,2(x, d) for all d ∈ TS(x);

(ii) for any d ∈ TS(x) and for any sequence x + tnd +12t2nrn∈ S such that tnrn→ 0, the following condition holds:

n→∞lim dist rn, TS2(x, d) = 0.

Theorem 2.5. The circular cone Lθ is second order regular.

Proof. Let z ∈ Lθ and d ∈ TLθ(z). According to Theorem 2.4, it suffices to show that for any sequence z + tnd + 12t2nrn∈ Lθ with tnrn→ 0, there holds

n→∞lim dist rn, TL2θ(z, d) = 0. (11) We will complete the proof by using the relationship between Lθ and Kn. Since z + tnd +

1

2t2nrn∈ Lθ, we know Az + tnAd +12t2nArn ∈ K by Theorem 2.1(a). Note that tnArn→ 0

(8)

because ktnArnk ≤ kAk · ktnrnk. In addition, Kn is second order regular (see [10] for detailed proof), from Definition 2.3, we have

n→∞lim dist Arn, TK2n(Az, Ad) = 0. (12) On the other hand, we observe that

dist rn, TL2

θ(z, d)

= dist rn, A−1TK2n(Az, Ad)

= dist A−1(Arn), A−1TK2n(Az, Ad)

≤ kA−1k dist Arn, TK2n(Az, Ad) . This together with (12) implies the validity of (11). 2

3 Spectral factorization associated with circular cone

In this section, we will develop the spectral factorization associated with circular cone which is the basis of further investigations for optimization associated with circular cone.

To this end, we start with studying the projection on Lθ, i.e., ΠLθ(z) := arg min

x∈Lθ

kz − xk = {x ∈ Lθ | kz − xk ≤ kz − uk, ∀u ∈ Lθ} .

It should be mentioned that the projection cannot be obtained by using the relation- ship between Lθ and Kn because

kA−1xk ≤ kA−1yk ; kxk ≤ kyk whenever θ 6= π/4.

For example, let x = (8, 1), y = (4, 2), and θ = cot−1(1/8). Then, kA−1xk =√

2 < √

17/2 = kA−1yk, but kxk =√

65 >√

20 = kyk.

Therefore, we seek another way to characterize the projection. First, we note that for any closed convex cone Ω

Π−Ω(x) = −Π(−x).

In fact, letting a = Π−Ω(x) yields

k(−x) − (−a)k = kx − ak ≤ kx − (−y)k = k(−x) − yk ∀y ∈ Ω,

where the inequality comes from the fact that a = Π−Ω(x) by definition of projection.

This means that −a = Π(−x). Besides, it is well known that any vector z ∈ IRn can be written as

z = Π(z) + Π(z).

(9)

Hence,

z = ΠLθ(z) + ΠLθ(z) = ΠLθ(z) + Π−Lθ(z)

= ΠLθ(z) − ΠL

θ(−z) = ΠLθ(z) − ΠLπ

2−θ(−z). (13)

Due to the special structure of Lθ, the explicit formula of projection is given below.

ΠLθ(z) =

z, if z ∈ Lθ 0, if z ∈ −Lθ u, otherwise,

(14)

where

u =

z1 + kz2k tan θ 1 + tan2θ

 z1+ kz2k tan θ 1 + tan2θ tan θ

 z2 kz2k

.

In fact, formula (14) can be found in several places, for example, [8], [1, page 508] or [2, Theorem 3.3.6]. For completeness we provide the detailed argument on (14), nonetheless, by a different approach from that in [2, Theorem 3.3.6], which leads us to establish the spectral factorization associated with Lθ.

The first two cases in (14) follow from (13) directly. Now, consider the third case.

Note that it corresponds to z1tan θ < kz2k and −z1ctanθ < kz2k. Hence we must have z2 6= 0, because otherwise, we would have z1 < 0 and z1 > 0, which is impossible. Let us calculate the projection in the third case by solving the Karush-Kuhn-Tucker conditions for the following convex programming problems

min 1

2kx − zk2 s.t. x ∈ Lθ which is equivalent to

min 1

2kx − zk2

s.t. kx2k − x1tan θ ≤ 0.

The KKT point of the above convex programming is to find x ∈ Lθ and λ ≥ 0 such that

 x1− z1 x2− z2

 + λ

(" 0 x2 kx2k

#

− tan θ 1 0

)

= 0,

which is equivalent to solving

x1 = z1+ λ tan θ,

x2 = 1

1 + (λ/kx2k) z2. (15)

(10)

Thus,

kz2k = (1 + (λ/kx2k)) kx2k = kx2k + λ = kx2k + x1 − z1

tan θ = x1tan θ + x1− z1

tan θ , where the third equality is due to (15) and the last equality comes from the fact that kx2k = x1tan θ since the projection point of z /∈ Lθ must lie in the boundary of Lθ. Then, we have

x1 = z1+ kz2k tan θ 1 + tan2θ . Substituting this into the first equation in (15) yields

λ = kz2k − z1tan θ 1 + tan2θ .

Therefore, according to the second equation in (15), we obtain x2 = z1+ kz2k tan θ

1 + tan2θ tan θ

 z2 kz2k which says

ΠLθ(z) =

z1+ kz2k tan θ 1 + tan2θ

 z1+ kz2k tan θ 1 + tan2θ tan θ

 z2 kz2k

 (16)

under this subcase.

From (13), we see that ΠL

θ(z) = −ΠLπ

2−θ(−z) which implies

ΠL

θ(z) = −

−z1+ kz2kctanθ 1 + ctan2θ

 −z1 + kz2kctanθ

1 + ctan2θ ctanθ −z2 kz2k

=

z1− kz2kctanθ 1 + ctan2θ

 z1− kz2kctanθ

1 + ctan2θ ctanθ −z2 kz2k

. (17)

According to the above arguments, we obtain the following result, which is called the spectral factorization for z associated with circular cone.

Theorem 3.1. For any z ∈ IRn, one has

z = λ1(z) · u(1)z + λ2(z) · u(2)z (18)

(11)

where

λ1(z) = z1− kz2kctanθ λ2(z) = z1+ kz2k tan θ and

u(1)z = 1 1 + ctan2θ

1 0

0 ctanθ

  1

−w



u(2)z = 1 1 + tan2θ

1 0

0 tan θ

  1 w



with w = z2

kz2k if z2 6= 0, and any vector in IRn−1 satisfying kwk = 1 if z2 = 0.

Proof. The case of z2 = 0 is clear by simply calculating (18). The case of z2 6= 0 follows from (13), (16), and (17) because

z = ΠLθ(z) + ΠLθ(z)

=

z1+ kz2k tan θ 1 + tan2θ

 z1+ kz2k tan θ 1 + tan2θ tan θ

 z2 kz2k

+

z1− kz2kctanθ 1 + ctan2θ

 z1 − kz2kctanθ

1 + ctan2θ ctanθ −z2 kz2k

= z1 + kz2k tan θ 1 + tan2θ

1 0

0 tan θ

" 1 z2 kz2k

#

+z1− kz2kctanθ 1 + ctan2θ

1 0

0 ctanθ

" 1

− z2 kz2k

# . 2

With Theorem 3.1, we could derive another expression for the projection shown as below.

Theorem 3.2. For any z ∈ IRn, we have ΠLθ(z) = λ1(z)

+· u(1)z + λ2(z)

+· u(2)z , (19)

where (a)+ := max{0, a}, λi(z) and uiz for i = 1, 2 are given as in Theorem 3.1.

Proof. The proof is divided into two cases, according to whether z2 = 0 or z2 6= 0.

Case 1: z2 = 0. If z1 ≥ 0, then z1tan θ ≥ 0 = kz2k and λi(z) = z1 ≥ 0. Hence z ∈ Lθ and both sides of (19) are z by (14) and (18). If z1 < 0, then −z1ctanθ ≥ 0 = kz2k and λi(z) = z1 < 0 for i = 1, 2. Hence, z ∈ −Lπ

2−θ = −Lθ and both sides of (19) are 0 by (14).

Case 2: z2 6= 0. If z ∈ Lθ, then z1tan θ ≥ kz2k which implies z1 ≥ 0. Therefore, λi(z) ≥ 0 for i = 1, 2 which gives ΠLθ(z) = z = λ1(z)u1z2(z)u2z by (14) and (18). If z ∈ −Lθ, then

(12)

−z ∈ Lπ

2−θ, i.e., −z1ctanθ ≥ kz2k, which says z1 ≤ 0. Hence, λ1(z) = z1− kz2kctanθ ≤ 0 and λ2(z) = z1+kz2k tan θ ≤ 0. This indicates that the right-hand side of (19) is zero and it coincides ΠLθ(z) = 0 by (14) under this case. Other cases correspond to z1tan θ < kz2k and −z1ctanθ < kz2k, i.e., λ1(z) = z1− kz2kctanθ < 0 and λ2(z) = z1+ kz2k tan θ > 0.

Simplifying the right-hand side of (19) with this, we see that (14) is also satisfied under this case. Thus, all the above shows the validity of (19). 2

In particular, when θ = π/4, expressions (18) and (19) takes, respectively, the form of

z = (z1− kz2k)1 2

 1

−w



+ (z1+ kz2k)1 2

 1 w



and

ΠLθ(z) = (z1 − kz2k)+1 2

 1

−w



+ (z1+ kz2k)+1 2

 1 w



where w = z2

kz2k if z2 6= 0, and any vector in IRn−1 satisfying kwk = 1 if z2 = 0. These are exactly the well-known spectral factorization and projection associated with Kn.

We believe that the spectral factorization given in Theorem 3.1 is very important for developing theory and algorithm for optimization associated with Lθ like the role played by the spectral factorization associated with Kn in second-order cone optimization. We leave it for our future research topic.

References

[1] H. H. Bauschke and S. G. Kruk, Reflection-projection method for convex fea- sibility problems with an obtuse cone, Journal of Optimization Theory and Applica- tions, vol. 120, pp. 503-531, 2004.

[2] H. H. Bauschke, Projection Algorithms and Monotone Operators, PhD Thesis, Simon Fraster University, 1996.

[3] J. F. Bonnans and H. R. Cabrera, Perturbation analysis of second-order cone programming problems, Mathematical Programming, vol. 104, pp. 205-227, 2005.

[4] J. F. Bonnans and A. Shapiro, Perturbation Analyisis of Optimization Problems, Springer-Verlag, New York, 2000.

[5] J.-S. Chen, The convex and monotone functions associated with second-order cone, Optimization, vol. 55, pp. 363-385, 2006.

(13)

[6] J.-S. Chen, X. Chen and P. Tseng, Analysis of nonsmooth vector-valued func- tions associated with second-oredr cone, Mathmatical Programming, vol. 101, pp.

95-117, 2004.

[7] J.-S. Chen, T.-K. Liao, and S.-H. Pan, Using Schur Complement Theorem to prove convexity of some SOC-functions, Journal of Nonlinear and Convex Analysis, vol. 13, pp. 421-431, 2012.

[8] A. Pinto Da Costa and A. Seeger, Numberical resolultion of cone-constrained eigenvalue problems, Computational and Applied Mathematics, vol. 28, pp. 37-61, 2009.

[9] J. Dattorro, Convex Optimization and Euclidean Distance Geometry, Meboo Pub- lishing, California, 2005.

[10] J.-C. Zhou and J.-S. Chen, Saddle points of nonlinear second-order cone pro- gramming and second order regularity of second-order cone, submitted manuscript, 2012.

參考文獻

相關文件

Based on a class of smoothing approximations to projection function onto second-order cone, an approximate lower order penalty approach for solving second-order cone

It is well known that second-order cone (SOC) programming can be regarded as a special case of positive semidefinite programming using the arrow matrix.. This paper further studies

To achieve inequalities for the aforementioned SOC weighted means, we need the following technical lemma [10, Lemma 4], which is indeed a symmetric cone version of

Note that we know the second-order cone and positive semi-definite cone are both second order regular, but there are all symmetric.. Thus this is an interesting case which indicates

It is well known that second-order cone programming can be regarded as a special case of positive semidefinite programming by using the arrow matrix.. This paper further studies

Then, we achieve a few inequalities on the new-extended weighted means and their corresponding traces associated with second-order cone.. As a byproduct, a version of

Chen, A semi-distance associated with symmetric cone and a new proximal distance function on second-order cone, Linear and Nonlinear Anal.. 5

The second algorithm is based on the Fischer-Burmeister merit function for the second-order cone complementarity problem and transforms the KKT system of the second-order