• 沒有找到結果。

(i) f is strongly convex on C, if ∃ α > 0 s.t. ∀ x, y ∈ C, f (y) − f (x) ≥ hy − x, ∇f (x)i + αky − xk

N/A
N/A
Protected

Academic year: 2021

Share "(i) f is strongly convex on C, if ∃ α > 0 s.t. ∀ x, y ∈ C, f (y) − f (x) ≥ hy − x, ∇f (x)i + αky − xk"

Copied!
4
0
0

加載中.... (立即查看全文)

全文

(1)

§2 A Generalized Proximal Point Algorithm

In this section, we introduce a generalized proximal point algorithm for solving vari- ational inequalities involving general set-valued operators. We first recall the concept of the strongly convex functions introduced by [28] and the some definitions of continuous property as follows.

Definition 2.1. Let C be a closed convex subset of X, f be differentiable on a neighborhood of C, and let ∇f be the gradient of f .

(i) f is strongly convex on C, if ∃ α > 0 s.t. ∀ x, y ∈ C, f (y) − f (x) ≥ hy − x, ∇f (x)i + αky − xk

2

;

4

(2)

(ii) f is convex on C, if ∀ x, y ∈ C, f (y) − f (x) ≥ hy − x, ∇f (x)i.

Also, for a set-valued operator T : X −→ X

, we shall say that

(iii) T is upper semicontinuous (u.s.c.) at x ∈ X, if for any open set G containing T (x), there is some neighborhood V (x) of x, such that T (y) ⊂ G for all y ∈ V (x);

(iv) T is (l,w)-u.s.c., if T is u.s.c. from line segments in X to the weak topology of X

; (v) T is (w,s)-u.s.c., if T is u.s.c. from the weak topology of X to the norm topology of

X

;

(vi) T is Lipschitz continuous, if there exists a constant m > 0 such that kw

1

− w

2

k ≤ mku − vk, ∀ w

1

∈ T (u), w

2

∈ T (v).

To establish our proximal point algorithm, we consider a differentiable auxiliary func- tion M : C −→ R, and denoted ∇M by M

0

. Define B : C × C × C −→ R by

B(x, y, z) = hM

0

(x) − M

0

(y), z − xi.

For some x ∈ C, y ∈ T (x), and a positive number ε, we introduce the auxiliary problem (P 2) as follows.

(P2) : find ˜ x ∈ C such that

εhy, z − xi + B(˜ x, x, z) ≥ 0, ∀z ∈ C. (2) In fact, if ˜ x exists and equal to x, then B(˜ x, x, z) = 0, which implies that ˜ x is a solution of the original problem (P 1) (i.e. (P 2) reduces to (P 1) ).

Algorithm 1 to VI(T,C) :

Step 1: Take any x

0

∈ C and y

0

∈ T (x

0

).

Step 2: Knowing (x

k

, y

k

) and ε

k

, compute x

k+1

∈ C such that

ε

k

hy

k

, z − x

k+1

i + B(x

k+1

, x

k

, z) ≥ 0, ∀z ∈ C. (3) Step 3: Take any y

k+1

∈ T (x

k+1

) and return to Step 2, until kx

k+1

− x

k

k is below

some threshold.

Remark. In particular, if M (x) =

12

kxk

2

, then M

0

(x) =

(

x, in a Hilbert space J (x), in a Banach space . In this case, our algorithm 1 reduces to the classical proximal point algorithm.

5

(3)

We also need the following well-known proposition, due to Karamardian [14, 17], see also Ortoga [20].

Proposition 2.2 [14, Proposition 6.1]. Let f be a differentiable function on an open convex set C of X, then f is strongly convex on C with constant α ⇐⇒ ∇f is strongly monotone with constant b on C, where b = 2α.

We give hereafter a lemma that will be used to prove the well-definedness of the sequence {x

k

} generated by our Algorithm 1.

Lemma 2.3. Let x ∈ C and T be weakly monotone with constant L. If M

0

is strongly monotone with constant b on C, and ε <

Lb

, then the operator F (y) = εT (y) + M

0

(y) − M

0

(x) is strongly monotone on C with constant b − εL. Moreover if T is (l,w)-u.s.c. and M

0

is (l,w)-continuous, then F is also (l,w)-u.s.c..

Proof : For all (y

1

, w

1

), (y

2

, w

2

) ∈ G(F ), we have some z

1

∈ T (y

1

) and z

2

∈ T (y

2

) such that

w

1

= εz

1

+ M

0

(y

1

) − M

0

(x), and

w

2

= εz

2

+ M

0

(y

2

) − M

0

(x).

Since T is weakly monotone,

hz

1

− z

2

, y

1

− y

2

i ≥ −Lky

1

− y

2

k

2

, and M

0

is strongly monotone,

hM

0

(y

1

) − M

0

(y

2

), y

1

− y

2

i ≥ bky

1

− y

2

k

2

. Thus,

hw

1

− w

2

, y

1

− y

2

i = εh(z

1

− z

2

) + M

0

(y

1

) − M

0

(y

2

), y

1

− y

2

i

= εhz

1

− z

2

, y

1

− y

2

i + hM

0

(y

1

) − M

0

(y

2

), y

1

− y

2

i

≥ ε(−L)ky

1

− y

2

k

2

+ bky

1

− y

2

k

2

= (b − εL)ky

1

− y

2

k

2

,

Hence F is strongly monotone. On the other hand, let G be open in the weak topology of X

and F (y) ⊂ G, where y ∈ L, and L is a line segment in X. This implies that

εT (y) ⊂ G + M

0

(x) − M

0

(y).

Since T and hence εT is (l, w)-u.s.c. at y, there is an open neighborhood V

1

(x) of X such that

εT (z) ⊂ G + M

0

(x) − M

0

(y), ∀z ∈ V

1

(x) ∩ L.

6

(4)

Hence, for all w ∈ εT (z),

w ∈ G + M

0

(x) − M

0

(y), which implies that

M

0

(y) ∈ G + M

0

(x) − w.

Since M

0

is (l, w)-continuous, there is an open neighborhood V

2

(x) of X such that M

0

(u) ∈ G + M

0

(x) − w, ∀u ∈ V

2

(x) ∩ L.

Thus above results imply

w + M

0

(u) − M

0

(x) ∈ G, ∀w ∈ εT (z), u ∈ V

2

(x) ∩ L, z ∈ V

1

(x) ∩ L.

Let V (x) = V

1

(x) ∩ V

2

(x). Then for all z ∈ V (x) ∩ L, we have F (y) = εT (z) + M

0

(z) − M

0

(x) ⊂ G.

Thus, we complete the proof. 2

Remark. Under different assumptions on T , the strongly monotone constants of F are different. For example,

(i) if T is monotone, then F is strongly monotone with constant b;

(ii) if T is strongly monotone with constant a, then F is strongly monotone with constant εa + b;

(iii) if T has the Dunn property, then F is strongly monotone with constant b.

7

參考文獻

相關文件

the larger dataset: 90 samples (libraries) x i , each with 27679 features (counts of SAGE tags) (x i ) d.. labels y i : 59 cancerous samples, and 31

The best way to picture a vector field is to draw the arrow representing the vector F(x, y) starting at the point (x, y).. Of course, it’s impossible to do this for all points (x, y),

If x or F is a vector, then the condition number is defined in a similar way using norms and it measures the maximum relative change, which is attained for some, but not all

In particular, if s = f(t) is the position function of a particle that moves along a straight line, then f ′(a) is the rate of change of the displacement s with respect to the

Particles near (x, y, z) in the fluid tend to rotate about the axis that points in the direction of curl F(x, y, z), and the length of this curl vector is a measure of how quickly

Figure 6 shows the relationship between the increment ∆y and the differential dy: ∆y represents the change in height of the curve y = f(x) and dy represents the change in height

The sample point can be chosen to be any point in the subrectangle R ij , but if we choose it to be the upper right-hand corner of R ij [namely (x i , y j ), see Figure 3],

[classification], [regression], structured Learning with Different Data Label y n. [supervised], un/semi-supervised, reinforcement Learning with Different Protocol f ⇒ (x n , y