One-day Workshop on Probability and Finance
July 10 (Thursday), 2008
Room 308, New Mathematics Building, National Taiwan University

About this Workshop
The meeting aims to present recent research in Probability Theory and Financial Mathematics. It consists of four one-hour lectures and a session of short communications. Informal walk-in talks are welcome for the SC session. No registration is required.
One-Hour Lectures, Morning Session
10:00 - 11:00 Horng-Tzer Yau (Harvard University)
11:15 - 12:15 Chuan-Hsiang Han (National Tsing Hua University)

Lunch at the venue
One-Hour Lectures, Afternoon Session
13:30 - 14:30 Ching-Tang Wu (National Chiao Tung University)
14:45 - 15:45 Guan-Yu Chen (National Chiao Tung University)

Short Communications
16:00 - 17:30 Each talk is less than 20 minutes.
Currently scheduled: N.-R. Shieh (NTU)

Sponsors:
Taida Institute of Mathematical Science (http://www.tims.ntu.edu.tw)
Mathematics Division, National Center for Theoretical Sciences (Taipei Office)
(http://math.cts.ntu.edu.tw/)
Local semicircle law and complete delocalization for Wigner random matrices
Horng-Tzer Yau
Joint work with L. Erdős and B. Schlein (Munich)
WIGNER ENSEMBLE
H = (h_jk) is a Hermitian N × N matrix, N ≫ 1, with entries

h_jk = (x_jk + i y_jk)/√N  (j < k),   h_jj = √(2/N) x_jj,

where x_jk, y_jk (j < k) and x_jj are independent with distributions

x_jk, y_jk ∼ dν := e^{−g(x)} dx.

Normalization: E x_jk = 0, E x_jk² = 1/2.

Example: g(x) = x² gives the GUE.

The normalization ensures that Spec(H) = [−2, 2] + o(1).

Results hold for real symmetric matrices as well, e.g. for the GOE.
[Figure: the semicircle density ϱ_sc(x) = (1/2π)√(4 − x²), supported on [−2, 2].]
Eigenvalues: E_1 ≤ E_2 ≤ ... ≤ E_N.
The typical eigenvalue spacing is E_i − E_{i−1} ∼ N^{−1}.
MAIN QUESTIONS
1) Density of states (DOS) — Wigner semicircle law.
2) Eigenvalue spacing distribution (Wigner-Dyson statistics and level repulsion);
3) (De)localization properties of eigenvectors.
RELATIONS:
• 2) is finer than 1) [bulk vs. individual ev.]
• Level repulsion ⇐⇒ Delocalization ??? [Big open conjecture]
Motivation in the background: random Schrödinger operators in the extended states regime.
DENSITY OF STATES

𝒩(I) := #{μ_n ∈ I}, the number of eigenvalues μ_n of H in I ⊂ ℝ.

Smoothed density of states around E with window size η:

ϱ_η(E) = (1/Nπ) Im Tr (H − E − iη)^{−1} = (1/Nπ) Σ_α η / ((μ_α − E)² + η²)

ϱ_η(E) and 𝒩(I) with I = [E − η/2, E + η/2] are closely related.
WIGNER SEMICIRCLE LAW

For any fixed I ⊂ ℝ,

lim_{N→∞} E 𝒩(I)/N = ∫_I ϱ_sc(x) dx,   ϱ_sc(x) = (1/2π) √(4 − x²) · 1(|x| ≤ 2).

A similar statement holds for ϱ_η(E) with fixed window size η = O(1).

Fluctuations and almost sure convergence are also known.
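The semicircle law is easy to observe numerically. The following sketch (our illustration, not part of the talk) samples a GUE matrix with the normalization above and compares the empirical fraction of eigenvalues in I = [−1, 1] with the semicircle mass of I:

```python
import numpy as np

def gue_eigenvalues(N, rng):
    """Eigenvalues of an N x N GUE matrix normalized so Spec(H) ~ [-2, 2]."""
    X = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
    H = (X + X.conj().T) / (2.0 * np.sqrt(N))  # Hermitian, E|h_jk|^2 = 1/N
    return np.linalg.eigvalsh(H)

def semicircle_mass(a, b):
    """Integral of (1/2pi) sqrt(4 - x^2) over [a, b], with |a|, |b| <= 2."""
    F = lambda x: (x * np.sqrt(4.0 - x * x) / 2.0
                   + 2.0 * np.arcsin(x / 2.0)) / (2.0 * np.pi)
    return F(b) - F(a)

rng = np.random.default_rng(0)
ev = gue_eigenvalues(400, rng)
frac = np.mean((ev >= -1.0) & (ev <= 1.0))
print(frac, semicircle_mass(-1.0, 1.0))  # both close to 0.609
```

Already at N = 400 the empirical count matches the semicircle prediction to a few percent.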
The Wigner-Dyson statistics (universal distribution of the eigenvalue spacing) requires information on individual eigenvalues at scale η ∼ 1/N. It is believed to hold for general Wigner matrices, but it is proven only for Gaussian and related models, and the proofs use explicit formulas for the joint eigenvalue distribution. [Dyson, Deift, Johansson]
GOALS:
(i) Prove the Semicircle Law at any scale η ≫ N^{−1}.
(ii) Prove that eigenvectors are delocalized.
Theorem 1: Fix κ, ε > 0 and let η ≫ (log N)^8/N. Then, as N → ∞,

P( sup_{|E| ≤ 2−κ} | 𝒩([E − η/2, E + η/2])/(Nη) − ϱ_sc(E) | ≥ ε ) ≤ e^{−c(log N)²},

i.e. the Semicircle Law holds for energy windows ∼ 1/N (modulo logs).
Theorem 2: Fix κ > 0. Then

P( ∃ v : ‖v‖₂ = 1, Hv = μv, |μ| ≤ 2−κ, ‖v‖_∞ ≥ (log N)^5 / N^{1/2} ) ≤ e^{−c(log N)²},

i.e. almost all eigenfunctions are fully delocalized.
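Delocalization is also easy to see numerically. In this sketch (ours, with the GUE standing in for a general Wigner matrix) we compute √N · ‖v‖_∞ over all bulk eigenvectors and observe a small, logarithmic-size quantity rather than anything near √N:

```python
import numpy as np

rng = np.random.default_rng(1)
N = 300
X = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
H = (X + X.conj().T) / (2.0 * np.sqrt(N))  # GUE, Spec(H) ~ [-2, 2]
mu, V = np.linalg.eigh(H)          # columns of V: l2-normalized eigenvectors
bulk = np.abs(mu) <= 1.5           # stay away from the edge (kappa = 0.5)
sup_norms = np.abs(V[:, bulk]).max(axis=0)
val = np.sqrt(N) * sup_norms.max()
print(val)                         # O((log N)^c), far below sqrt(N) ~ 17
```

A localized eigenvector would give √N · ‖v‖_∞ of order √N; here the worst bulk eigenvector stays within a few units.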
ASSUMPTIONS on the single-site distribution dν = e^{−g(x)} dx:

(i) sup g'' < ∞;

(ii) there exists δ > 0 such that ∫ e^{δx²} dν(x) < ∞;

(iii) dν satisfies the logarithmic Sobolev inequality

∫ u log u dν ≤ C ∫ |∇√u|² dν.

Item (i) was needed for a concentration lemma. J. Bourgain has informed us that this lemma also holds if (i) is replaced by decay stronger than Gaussian (e.g. bounded random variables).
Lemma [Upper bound]: Assume g'' < ∞ and let |I| ≥ (log N)/N. Then, for large K,

P{𝒩(I) ≥ KN|I|} ≤ e^{−cKN|I|}.

A similar result holds for P{ϱ_η(E) ≥ K}.

Proof: Decompose

H = ( h  a* ; a  B ),   h ∈ ℂ,  a ∈ ℂ^{N−1},  B ∈ ℂ^{(N−1)×(N−1)}.

Let λ_α, u_α be the eigenvalues and eigenvectors of B and define

ξ_α := N |a · u_α|²,   E ξ_α = 1.

For the (1,1) matrix element of G_z = (H − z)^{−1}, z = E + iη:

G_z(1,1) = [h − z − a · (B − z)^{−1} a]^{−1} = [h − z − (1/N) Σ_{α=1}^{N−1} ξ_α/(λ_α − z)]^{−1},

so

|G_z(1,1)| ≤ 1 / Im[h − z − (1/N) Σ_α ξ_α/(λ_α − z)]
           ≤ [η + (η/N) Σ_α ξ_α/((λ_α − E)² + η²)]^{−1}
           ≤ CNη / Σ_{α: λ_α ∈ I} ξ_α

for any interval I = [E − η, E + η]. Then

𝒩_I ≤ Cη Im Tr G_z ≤ Cη Σ_{k=1}^N |G_z(k,k)|.

Repeating the above construction for each k,

𝒩_I ≤ CNη² Σ_{k=1}^N [ Σ_{α: λ_α^{(k)} ∈ I} ξ_α^{(k)} ]^{−1},

so to get an upper bound on 𝒩_I we need a lower bound on Σ_α ξ_α.
Good news: for the decomposition H = ( h  a* ; a  B ), the eigenvalues μ_α of H and λ_α of B are interlaced:

μ_1 ≤ λ_1 ≤ μ_2 ≤ λ_2 ≤ ...,

so the number of λ_α in I is 𝒩(I) ± 1.
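The interlacing of the eigenvalues of H and its minor (Cauchy's interlacing theorem) can be checked numerically; this small sketch (our illustration) uses a real symmetric matrix:

```python
import numpy as np

rng = np.random.default_rng(2)
N = 8
A = rng.standard_normal((N, N))
H = (A + A.T) / 2.0                   # real symmetric example
mu = np.linalg.eigvalsh(H)            # eigenvalues of H (sorted)
lam = np.linalg.eigvalsh(H[1:, 1:])   # eigenvalues of the minor B
# Cauchy interlacing: mu_1 <= lam_1 <= mu_2 <= ... <= lam_{N-1} <= mu_N
ok = all(mu[k] <= lam[k] <= mu[k + 1] for k in range(N - 1))
print(ok)  # True
```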
Recall

𝒩_I ≤ CNη² Σ_{k=1}^N [ Σ_{α: λ_α^{(k)} ∈ I} ξ_α^{(k)} ]^{−1}.

Suppose

Σ_{α: λ_α^{(k)} ∈ I} ξ_α^{(k)} ≥ c #{λ_α^{(k)} ∈ I} ≥ c𝒩(I)

(recalling E ξ = 1 and hoping for weak correlation); then we would have

𝒩(I) ≲ N²η²/𝒩(I)  ⟹  𝒩(I) ≲ Nη.
Lower bound on Σ_α ξ_α:

Recall ξ_α = N |a · u_α|². Note that a is independent of λ_α, u_α. The ξ_α's are not independent, but almost, so their sum has a strong concentration property.

Lemma: Let g'' < ∞ or supp ν be compact. Then

P{ Σ_{α∈A} ξ_α ≤ δ|A| } ≤ e^{−c|A|}.

Note that

Σ_{α∈A} ξ_α = N Σ_{α∈A} |a · u_α|² = N |P_A a|²,   P_A = projection onto span{u_α : α ∈ A}.

Lemma: Let z = (z_1, ..., z_N), z_j = x_j + iy_j with x_j, y_j ∼ dν(x), and let P be a projection of rank m in ℂ^N. Then

E e^{−c(Pz, Pz)} ≤ e^{−c' E(Pz, Pz)} = e^{−c'm}.
Proof of the local semicircle law: Consider the Stieltjes transform

m(z) = ∫ ϱ(x) dx / (x − z).

The Stieltjes transform of the semicircle law satisfies the fixed point equation

m_sc(z) + 1/(m_sc(z) + z) = 0.

This fixed point equation is stable away from the spectral edge.
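Both the fixed point and its stability are easy to verify numerically; the sketch below (ours) solves m² + zm + 1 = 0 in closed form and checks that the iteration m ↦ −1/(m + z) contracts toward it for a bulk energy:

```python
import cmath

def m_sc(z):
    """Stieltjes transform of the semicircle law: the root of
    m^2 + z m + 1 = 0 with Im m > 0 for Im z > 0."""
    s = cmath.sqrt(z * z - 4.0)
    if s.imag * z.imag < 0:      # pick the branch with Im m_sc > 0
        s = -s
    return (-z + s) / 2.0

z = 0.5 + 0.1j                   # bulk energy E = 0.5, window eta = 0.1
m = m_sc(z)
print(abs(m + 1.0 / (m + z)))    # fixed-point residual, ~ 0

# Stability away from the edge: m -> -1/(m + z) contracts toward m_sc.
m_it = 1j
for _ in range(200):
    m_it = -1.0 / (m_it + z)
print(abs(m_it - m))             # ~ 0
```

Near the edge |E| ≈ 2 the contraction factor approaches 1 and the iteration (like the argument) degrades, which is why the theorems stay at |E| ≤ 2 − κ.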
Let m(z) be the Stieltjes transform of the empirical density of H, and m^{(k)}(z) that of the minor B^{(k)}:

m(z) = (1/N) Tr (H − z)^{−1},   m^{(k)}(z) = (1/(N−1)) Tr (B^{(k)} − z)^{−1}.

Then from the expansion

m(z) = (1/N) Σ_{k=1}^N G_z(k,k) = (1/N) Σ_{k=1}^N [h_kk − z − a^{(k)} · (B^{(k)} − z)^{−1} a^{(k)}]^{−1}

we obtain

m = (1/N) Σ_{k=1}^N [h_kk − z − (1 − 1/N) m^{(k)} − X_k]^{−1}

with

X_k = a^{(k)} · (B^{(k)} − z)^{−1} a^{(k)} − E_k[a^{(k)} · (B^{(k)} − z)^{−1} a^{(k)}]
    = (1/N) Σ_{α=1}^{N−1} (ξ_α^{(k)} − 1)/(λ_α^{(k)} − z),

where the subtracted expectation equals (1 − 1/N) m^{(k)} (recall ξ_α^{(k)} = N |a^{(k)} · u_α^{(k)}|², E_k ξ_α^{(k)} = 1).

(i) P{|h_kk| ≥ ε} ≤ e^{−δε²N};

(ii) by the interlacing property, m − (1 − 1/N) m^{(k)} = o(1);

(iii) Lemma: P{|X_k| ≥ ε} ≤ e^{−cε(log N)²}.

Then, away from an event of tiny probability, we have

m = −(1/N) Σ_{k=1}^N 1/(m + z + δ_k),

where the random variables δ_k satisfy |δ_k| ≤ ε. From the stability of the equation m_sc = −1/(m_sc + z), we get |m − m_sc| ≤ Cε.
Proof of the Lemma (drop the superscript k):

X = (1/N) Σ_{α=1}^{N−1} (ξ_α − 1)/(λ_α − z),   ξ_α = N |a · u_α|².

With high probability in the probability space of the minor, #{λ_α ∈ I} ≤ Nη(log N)². Fix such an event and work in the randomness of a. Compute

(d/dβ) [e^{−β} log E e^{e^β X}] = e^{−β} E[u log u] ≤ C e^{−β} E|∇√u|²,   u := e^{e^β X}/E e^{e^β X},

and

e^{−β} E|∇√u|² ≤ e^β E[ u Σ_k |∂X/∂a_k|² ] = (e^β/N²) E[ u Σ_α ξ_α/|λ_α − z|² ] ≤ (e^β/(Nη)) E[uY]

with

Y = (1/N) Σ_α ξ_α/|λ_α − z|.

Use the entropy inequality

E[uY] ≤ γ^{−1} E[u log u] + γ^{−1} log E e^{γY}

(with the optimal γ ∼ e^β/(Nη)) and log-Sobolev once more to bound E|∇√u|² in terms of log E e^{γY}. Integrate the resulting inequality

(d/dβ) [e^{−β} log E e^{e^β X}] ≤ e^{−β} log E e^{γY}

from −∞ to β₀ ∼ (1/2) log(Nη) − 2 log log N. The boundary term at β = −∞ vanishes since EX = 0; thus

log E e^{e^{β₀} X} ≤ log E e^{δY},   Y = (1/N) Σ_α ξ_α/|λ_α − z|,

with δ ∼ 1/(log N)⁴ ≪ 1.

Since ξ_α = N |a · u_α|² has a finite exponential moment, if there are not too many λ_α near E, then Y has a finite exponential moment for small δ.

This controls the exponential moment of X.
EXTENDED STATES: EIGENVECTOR DELOCALIZATION

For matrices there is no concept of absolutely continuous spectrum. Instead: v ∈ ℂ^N with ‖v‖₂ = 1 is extended if ‖v‖_p ∼ N^{1/p − 1/2} for p ≠ 2.

E.g. for the GUE, all eigenvectors have ‖v‖₄ ∼ N^{−1/4} (by symmetry).

Question: does this hold for general Wigner matrices? [T. Spencer]

Our Theorem 2 answers this in the strongest possible norm, with log corrections, for all eigenvectors (away from the edge).
Theorem 2: Fix κ > 0. Then

P( ∃ v : ‖v‖₂ = 1, Hv = μv, |μ| ≤ 2−κ, ‖v‖_∞ ≥ (log N)^5 / N^{1/2} ) ≤ e^{−c(log N)²}.
Proof: Decompose as before,

H = ( h  a* ; a  B ).

Let Hv = μv and v = (v₁, w), w ∈ ℂ^{N−1}. Then

h v₁ + a · w = μ v₁,   a v₁ + B w = μ w  ⟹  w = (μ − B)^{−1} a v₁.

From the normalization 1 = ‖w‖² + |v₁|², we have

|v₁|² = 1 / (1 + (1/N) Σ_α ξ_α/(μ − λ_α)²) ≤ 1 / ( (1/N) (N/q)² Σ_{α∈A} ξ_α ),   (ξ_α := N |a · u_α|²)

where λ_α, u_α are the eigenvalues and eigenvectors of B and

A = { α : |λ_α − μ| ≤ q/N },   q ∼ (log N)^8.

The concentration inequality and the lower bound on the local DOS imply

Σ_{α∈A} ξ_α ≥ c|A| ≥ cq

with very high probability; thus

|v₁|² ≲ q/N  ⟹  ‖v‖_∞ ≤ N^{−1/2} modulo logs.
SUMMARY

• All results hold for general Wigner matrices; no Gaussian formulas are used.
• We established the Semicircle Law for the DOS on scale (log N)^8/N (optimal modulo logs).
• All eigenvectors are fully delocalized away from the spectral edges, with an optimal estimate on the sup norm (modulo logs).

OPEN QUESTIONS:

• Are all conditions necessary (strong decay plus log-Sobolev)?
• Wigner-Dyson distribution of level spacings [DREAM...]
Large Deviations, Small Default Probabilities and
Importance Sampling
Chuan-Hsiang Han
Dept. of Quantitative Finance, NTHU TIMS
July 10, 2008
Outline
• Credit Derivatives: market data and issues
• Approach I - reduced form: copula method
• Approach II - structural form: first passage time problem
• Modification: stochastic correlation
• Conclusions and Future Works
Introduction of Credit Derivatives
• A contract between two parties whose value is contingent on the creditworthiness of underlying asset(s).
• Single-name: only one reference asset, like CDS (Credit Default Swaps).
• Multi-name: several assets in one basket, like CDO (Collateralized Debt Obligations) or BDS (Basket Default Swaps).
Credit Default Swap
[Chart: notional outstanding of Credit Default Swaps (USD), 2001-2007, rising from near 0B toward 65,000B. ISDA Market Survey; source: Securities Industry and Financial Markets Association.]
An Example: Credit Swap Evaluation

premium = E{ (1 − R) × B(0, τ) × I(τ < T) } / E{ Σ_{j=1}^N Δ_{j−1,j} × B(0, t_j) × I(τ > t_j) }

Notation: τ: default time, R: recovery rate, B(0, t): discount factor, Δ_{j−1,j}: time increment.
Some Mathematical Issues
• Modeling default time
• Modeling correlations between default times
• Estimating joint default probability: a rare event in high dimension
Approaches to Modeling Default Times
• Intensity-Based (Reduced Form)
View the firm's default as extraneous, modeling the hazard rate of the firm:

IP(τ ≤ t) = F(t) = 1 − exp( −∫₀ᵗ h(s) ds ).
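Default times in the reduced-form model are sampled by inverting the cumulated hazard. A minimal sketch (ours), assuming for illustration a constant hazard rate h so that τ = E/h with E ∼ Exp(1):

```python
import numpy as np

rng = np.random.default_rng(3)
h, t, n = 0.2, 3.0, 200_000
# Constant hazard (illustrative assumption): H(t) = h t, so
# tau = E / h with E ~ Exp(1) gives IP(tau <= t) = 1 - exp(-h t).
tau = rng.exponential(scale=1.0, size=n) / h
empirical = np.mean(tau <= t)
exact = 1.0 - np.exp(-h * t)
print(empirical, exact)   # both ~ 0.451
```

For a time-dependent hazard one inverts H(t) = ∫₀ᵗ h(s) ds numerically in the same way.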
• Asset Value-Based (Structural Form)

First passage time problem, in 2-d:

dS_{1t} = μ₁ S_{1t} dt + σ₁ S_{1t} dW_{1t}
dS_{2t} = μ₂ S_{2t} dt + σ₂ S_{2t} d(ρ W_{1t} + √(1 − ρ²) W_{2t})

Joint default occurs if S_{1t} < B₁ and S_{2t} < B₂ for some t ≤ T.
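A basic (non-importance-sampled) Monte Carlo estimate of this joint default probability can be sketched as follows; the barriers B₁ = B₂ = 70 are our choice so that the event is not too rare for a plain simulation:

```python
import numpy as np

rng = np.random.default_rng(4)
n_paths, n_steps, T = 20_000, 200, 1.0
dt = T / n_steps
mu1 = mu2 = 0.05; s1 = s2 = 0.4; rho = 0.3
B1 = B2 = 70.0                          # illustrative barriers (our choice)
S1 = np.full(n_paths, 100.0); S2 = np.full(n_paths, 100.0)
hit1 = np.zeros(n_paths, bool); hit2 = np.zeros(n_paths, bool)
for _ in range(n_steps):
    dW1 = np.sqrt(dt) * rng.standard_normal(n_paths)
    dW2 = np.sqrt(dt) * rng.standard_normal(n_paths)
    S1 *= 1.0 + mu1 * dt + s1 * dW1
    S2 *= 1.0 + mu2 * dt + s2 * (rho * dW1 + np.sqrt(1 - rho**2) * dW2)
    hit1 |= S1 < B1                      # barrier monitored at grid times
    hit2 |= S2 < B2
dp = np.mean(hit1 & hit2)
print(dp)
```

With barriers far below S₀ the plain estimator returns 0 on all paths, which is exactly the rare-event problem that motivates importance sampling below.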
Reduced Form Approach: Copula Method*

Default times modeling: {τ_i = F_i^{−1}(U_i)}_{i=1}^n, where the U_i's are (standard) uniform random variables.

An n-dimensional copula is a distribution function on [0, 1]^n with uniform marginal distributions.

Through a copula function, one can build up correlations between default times.

* Cherubini, Luciano, Vecchiato (2004), Nel-
Gaussian Copula
• Li (2000) introduced the Gaussian copula

C(u₁, u₂, ..., u_n; Σ) = Φ_Σ(Φ^{−1}(u₁), Φ^{−1}(u₂), ..., Φ^{−1}(u_n)),

where Σ denotes the variance-covariance matrix.

• Laurent and Gregory (2003) introduced the Gaussian factor copula, so that the number of parameters is reduced from O(n²) to O(n).

• Easy to compute, but lacking economic sense.
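Sampling from a Gaussian copula is a two-line recipe: draw correlated Gaussians, map them to uniforms with Φ, then push through the marginal inverses F_i^{−1}. A sketch (ours), using exponential default-time marginals with hazard h = 0.1 as an illustrative assumption:

```python
import numpy as np
from math import erf, sqrt

Phi = np.vectorize(lambda x: 0.5 * (1.0 + erf(x / sqrt(2.0))))  # standard normal cdf

rng = np.random.default_rng(5)
n, rho = 100_000, 0.5
Sigma = np.array([[1.0, rho], [rho, 1.0]])
L = np.linalg.cholesky(Sigma)
Z = rng.standard_normal((n, 2)) @ L.T    # correlated Gaussians
U = Phi(Z)                               # Gaussian-copula uniforms
h = 0.1                                  # illustrative constant hazard
tau = -np.log(1.0 - U) / h               # tau_i = F_i^{-1}(U_i), Exp(h) marginals
print(U[:, 0].mean(), np.corrcoef(tau.T)[0, 1])
```

The marginals of τ stay exactly exponential while the copula parameter ρ controls the dependence between the two default times.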
Structural Form Approach: Review
• Merton (1974) applied Black-Scholes option theory (1973). Default can only happen at maturity.
• Black and Cox (1976) proposed the (1-dim) first passage time problem to model the default event.
• Zhou (2001) extended to 2-dim case.
Credit Risk Modeling:
Structural Form Approach
Multi-name dynamics: for 1 ≤ i ≤ n,

dS_{it} = μ_i S_{it} dt + σ_i S_{it} dW_{it},   d⟨W_{it}, W_{jt}⟩ = ρ_{ij} dt.

The default time τ_i of the i-th name is defined as τ_i = inf{t ≥ 0 : S_{it} ≤ B_i}, where B_i denotes the i-th debt level. The i-th default event is defined as {τ_i ≤ T}.
Joint Default Probability:
First Passage Time Problem
Q: How to compute, for any finite number n of names,

DP = E{ Π_{i=1}^n I(τ_i ≤ T) | F_t } ?

Explicit formulas exist only for the 1- and 2-name cases so far... (and none for stochastic correlation/volatility...)
Multi-Dimensional Girsanov Theorem
Given a Radon-Nikodym derivative

dIP/dĨP = Q_T^h = exp( ∫₀ᵀ h(s, S_s) · dW̃_s − (1/2) ∫₀ᵀ ‖h(s, S_s)‖² ds ),

W̃_t = W_t + ∫₀ᵗ h(s, S_s) ds is a vector of Brownian motions under ĨP. Thus

DP = Ẽ{ Π_{i=1}^n I(τ_i ≤ T) Q_T^h }.

If h = −(1/DP) σ^T ∇DP, the new estimator has zero variance.
Monte Carlo Simulations:
Importance Sampling
An importance sampling method is to select a constant vector h = (h₁, ..., h_n) satisfying the n conditions

Ẽ{S_{iT} | F₀} = B_i,   i = 1, ..., n.

Each h_i can be uniquely determined from the linear system

Σ_{j=1}^n ρ_{ij} h_j = μ_i/σ_i − ln(B_i/S_{i0})/(σ_i T),   for i = 1, ..., n.
Trajectories under different measures
Single Name Case
Single Name Default Probability

B  | BMC             | Exact Sol    | Importance Sampling
50 | 0.0886 (0.0028) | 0.0945       | 0.0890 (0.0016)
20 | 0 (0)           | 7.7 × 10⁻⁵   | 7.2 × 10⁻⁵ (2.3 × 10⁻⁶)
1  | 0 (0)           | 1.3 × 10⁻³⁰  | 1.8 × 10⁻³⁰ (3.4 × 10⁻³¹)

The number of simulations is 10⁴ and the Euler discretization takes time step size T/400, where T is one year. Other parameters are S₀ = 100, μ = 0.05 and σ = 0.4. Standard errors are shown in parentheses.
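The B = 50 row can be reproduced with a short script. This sketch (ours) simulates under the drift μ − σh dictated by the condition Ẽ{S_T} = B, reweights by the likelihood ratio, and compares against the known closed form for the first-passage probability of geometric Brownian motion; the small gap versus 0.0945 comes from monitoring the barrier only at grid times:

```python
import numpy as np
from math import erf, exp, log, sqrt

Phi = lambda x: 0.5 * (1.0 + erf(x / sqrt(2.0)))

S0, B, mu, sigma, T = 100.0, 50.0, 0.05, 0.4, 1.0
n_paths, n_steps = 10_000, 400
dt = T / n_steps
h = mu / sigma - log(B / S0) / (sigma * T)   # makes E~[S_T] = B

rng = np.random.default_rng(6)
W = np.zeros(n_paths)                 # Brownian motion under the new measure
logS = np.full(n_paths, log(S0))
hit = np.zeros(n_paths, bool)
for _ in range(n_steps):
    dW = sqrt(dt) * rng.standard_normal(n_paths)
    W += dW
    logS += (mu - sigma * h - 0.5 * sigma**2) * dt + sigma * dW
    hit |= logS <= log(B)
weights = np.exp(h * W - 0.5 * h * h * T)    # dIP/dIP~ on path space
est = np.mean(hit * weights)

# Closed form: first-passage probability of GBM below B before T
nu, b = mu - 0.5 * sigma**2, log(B / S0)
exact = (Phi((b - nu * T) / (sigma * sqrt(T)))
         + exp(2 * nu * b / sigma**2) * Phi((b + nu * T) / (sigma * sqrt(T))))
print(est, exact)   # ~0.089 (discrete monitoring) vs 0.0945
```

Under the plain measure fewer than one path in ten defaults; under the shifted measure default is typical and the weights restore unbiasedness.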
Three-Names Joint Default Probability

ρ    | BMC                        | Importance Sampling
0.3  | 0.0049 (6.98 × 10⁻⁴)       | 0.0057 (1.95 × 10⁻⁴)
0    | 3.00 × 10⁻⁴ (1.73 × 10⁻⁴)  | 6.40 × 10⁻⁴ (6.99 × 10⁻⁵)
-0.3 | 0 (0)                      | 2.25 × 10⁻⁵ (1.13 × 10⁻⁵)

Parameters are S₁₀ = S₂₀ = S₃₀ = 100, μ₁ = μ₂ = μ₃ = 0.05, σ₁ = σ₂ = 0.4, σ₃ = 0.3 and B₁ = B₂ = 50, B₃ = 60. Standard errors are shown in parentheses.

Note the effect of correlation! The debt-to-asset ratios (B_i/S_{i0}) are not small.
We propose an algorithm to compute the joint default probability. In fact, the choice of our new measure is optimal in Large Deviations Theory.
Large Deviations Theory:
Cramer’s Theorem
Let {X_i} be real-valued i.i.d. random variables under IP with E X₁ < ∞. For any x ≥ E X₁, we have

lim_{n→∞} (1/n) ln IP( S_n/n ≥ x ) = −Γ*(x) = −inf_{y≥x} Γ*(y),

where

1. S_n = Σ_{i=1}^n X_i: the sample sum;
2. Γ(θ) = ln E[e^{θX₁}]: the cumulant function;
3. Γ*(x) = sup_{θ∈ℝ} [θx − Γ(θ)]: the Legendre transform of Γ (also called the rate function).
Tie to Importance Sampling
Define an exponential change of measure IP_θ by

dIP_θ/dIP = exp( θ S_n − n Γ(θ) ),

so that

p_n := IP( S_n/n ≥ x ) = E_θ[ I(S_n/n ≥ x) exp(−θ S_n + n Γ(θ)) ].

The optimal second moment M_n²(θ, x) of the new estimator can be shown to satisfy

M_n²(θ_x, x) ≈ p_n²,   where Γ*(x) = θ_x x − Γ(θ_x).

Under the optimal measure, the event is no longer rare! (Note: E_{θ_x}[S_n/n] = x.)
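For Gaussian X_i the tilting is explicit: Γ(θ) = θ²/2, so θ_x = x and the tilted law is N(x, 1). This sketch (ours) estimates a probability of order 10⁻⁵⁰ with 10⁵ samples and compares with the exact Gaussian tail:

```python
import numpy as np
from math import erfc, sqrt

# Exponential tilting for P(S_n/n >= x), X_i ~ N(0,1): under the optimal
# tilt theta_x = x the samples are drawn from N(x, 1) and the event
# {S_n/n >= x} becomes typical.
n, x, n_sims = 25, 3.0, 100_000
rng = np.random.default_rng(7)
X = rng.standard_normal((n_sims, n)) + x        # samples under IP_theta
S = X.sum(axis=1)
weights = np.exp(-x * S + n * x * x / 2.0)      # dIP/dIP_theta
est = np.mean((S >= n * x) * weights)
exact = 0.5 * erfc(x * sqrt(n) / sqrt(2.0))     # P(N(0, n) >= n x)
print(est / exact)    # close to 1
```

A plain Monte Carlo estimator would return 0 here for any feasible sample size; the tilted one has relative error of about a percent.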
Large Deviation Principle (LDP)
A X -valued seq. {Z ε } ε defined on (Ω, F , IP ) satisfies a LDP with the rate function I if (1) Upper Bound: for any closed subset F of X , lim sup ε→0 ε ln IP [Z ε ∈ F ] ≤ − inf x∈F I(x) (2) Lower Bound: for any open subset G of X , lim sup ε→0 ε ln IP [Z ε ∈ G] ≥ − inf x∈G I(x) If F ⊆ X s.t. inf x∈F 0 I(x) = inf x∈ ¯ F := I F , then
lim sup
ε→0
ε ln IP [Z ε ∈ F ] = −I F .
Freidlin-Wentzell Theorem
The solution of

dX_t^ε = b(X_t^ε) dt + √ε σ(X_t^ε) dW_t,   X₀^ε = x,

satisfies an LDP with rate function

I(f) = (1/2) ∫₀ᵀ ⟨ ḟ(t) − b(f(t)), a^{−1}(f(t)) (ḟ(t) − b(f(t))) ⟩ dt

for sufficiently nice f, and I(f) = ∞ otherwise, where a(x) = σ(x) σ(x)^T.
Single-Name Default Probability Approximation

IP( inf_{0≤t≤T} S_t = S₀ e^{(μ − σ²/2)t + σW_t} ≤ B )
  = E[ I( inf_{0≤t≤T} { ε(μ − σ²/2) t + ε σ W_t } ≤ −1 ) ] := P_ε   (scaling by ln(B/S₀) = −1/ε)
  ≈ exp( −1/(2 ε² σ² T) ).   (by the F-W Theorem)
Importance Sampling: 2nd Moment Approximation

Ẽ[ I( inf_{0≤t≤T} S_t ≤ B ) e^{2h W̃_T − h²T} ],
   where S_t = S₀ e^{(μ − σ²/2 − σh)t + σW̃_t},   h = μ/σ − ln(B/S₀)/(σT),

= Ê[ I( inf_{0≤t≤T} S₀ e^{(μ − σ²/2 + σh)t + σŴ_t} ≤ B ) ] e^{h²T}

2nd Moment Approximation (Cont.)

= Ê[ I( inf_{0≤t≤T} { (ε(2μ − σ²/2) + 1/T) t + ε σ Ŵ_t } ≤ −1 ) ] × e^{(μ/σ + 1/(εσT))² T}   (scaling by ln(B/S₀) = −1/ε)

:= M_ε² ≈ exp( −1/(ε² σ² T) ).   (by the F-W Theorem)

Theorem: Since M_ε² ≈ (P_ε)², we observe the optimality of the chosen measure.
The Optimal Variance Reduction: A Numerical Evidence

[Figure: second moment of the estimator versus P_B(T), B = 15, on a log scale spanning roughly 10⁻³⁰ to 10⁵⁰.]
A Modification: Stochastic Correlation

dS_t¹ = r S_t¹ dt + σ₁ S_t¹ dW_t¹
dS_t² = r S_t² dt + σ₂ S_t² ( ρ(Y_t) dW_t¹ + √(1 − ρ²(Y_t)) dW_t² )
dY_t = (1/δ)(m − Y_t) dt + (√2 β/√δ) dZ_t

Joint default probability:

P^δ(t, x₁, x₂, y) := E_{x₁,x₂,y}[ Π_i I{ min_{t≤u≤T} S_u^i ≤ B_i } ]

In this case, the construction of our IS method fails!
Full Expansion of P^δ

Theorem:

P^δ(t, x₁, x₂, y) = Σ_{i=0}^∞ δ^i P_i(t, x₁, x₂, y),

where the P_i's can be obtained recursively and the y variable can be factored out (separates).

Proof: by means of singular perturbation techniques. Accuracy results are ensured given smoothness.
Leading Order Term

P₀(t, x₁, x₂) solves the homogenized PDE (y-independent):

( L_{1,0} + ρ̄ L_{1,1} ) P₀(t, x₁, x₂) = 0,

where ρ̄ = ⟨ρ(y)⟩, the average taken with respect to the invariant measure of Y. The differential operators are

L_{1,0} = ∂/∂t + Σ_{i=1}² (σ_i² x_i²/2) ∂²/∂x_i² + Σ_{i=1}² μ_i x_i ∂/∂x_i,
L_{1,1} = σ₁ σ₂ x₁ x₂ ∂²/(∂x₁ ∂x₂).
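The invariant law of the OU factor dY = (1/δ)(m − Y)dt + (√2 β/√δ)dZ is N(m, β²), independently of δ, so ρ̄ = ⟨ρ(y)⟩ is a one-dimensional Gaussian average. A sketch (ours, reading the slide parameter ν = 0.5 as β) computes ρ̄ for ρ(y) = |sin(y)| two ways:

```python
import numpy as np

m, beta = np.pi / 4.0, 0.5            # slide parameters; nu = 0.5 read as beta
rho = lambda y: np.abs(np.sin(y))

# Riemann sum for <rho(y)> against the N(m, beta^2) density
y = np.linspace(m - 8.0 * beta, m + 8.0 * beta, 20_001)
pdf = np.exp(-((y - m) ** 2) / (2.0 * beta**2)) / (beta * np.sqrt(2.0 * np.pi))
rho_bar = np.sum(rho(y) * pdf) * (y[1] - y[0])

# Monte Carlo check: sample the invariant law directly
rng = np.random.default_rng(8)
rho_bar_mc = rho(rng.normal(m, beta, 200_000)).mean()
print(rho_bar, rho_bar_mc)            # agree to ~1e-3
```

This ρ̄ is the only trace of the fast factor left in the leading-order PDE, which is what makes the homogenized IS construction usable again.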
Other Terms

P_{n+1}(t, x₁, x₂, y) = Σ_{i+j=n+1, i≥0, j≥1} φ_{i,j}^{(n+1)}(y) L_{1,0}^i L_{1,1}^j P_n,

where a sequence of Poisson equations has to be solved:

L₀ φ_{i+1,j}^{(n+1)}(y) = φ_{i,j}^{(n)}(y) − ⟨φ_{i,j}^{(n)}(y)⟩,
L₀ φ_{i,j+1}^{(n+1)}(y) = ρ(y) φ_{i,j}^{(n)}(y) − ⟨ρ φ_{i,j}^{(n)}⟩,

with L₀ = β² ∂²/∂y² + (m − y) ∂/∂y.
Numerical Result I: Stochastic Correlation

α = 1/δ | BMC               | Importance Sampling
0.1     | 0.0037 (6 × 10⁻⁴) | 0.0032 (1 × 10⁻⁴)
1       | 0.0074 (9 × 10⁻⁴) | 0.0065 (2 × 10⁻⁴)
10      | 0.0112 (1 × 10⁻³) | 0.0116 (4 × 10⁻⁴)
50      | 0.0163 (1 × 10⁻³) | 0.0137 (5 × 10⁻⁴)
100     | 0.016 (1 × 10⁻³)  | 0.0132 (4 × 10⁻⁴)

Parameters are S₁₀ = S₂₀ = 100, B₁ = 50, B₂ = 40, m = π/4, ν = 0.5, ρ(y) = |sin(y)|.

Using homogenization in IS; note the effect of correlation.
Numerical Result II: Stochastic Correlation

α = 1/δ | BMC                 | Importance Sampling
0.1     | 0 (0)               | 9.1 × 10⁻⁷ (7 × 10⁻⁸)
1       | 0 (0)               | 7.5 × 10⁻⁶ (6 × 10⁻⁷)
10      | 0 (0)               | 2.4 × 10⁻⁵ (2 × 10⁻⁶)
50      | 1 × 10⁻⁴ (1 × 10⁻⁴) | 2.9 × 10⁻⁵ (3 × 10⁻⁶)
100     | 1 × 10⁻⁴ (1 × 10⁻⁴) | 2.7 × 10⁻⁵ (2 × 10⁻⁶)

Parameters are S₁₀ = S₂₀ = 100, B₁ = 30, B₂ = 20, m = π/4, ν = 0.5.

Note the effect of correlation.
Conclusion
• Credit risk models are introduced.
• A simple yet efficient importance sampling method is proposed, justified by large deviations theory.
• Full expansion of the joint default probability under stochastic correlation, and its application to importance sampling.
Future Works
• Generalization to stochastic volatility models.
• Risk management of credit portfolios.
• Similar variance analysis for Gaussian copula models.
• Homogenization in Large Deviations.
Acknowledgment
• S.-J. Sheu, N.-R. Shieh, Doug Vestal.
(by name order)
• NCTS (Taipei Office)
• TIMS, NTU.
• NSC.
Thank You!
Weak Brownian Motion and its Applications
Wu, Ching-Tang
Department of Applied Mathematics, National Chiao Tung University

July 10, 2008, National Taiwan University
Outline

1. Motivation
2. Weak Brownian Motions
3. Martingale Marginal Property
4. Wiener Chaos
5. Future Works
6. References
Motivation

Pricing Formula

In a financial model with interest rate 0, stock price process (S_t) and risk-neutral probability measure P*, the price of a European call option at time 0 is given by

π(K, T) = E*[(S_T − K)⁺],   (1)

where T is the maturity and K is the strike price.

Two methods to discuss it: stochastic analysis and dynamic analysis.
Motivation

Relation between π and S_t

Method 1: Stochastic Analysis

u_t: the marginal law of S_t under P*. Then π is determined by the distribution function and the partial moment, and

π_xx(·, t) = u_t, with density function p(·, t).

Method 2: Dynamic Analysis

Suppose S_t satisfies

dS_t = S_t (b_t dt + σ(S_t, t) dW_t);

then we have the Dupire equation

π_t = (1/2) x² σ²(x, t) π_xx − x σ_t π_x.   (2)

Solve it!
Motivation

Another Point of View

Consider

π(K, T) = E*[(S_T − K)⁺],

where (S_t) is a martingale with respect to P*.

Breeden and Litzenberger (1978) and Dupire (1997) show that

P*(S_T > K) = −(∂/∂K⁺) π(K, T),

where (∂/∂K⁺) π(K, t) means the right-derivative of π with respect to K.

Question: Does there exist a stochastic process whose marginal (or k-marginal) is identical to the marginal of (S_t)?
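The Breeden-Litzenberger identity is easy to verify in any model with a closed-form call price. A sketch (ours), under a zero-rate Black-Scholes model where P*(S_T > K) = Φ(d₂):

```python
from math import erf, log, sqrt

Phi = lambda x: 0.5 * (1.0 + erf(x / sqrt(2.0)))

S0, sigma, T, K = 100.0, 0.2, 1.0, 95.0   # illustrative parameters

def call(K):
    """Black-Scholes call price with zero interest rate."""
    d1 = (log(S0 / K) + 0.5 * sigma**2 * T) / (sigma * sqrt(T))
    d2 = d1 - sigma * sqrt(T)
    return S0 * Phi(d1) - K * Phi(d2)

eps = 1e-5
fd = -(call(K + eps) - call(K - eps)) / (2 * eps)   # -d pi / dK
d2 = (log(S0 / K) - 0.5 * sigma**2 * T) / (sigma * sqrt(T))
print(fd, Phi(d2))   # both equal P*(S_T > K)
```

The finite difference of the pricing function recovers the risk-neutral tail probability, i.e. the marginal law of S_T, which is exactly the data the question above starts from.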
Motivation

Stoyanov's Conjecture

There exists a stochastic process X with X₀ = 0 satisfying:

1. X_t − X_s ∼ N(0, t − s) for all s < t;
2. X_{t₂} − X_{t₁} and X_{t₄} − X_{t₃} are independent for 0 ≤ t₁ < t₂ ≤ t₃ < t₄;

but X is not a Brownian motion.

Thus, we aim to see if there exists a stochastic process whose marginal (or k-marginal) is identical to the marginal of a Brownian motion, but which is not a Brownian motion.
Weak Brownian Motions

Definition

A stochastic process X is called a weak Brownian motion of order k if, for all (t₁, t₂, ..., t_k),

(X_{t₁}, X_{t₂}, ..., X_{t_k}) (law)= (B_{t₁}, B_{t₂}, ..., B_{t_k}),

where B is a Brownian motion.

Another formulation:

E[f₁(X_{t₁}) ··· f_k(X_{t_k})] = E[f₁(B_{t₁}) ··· f_k(B_{t_k})]

for f₁, ..., f_k ∈ C₀¹(ℝ).

Stoyanov's Conjecture: There exists a weak Brownian motion of order 4 which differs from Brownian motion.
Weak Brownian Motions

Main Results

Theorem (Föllmer-W.-Yor (2000)): Let k ∈ ℕ. There exists a process (X_t)_{0≤t≤1} which is not a Brownian motion such that the k-dimensional marginals of X are identical to those of Brownian motion.

Theorem: For every ε > 0, there exists a probability measure Q ≠ P on C([0, 1]) such that either

1. Q ≈ P, or
2. Q ⊥ P,

and which satisfies

Q = P on F_J = σ(X_t : t ∈ J)

for any J ⊆ [0, 1] such that J^c contains some interval of length ε.
Weak Brownian Motions

Properties

Proposition: Let X be a weak Brownian motion of order k.

1. If k ≥ 2, then X has a continuous version. Moreover, if X is a Gaussian process, then X is a Brownian motion.
2. If k ≥ 4, then ⟨X⟩_t = t. Moreover, if X is a martingale, then X is a Brownian motion.

Remark: A weak Brownian motion may not be a martingale, e.g.,

X_t = { W_t,                          t ≤ 1/2,
      { W_{1/2} + (√2 − 1) W_{t−1/2}, t > 1/2.
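The example in the remark can be probed numerically: each X_t is Gaussian with variance t (Brownian one-dimensional marginals), but the covariance at two times differs from min(s, t), so X is not a Brownian motion. A sketch (ours) with s = 0.4, t = 0.9, for which t − 1/2 = s:

```python
import numpy as np

rng = np.random.default_rng(9)
n = 400_000
s, t = 0.4, 0.9                        # s <= 1/2 < t, and t - 1/2 = 0.4 = s
c = np.sqrt(2.0) - 1.0
W_s = np.sqrt(s) * rng.standard_normal(n)                  # W_{0.4}
W_half = W_s + np.sqrt(0.5 - s) * rng.standard_normal(n)   # W_{0.5}
X_s = W_s                              # on [0, 1/2], X is W itself
X_t = W_half + c * W_s                 # X_t = W_{1/2} + (sqrt(2)-1) W_{t-1/2}
print(np.var(X_t))                     # ~ t = 0.9 : Brownian marginal
print(np.cov(X_s, X_t)[0, 1])          # ~ 0.4*sqrt(2) != min(s, t) = 0.4
```

Analytically, Var(X_t) = 1/2 + c²·0.4 + 2c·0.4 = 0.9 = t, while Cov(X_s, X_t) = 0.4 + c·0.4 = 0.4√2, confirming both claims.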
Weak Brownian Motions

Itô Integral

If X is a continuous weak Brownian motion of order k ≥ 1 whose paths have quadratic variation ⟨X⟩_t = t, then the Itô integral

∫₀ᵗ f(X_s) dX_s

exists as a pathwise limit of non-anticipating Riemann sums along dyadic partitions for any bounded f ∈ C¹, and satisfies Itô's formula even though X may not be a semimartingale; see Föllmer (1981). Moreover,

E[ ∫₀ᵗ f(X_s) dX_s ] = 0,

which may be viewed as a weak form of the martingale property.
Weak Brownian Motions

Characterization

1. W is a Brownian motion if and only if there exist an orthonormal basis (ϕ_n) of L²([0, 1]) and a sequence of i.i.d. N(0, 1)-distributed random variables (ξ_n) such that

W_t = Σ_{n=1}^∞ ( ∫₀ᵗ ϕ_n(u) du ) ξ_n.

2. X is a weak Brownian motion of order k if and only if there exist an orthonormal basis (ϕ_n) of L²([0, 1]) and a sequence of uncorrelated N(0, 1)-distributed random variables (η_n) such that

Σ_{n=1}^∞ ( λ₁ ∫₀^{t₁} ϕ_n(u) du + ··· + λ_k ∫₀^{t_k} ϕ_n(u) du ) η_n

is Gaussian for all λ₁, ..., λ_k ∈ ℝ, t₁ ≤ t₂ ≤ ··· ≤ t_k, and

X_t = Σ_{n=1}^∞ ( ∫₀ᵗ ϕ_n(u) du ) η_n.
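Item 1 is the Karhunen-Loève construction of Brownian motion. A truncated sketch (ours), using the orthonormal basis ϕ_n(u) = √2 cos((n − 1/2)πu) of L²([0, 1]), checks that Var(W_t) = Σ_n (∫₀ᵗ ϕ_n)² ≈ t:

```python
import numpy as np

rng = np.random.default_rng(10)
n_terms, n_samples, t = 200, 20_000, 0.7
n = np.arange(1, n_terms + 1)
# int_0^t phi_n(u) du for phi_n(u) = sqrt(2) cos((n - 1/2) pi u)
coef = np.sqrt(2.0) * np.sin((n - 0.5) * np.pi * t) / ((n - 0.5) * np.pi)
xi = rng.standard_normal((n_samples, n_terms))   # i.i.d. N(0,1)
W_t = xi @ coef                                  # truncated W_t = sum coef_n xi_n
print(np.sum(coef**2), np.var(W_t))              # both ~ t = 0.7
```

Replacing the i.i.d. (ξ_n) by merely uncorrelated Gaussian-in-combination (η_n), as in item 2, is exactly the freedom that produces weak Brownian motions of finite order.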
Martingale Marginal Property

Martingale Marginal

Definition: The family of densities Q = {q(x, t) : t > 0} has the martingale marginal property if there exists a probability space on which one may define a martingale (M_t) such that for every t, the law of M_t is given by the density q(·, t).

Theorem (Strassen (1965)): A family of probability measures (μ_n)_{n≥0} has the martingale marginal property if and only if for all n ≥ 0,

∫ |x| μ_n(dx) < ∞,

and for any concave μ_n-integrable function ψ, the sequence

∫ ψ(x) μ_n(dx)

is non-increasing (the values of the integrals may be −∞).
Martingale Marginal Property

Generalizations

Remark: Doob (1968) proved the continuous version of the above result.

Theorem (Rothschild and Stiglitz (1970, 1971)): {q(x, t) : t > 0} has the martingale marginal property if and only if for all K and for all T₁ ≤ T₂,

∫₀^∞ S q(S, T₂) dS ≤ ∫₀^∞ S q(S, T₁) dS,
∫₀^∞ (S − K)⁺ q(S, T₂) dS ≥ ∫₀^∞ (S − K)⁺ q(S, T₁) dS.

Remark: This concept is related to stochastic orders; see Föllmer and Schied (2004).
Martingale Marginal Property

Markov Martingales

Question: Given a family of densities {q(x, t) : t > 0}, does there exist a probability space on which one can define a Markov martingale (M_t) such that for every t, the law of M_t is given by the density q(·, t)?

Theorem (Kellerer (1972)):

1. Let {q(x, t) : t > 0} be a family of marginal densities with finite first moment such that for s < t,

∫ f(x) q(x, t) dx ≥ ∫ f(x) q(x, s) dx

for all convex non-decreasing functions f. Then there exists a Markov submartingale (M_t) with marginal densities {q(·, t) : t > 0}.

2. Furthermore, if the means are independent of t, then (M_t) is a Markov martingale.
Martingale Marginal Property

Constructions

Define the family of barycentre functions

ψ(x, t) = ( ∫ₓ^∞ y q(y, t) dy ) / ( ∫ₓ^∞ q(y, t) dy ).

Suppose ψ(x, t) is increasing in t and q(x, t) is a family of zero-mean densities.

Theorem (Madan and Yor (2002)): Let (B_t) be a standard Brownian motion. Define a stopping time

τ_t = inf{ s : sup_{0≤u≤s} B_u ≥ ψ(B_s, t) }.

Then M_t := B_{τ_t} is an inhomogeneous Markov martingale with density q.
Wiener Chaos

Consequence of the Main Results

Let X represent the coordinate process and L²(P) = L²(C([0, 1]), P).

Notation: For every k ∈ ℕ, define

Π_k := { Π_{i=1}^k f_i(X_{t_i}) : t₁ < ··· < t_k ≤ 1, f_i bounded, Borel measurable }.

Corollary: For every k, Π_k is not total in L²(P).
Wiener Chaos

Decomposition

Notation: For every n ∈ ℕ, define

K₀ := Π₀ = ℝ,   K_{n+1} = Π̄_{n+1} ∩ Π̄_n^⊥,

where ⊥ denotes the orthogonality relation in L²(P).

Lemma:

L²(P) = ⊕_{n=0}^∞ K_n.

Remark: K_n is called the n-th time-space Wiener chaos.