• 沒有找到結果。

National Center for Theoretical Sciences (NCTS) at the National Tsing Hua University

STOCHASTIC DIFFERENTIAL EQUATIONS ARE BECOMING INCREASINGLY MORE POPULAR

(1) Random effects in physical, biological, and financial problems can be modeled using stochastic differential equations.

(2) Stochastic models are considered to be more realistic for many prob-lems.

(3) However, the development of accurate stochastic differential equation models is not well understood.

(4) In this series of lectures, stochastic differential equations are introduced, numerical methods are explained, and a procedure is described for deriving stochastic differential equations.

THESE LECTURES ARE DIVIDED INTO FOUR PARTS

Part I: Random variables and stochastic processes are reviewed and sto-chastic integrals are introduced.

Part II: The theory and approximation of Itˆo stochastic differential equa-tions (SDEs) are studied.

Part III: A procedure is explained for deriving accurate SDE models. SDEs are derived for several problems in biology, physics, and finance.

Part IV: The derivation procedure is extended to stochastic partial dif-ferential equations (SPDEs). SPDEs are derived for several problems in physics and biology.

STOCHASTIC DIFFERENTIAL EQUATIONS

It is proved that a unique solution to an SDE exists in Hilbert space HSP. Several properties of stochastic differential equations are derived.

Exact solutions and moments of the solution are found.

Numerical methods for approximating solutions of SDEs are described.

The forward Kolmogorov partial differential equation is derived.

A procedure is described for estimating parameters in an SDE.

IT ˆO STOCHASTIC DIFFERENTIAL EQUATIONS We study

X(t, ω) = X(0, ω) + Z t

0

f (s, X(s, ω)) ds + Z t

0

g(s, X(s, ω)) dW (s, ω) for 0 ≤ t ≤ T where X(0, ·) ∈ HRV or in differential form

dX(t, ω) = f (t, X(t, ω)) dt + g(t, X(t, ω)) dW (t, ω).

f is called the drift coefficient and g is called the diffusion coefficient.

It is assumed f and g satisfy:

Condition (c6): |f(t, x)−f(s, y)|2 ≤ k(|t−s|+|x−y|2) for 0 ≤ s, t ≤ T and x, y ∈ R.

Condition (c7): |f(t, x)|2 ≤ k(1 + |x|2) for 0 ≤ t ≤ T and x ∈ R.

PROPERTIES OF f AND g THAT SATISFY (c6) AND (c7)

Let u(t) = f (t, X(t)). Then, (c7) implies that u ∈ HSP when X ∈ HSP and, of course, the same holds for function g. Indeed, for X ∈ HSP,

kuk2SP = Z T

0 E|f(t, X(t)|2dt ≤ Z T

0 k(1 + E|X(t)|2) dt ≤ kT + kkXk2SP.

Condition (c6) implies that kf(t1, X) −f(t1, Y )kSP < ǫ when kX −Y kSP < ǫ/k1/2 and kf(t1, X) − f(t2, X)kSP < ǫ when |t2 − t1| < ǫ2/kT .

To see the first inequality, consider kf(t1, X) − f(t1, Y )k2SP =

Z T

0 E|f(t1, X(t)) − f(t1, Y (t))|2dt

Z T

0 kE|X(t) − Y (t)|2dt = kkX − Y k2SP.

Thus, if kX − Y kSP < ǫ/k1/2, then kf(t1, X) − f(t1, Y )kSP < ǫ. To see the second inequality, consider

kf(t1, X) − f(t2, X)k2SP =

Z T

0 E|f(t1, X(t)) − f(t2, X(t))|2dt

≤ Z T

0 kE|t2 − t1| dt = kT |t2 − t1|.

Thus, if |t2 − t1| < ǫ2/kT then kf(t1, X) − f(t2, X)kSP < ǫ.

EXISTENCE OF A UNIQUE SOLUTION

We now prove existence and uniqueness for X ∈ HSP that solves the SDE when f and g satisfy conditions (c6) and (c7). To show existence and uniqueness, a Cauchy sequence of functions in HSP is constructed. The limit of this sequence is the solution.

Let X0(t) = X(0), where X(0) ∈ HRV is the given initial condition. Define This inequality follows from the following argument.

kX1 − X0k2SP =

Continuing this procedure, the sequence {Xn}n=1 ⊂ HSP is defined as

To see that this sequence is Cauchy in HSP, Xn+1(t) − Xn(t) =

Let an(t) = E|Xn+1(t) − Xn(t)|2, and then, by the previous inequality, This latter inequality implies that

an(t) ≤ Lntn−1

(n − 1)!kX1 − X0k2SP

and it follows that

kXn+1 − Xnk2SP ≤ LnTn

n! kX1 − X0k2SP.

Consider m > n. By repeated application of the triangle inequality, kXm − XnkSP ≤ kXm − Xm−1kSP + kXm−1 − Xm−2kSP +· · · + kXn+1 − XnkSP

From the previous inequality, as kX1 − X0kSP is bounded, given ǫ > 0 there exists an N > 0 such that kXn − XmkSP < ǫ when n, m > N . Hence, {Xn}n=1 is Cauchy in HSP and converges to a unique X ∈ HSP.

Now let Y ∈ HSP where Y satisfies the relation Y (t) = −X(t) + X(0) +

Z t 0

f (s, X(s)) ds + Z t

0

g(s, X(s)) dW (s).

Because

0 = −Xn(t) + X(0) + Z t

0

f (s, Xn−1(s)) ds + Z t

0

g(s, Xn−1(s)) dW (s) and kX − XnkSP → 0 as n → ∞, it is clear that kY kSP = 0.

So X(t) is the unique solution in HSP.

PROPERTIES OF SOLUTIONS OF SDEs

The first theorem implies that the solution X(t) is bounded and the second theorem states that the solution is continuous on [0, T ] in the k · kRV norm.

THEOREM: BOUNDEDNESS OF SOLUTIONS

Assume that f and g satisfy (c6) and (c7) and X ∈ HSP is the solution of

Letting a(t) = E|X2(t)| and b(t) = 3E|X2(0)| + (3t2 + 3t)k, then a(t) ≤ b(t) + (3T + 3)k

Z t 0

a(s) ds.

Using the Bellman-Gronwall inequality, it follows that a(t) ≤ b(t) + (3T + 3)k

Z t 0

exp k(3T + 3)(t − s)b(s) ds.

As b(t) is increasing on [0, T ],

a(t) ≤ b(t) + b(t)(3T + 3)k Z t

0

exp k(3T + 3)(t − s) ds.

Thus,

E|X(t)|2 ≤ 3 E|X(0)|2 + kT2 + kT exp 3k(T + T2)

for 0 ≤ t ≤ T.

THEOREM: CONTINUITY OF SOLUTIONS

Assume that f and g satisfy conditions (c6) and (c7) and X ∈ HSP is the solution of the SDE Then,

E|X(t) − X(r)|2 ≤ c|t − r| for 0 ≤ r, t ≤ T.

The previous theorem implies that there is an M > 0 such that E|X(s)|2 ≤ M for 0 ≤ s ≤ T . Using this and (c7),

THE BELLMAN-GRONWALL INEQUALITY

A useful inequality is the Bellman-Gronwall inequality:

If a(t) ≤ b(t) + c

To see this, suppose that

a(t) ≤ b(t) + c

Substituting the last inequality into (*) gives:

a(t) ≤ b(t) + c Z t

0

exp(c(t − s))b(s) ds.

IT ˆO’s FORMULA AND EXACT SOLUTIONS

Itˆo’s formula is stated for stochastic differential equations.

Itˆo’s formula is helpful in finding exact solutions to certain SDEs.

Itˆo’s formula can be used to determine exact moments of the solution for certain problems.

IT ˆO’s FORMULA

Consider the Itˆo SDE in differential form:

dX(t) = f (t, X(t)) dt + g(t, X(t)) dW (t) for 0 ≤ t ≤ T with X(0) ∈ HRV.

Let F be a smooth function so Itˆo’s formula can be applied to F (t, X). Then, F satisfies:

dF (t, X(t)) =  ∂F (t, X)

∂t + f (t, X)∂F (t, X)

∂x + 1

2g2(t, X)∂2F (t, X)

∂x2

 dt + g(t, X)∂F (t, X)

∂x dW (t).

Itˆo’s formula allows us, for example, to determine moments of the solution for certain SDEs. To find these moments, useful is the relation:

E

Z t 0

G(t, X(t)) dW (t)



= 0.

IT ˆO’s FORMULA

Before considering several examples, care must be taken with SDEs.

For example, applying Itˆo’s formula to F (t, X(t)) = X2(t) yields

d(X2(t)) = [2X(t)f (t, X(t)) + g2(t, X(t))] dt + [2X(t)g(t, X(t))] dW (t).

In particular, notice that

d(X2(t)) 6= 2X(t)dX(t) = 2X(t)[f(t, X(t)) dt + g(t, X(t)) dW (t)].

EXAMPLE: FINDING EXACT MOMENTS FOR AN SDE Consider the SDE:

dX(t) = dt + X(t) dW (t), X(0) = 0.

Then

X(t) = t + Z t

0

X(s) dW (s).

It follows that

E(X(t)) = t.

Applying Itˆo’s formula to F (t, X) = X2 yields

d(X2(t)) = [2X(t) + X2(t)] dt + 2X2(t) dW (t) so that

E(X2(t)) = E Z t

0

2X(s) + X2(s) ds.

This leads to a differential equation for E(X2(t)):

dE(X2(t))

dt = 2E(X(t)) + E(X2(t)) = 2t + E(X2(t))

with E(X2(0)) = 0. Solving this ODE gives the second moment for X(t), E(X2(t)) = −2t − 2 + 2et.

From these relations, Var(X(t)) = E(X2(t)) − (E(X(t))2 = 2et − 2 − 2t − t2.

This procedure can be continued. If Itˆo’s formula is applied to F (t, X) = X3, then

d(X3(t)) = [3X2(t) + 3X3(t)] dt + 3X3(t) dW (t).

This leads to the differential equation for E(X3(t)):

dE(X3(t))

dt = 3E(X2(t)) + 3E(X3(t)) = −6t − 6 + 6et + 3E(X3(t)) with E(X3(0)) = 0. Solving this ODE gives the third moment for X(t),

E(X3(t)) = 2t + 8

3 − 3et + 1 3e3t.

EXAMPLE: FINDING EXACT MOMENTS FOR AN SDE Consider the SDE:

dX(t) = −1

4X3(t) dt + 1

2X2(t) dW (t) with X(0) = 1 2.

In this example, E(X(t)) and E(X3(t)) are to be determined exactly.

First,

dE(X(t)) = −1

4E(X3(t)) dt with E(X(0)) = 1 2

so E(X3(t)) is needed in order to find E(X(t)). Applying Itˆo’s formula to the SDE gives

dX3(t) =



−3

4X5(t) + 3

4X5(t)



dt + 3

2X4(t) dW (t) = 3

2X4(t) dW (t) with E(X3(0)) = 1

8. Thus, E(X3(t)) = 1

8 and it follows that E(X(t)) = 1

2 − 1 32t.

EXAMPLE: FINDING EXACT MOMENTS FOR AN SDE Consider the stochastic differential equation

dX(t) =  1

3X1/3(t) + 6X2/3(t)



dt + X2/3(t) dW (t) with X(0) = 1.

In this example, we wish to determine E(X(t)) and E(X2(t)) exactly. First notice that

dE(X(t)) 6=  1

3 E(X(t))1/3

+ 6 E(X(t))2/3 dt

so an appropriate change of variables is required to find the moments. Let Yn(t) = (X(t))n/3 for n = 0, 1, 2, . . . , 6.

Next, applying Itˆo’s formula, the stochastic differentials are obtained dYn(t) =  1

Letting Zn(t) = E(Yn(t)) = E((X(t))n/3), then the initial-value system is:

dZn(t)

dt = 1

18(n2 − n)Zn−2(t) + 2nZn−1(t) for n = 1, 2, . . . , 6 with Zn(0) = 1 for n = 1, 2, . . . , 6 and Z0(t) = 1. Solving this gives

Z1(t) = E((X(t))1/3) = 2t + 1 Z2(t) = E((X(t))2/3) = 4t2 + 37

9 t + 1 Z3(t) = E((X(t)) = 8t3 + 38

3 t2 + 19

3 t + 1 Z6(t) = E((X(t))2) = 64t6 + 656

3 t5 + 2660

9 t4 + 49145

243 t3 + 665

9 t2 + 41

3 t + 1.

In particular, E(X(1)) = 28.0 and E(X2(1)) = 869.0206.

EXAMPLE: FINDING THE EXACT SOLUTION OF AN SDE Consider the stochastic differential equation

dX(t) = −αX(t) dt + σ dW (t), X(0) = X0

where α, σ, and X0 are constants. Let F (t, X) = eαtX(t). By Itˆo’s formula, d eαtX(t) = αeαtX(t) − αeαtX(t) dt + σeαtdW (t)

Thus,

eαtX(t) − X(0) = Z t

0

eαsσ dW (s).

So the exact solution is

X(t) = X(0)e−αt + e−αt Z t

0

eαsσ dW (s).

EXAMPLE: FINDING THE EXACT SOLUTION OF AN SDE Consider the stochastic differential equation

dX(t) = f (t)X(t) dt + g(t)X(t) dW (t), X(0) = X0

where X0 is a constant. For this problem, the exact solution has the form X(t) = X0exp

Z t

0 f (s) − 1

2g2(s) ds + Z t

0

g(s) dW (s)

 . To see this, let F (t, X) = ln(X(t)). Applying Itˆo’s formula,

d(ln(X(t))) = f(t) − 1

2g2(t) dt + g(t) dW (t).

Thus,

ln(X(t)) − ln(X0) = Z t

0 f (s) − 1

2g2(s) ds + Z t

0

g(s) dW (s) which yields the solution.

APPROXIMATING SDEs

The exact solution to an SDE is generally difficult to obtain so it is useful to be able to approximate the solution. Euler’s (or the Euler-Maruyama) method is a simple numerical method.

Euler’s method has the form

Xi+1(ω) = Xi(ω) + f (ti, Xi(ω))∆t + g(ti, Xi(ω))∆Wi(ω), X0(ω) = X(0, ω)

for i = 0, 1, 2, . . . , N − 1 where Xi(ω) ≈ X(ti, ω), ti = i∆t, ∆t = T /N, ∆Wi(ω) = (W (ti+1, ω) − W (ti, ω)) ∼ N(0, ∆t), and where ω indicates a sample path.

To study the error in this method, it is useful to approximate the solution for all t ∈ [0, T ] and not just at the nodal points ti. To accomplish this, X(t) ≈ X(t) is defined asˆ

X(t) = Xˆ i + Z t

ti

f (ti, Xi) ds + Z t

ti

g(ti, Xi) dW (s)

for ti ≤ t ≤ ti+1 and i = 0, 1, . . . , N − 1. Notice that ˆX is identical to Euler’s method approximation at the nodal points, that is, ˆX(ti) = Xi for i = 0, 1, . . . , N .

Notice that on the ith subinterval, ˆX(t) is the solution of the SDE

 d ˆX(t) = f (ti, Xi) dt + g(ti, Xi) dW (t), ti ≤ t ≤ ti+1 X(tˆ i) = Xi.

Recall that the solution X(t) satisfies the SDE

dX(t) = f (t, X(t)) dt + g(t, X(t)) dW (t), ti ≤ t ≤ ti+1.

Using the inequality |2ab| ≤ a2 + b2 and properties of stochastic integrals, E(ǫ2(ti+1)) ≤ E(ǫ2(ti)) +

Z ti+1

ti E(X(t) − ˆX(t))2dt +

Z ti+1

ti E(f (t, X(t)) − f(ti, Xi))2dt +

Z ti+1

ti E(g(t, X(t)) − g(ti, Xi))2dt.

But

|f(t, X(t)) − f(ti, Xi)|2 ≤ 2|f(t, X(t)) − f(ti, X(ti))|2 +2|f(ti, X(ti)) − f(ti, Xi)|2

≤ 2k|t − ti| + 2k|X(t) − X(ti)|2 + 2k|X(ti) − Xi|2 and similarly for g using property (c6). Hence,

E(ǫ2(ti+1)) ≤ E(ǫ2(ti)) +

Z ti+1

ti E(X(t) − ˆX(t))2dt +4k(1 + c)

Z ti+1 ti

(t − ti) dt + 4k

Z ti+1 ti

E(ǫ2(ti)) dt using E|X(t) − X(ti)|2 ≤ c|t − ti|.

Therefore,

E(ǫ2(ti+1)) ≤ E(ǫ2(ti))(1 + 4k∆t) + 2k(1 + c)(∆t)2 +

Z ti+1 ti

E(ǫ2(s)) ds.

By Bellman-Gronwall inequality with b(t) = E(ǫ2(ti))(1 + 4k∆t) +2k(1 + c)(∆t)2, E(ǫ2(ti+1)) ≤ E(ǫ2(ti))(1 + 4k∆t) + 2k(1 + c)(∆t)2

+

Z ti+1 ti

e(ti+1−t)E(ǫ2(ti))(1 + 4k∆t) + 2k(1 + c)(∆t)2 dt

= e∆tE(ǫ2(ti))(1 + 4k∆t) + 2k(1 + c)(∆t)2 .

Letting ai = E(ǫ2(ti)), R = e∆t(1 + 4k∆t), and S = e∆t2k(1 + c)(∆t)2, then ai+1 ≤ Rai + S for i = 0, 1, 2, . . . , N − 1.

These inequalities yield

aN ≤ SRN − 1

R − 1 with a0 = E(ǫ2(0)) = 0.

Hence,

E(ǫ2(tN)) ≤ e∆t2k(1 + c)(∆t)2eN ∆te4kN ∆t

e∆t − 1 + e∆t4k∆t ≤ ∆t(1 + c)e(1+4k)T

2 .

This holds for any nodal point and the mean square error satisfies E|X(ti) − Xi|2 ≤ ˆc∆t

for i = 0, 1, 2, . . . , N where ˆc = 12(1 + c)e(1+4k)T.

MILSTEIN’S METHOD

Higher order numerical methods are possible for SDEs which are similar in some respects to higher order methods for ODEs. For example, there are explicit or implicit multistep methods and Runge-Kutta methods.

A popular second-order method is Milstein’s method and has mean square error proportional to (∆t)2 rather than ∆t. Milstein’s method has the form

Xi+1(ω) = Xi(ω) + f (ti, Xi(ω))∆t + g(ti, Xi(ω))∆Wi(ω) +1

2g(ti, Xi(ω))∂g(ti, Xi(ω))

∂x [(∆Wi(ω))2 − ∆t]

for i = 0, 1, 2, . . . , N − 1 with X0(ω) = X(0, ω), where Xi(ω) ≈ X(ti, ω), ∆Wi(ω) = (W (ti+1, ω) − W (ti, ω)) ∼ N(0, ∆t), ti = i∆t, ∆t = T /N, and where ω indicates a sample path.

Milstein’s method has an additional term at each step in comparison with Euler’s method.

EXAMPLE: APPROXIMATION OF AN SDE BY EULER AND MIL-STEIN

Consider the stochastic differential equation dX(t) =  1

3X1/3(t) + 6X2/3(t)



dt + X2/3(t) dW (t), X(0) = 1.

It was shown earlier that E(X(1)) = 28.0 and E(X2(1)) = 869.0206. For this problem, Euler’s method has the form:

Xi+1 = Xi + 1

3Xi1/3 + 6Xi2/3



∆t + Xi2/3

∆t ηi where ηi ∼ N(0, 1)

for i = 0, 1, 2, . . . , N − 1 with X0 = 1, ti = i∆t, and ∆t = 1/N . Milstein’s method has the form

Xi+1 = Xi + 1

3Xi1/3 + 6Xi2/3



∆t + Xi2/3

∆t ηi + 1

3Xi1/3i2 − 1)∆t where ηi ∼ N(0, 1).

The calculational results for the mean square error E|X(1) − XN|2 are given in the table for 10,000 sample paths for each value of N .

Value of N Euler Error Milstein Error 29 2.80× 10−2 1.61× 10−2

210 1.04× 10−2 4.03× 10−3 211 4.20× 10−3 1.01× 10−3 212 1.89× 10−3 2.53× 10−4 213 8.76× 10−4 6.24× 10−5 214 4.12× 10−4 1.60× 10−5

Notice that the mean square errors are approximately proportional to ∆t = 1/N for Euler’s method and to (∆t)2 = 1/N2 for Milstein’s method.

Next, for this example, E(X(1)) and E(X(1))2 were estimated using E(X(1)) ≈ P100,000

j=1 XN(j)/100, 000 and E(X(1))2 ≈ P100,000

j=1 (XN(j))2/100, 000 where XN(j) is the es-timate of X(1) for the jth sample path using N intervals.

In the table, the errors are given in parentheses. Recall that E(X(1)) = 28.0 and E(X2(1)) = 869.0206 are the exact values. Notice that the errors in the mean values are proportional to ∆t for either numerical method. In particular, the errors in Euler’s method when estimating mean values are proportional to ∆t rather than (∆t)1/2.

Value of N Euler Estimate Milstein Estimate 26 27.07 (0.93) 27.08 (0.92)

27 27.56 (0.44) 27.56 (0.44) 28 27.79 (0.21) 27.79 (0.21)

Value of N Euler Estimate Milstein Estimate 26 810.15 (58.87) 810.18 (58.84) 27 840.89 (28.13) 840.93 (28.09) 28 855.33 (13.69) 855.31 (13.71)

0 0.2 0.4 0.6 0.8 1 0

5 10 15 20 25 30

Time

Mean and One Sample Path

Figure 1: Mean solution and one sample path

The mean and one sample path are plotted in the figure for this problem.

STRONG AND WEAK APPROXIMATIONS

This example illustrates that are two kinds of approximation commonly discussed in computational solution of SDEs.

A numerical method is said to be a strong approximation of order γ if kX(T ) − XNkRV ≤ c(∆t)γ

where X(T ) is the exact solution at time T and XN is the approximate solution using step length ∆t = T /N .

Euler’s method and Milstein’s methods have strong orders 12 and 1.

However, if expectations of functions of a solution to an SDE are desired and not necessarily the pathwise approximation provided by a strong ap-proximation, then a weak numerical method may be sufficient.

An approximation XN is said to converge weakly with order β if

|E(F (X(T ))) − E(F (XN))| ≤ c(∆t)β

for all smooth functions F , where ∆t = T /N is the step size.

Euler’s method and Milstein’s method both have weak order 1.

RICHARDSON EXTRAPOLATION

Both Euler’s or Milstein’s method have weak-error expansions of the cor-rect form for applying Richardson extrapolation.

The weak error for Euler’s or Milstein’s method has the form E(F (X(T ))) − E(F (XN)) = c1∆t + c2(∆t)2 + c3(∆t)3 + . . . , where ∆t = T /N and c1, c2, c3, . . . are independent of ∆t.

So, several approximations with different values of N can be applied to obtain a higher order approximation. Suppose that E(F (XN)), E(F (X2N)), and E(F (X4N)) are three approximations to E(F (X(T ))) using step lengths of T /N , T /2N , and T /4N in Euler’s method or in Milstein’s method.

To obtain an approximation to E(F (X(T ))) of order (∆t)2, let

E(F (X(T ))) − 2E(F (X2N)) − E(F (XN)) = ˆc2(∆t)2 + ˆc3(∆t)3 + . . . . To obtain an approximation to E(F (X(T ))) of order (∆t)3, let

E(F (X(T ))) −8E(F (X4N)) − 6E(F (X2N)) + E(F (XN))/3 = ˜c3(∆t)3 + . . . .

EXAMPLE: RICHARDSON EXTRAPOLATION

Referring to the values in for the previous example, the following approxi-mations to E((X(1))2) are obtained using Euler’s method:

E((X64)2) = 810.15, E((X128)2) = 840.89, and E((X256)2) = 855.33.

To obtain O((∆t)2) and O((∆t)3) approximations, respectively, to E((X(1))2) we calculate

2E((X128)2) − E((X64)2) = 871.63 and

[8E((X256)2) − 6E((X128)2) + E((X64)2)]/3 = 869.15.

As E((X(1))2) = 869.02 exactly, the original Euler approximations are much improved through extrapolation.

STRONG APPROXIMATIONS ARE WEAK APPROXIMATIONS

Any strong approximation is also a weak approximation as the Lyapunov inequality gives

|E(F (X(T ))) − E(F (XN))| ≤ (E|(F (X(T ))) − (F (XN))|2)1/2

≤ L(E|X(T ) − XN|2)1/2 = LkX(t) − XNkRV assuming that F satisfies a Lipschitz condition.

However, there are weak methods which are not strong approximations.

Consider the discrete process described earlier. For a particular trajectory, suppose that at time ti, Xi = mδ for some integer m, where δ > 0 is small.

Define the three possibilities at time ti+1 = ti + ∆t as

Xi+1 = Xi + δ with probability r(ti, Xi)∆t/δ2,

Xi+1 = Xi with probability 1 − r(ti, Xi)∆t/δ2 − s(ti, Xi)∆t/δ2, Xi+1 = Xi − δ with probability s(ti, Xi)∆t/δ2,

where

 r(ti, Xi) = f (ti, Xi)δ + g2(ti, Xi)/2 s(ti, Xi) = − f(ti, Xi)δ + g2(ti, Xi)/2.

The probability distribution of XN approaches that of X(T ) as ∆t, δ → 0 implying that E(F (XN)) ≈ E(F (X(T ))) for small values of ∆t and δ.

Apply the weak method to the previous problem. The calculational re-sults are presented using 100,000 sample paths for estimating E(X(1)) and E(X(1))2. Recalling that E(X(1)) = 28.00 and E(X(1))2 = 869.02, the calcula-tional results are reasonable.

Value of N E(X(1)) Estimate E(X(1))2 Estimate

212 17.04 292.42

213 24.64 630.81

214 27.88 854.12

215 27.97 868.12

SYSTEMS OF STOCHASTIC DIFFERENTIAL EQUATIONS

Itˆo’s formula and numerical methods can be extended to systems. Let X(t, ω) = [X~ 1(t, ω), X2(t, ω), . . . , Xd(t, ω)]T

W (t, ω) = [W~ 1(t, ω), W2(t, ω), . . . , Wm(t, ω)]T f : [0, T ] × R~ d → Rd and g : [0, T ] × Rd → Rd×m, where Wi(t, ω), 1 ≤ i ≤ m are independent Wiener processes.

Then a system of stochastic differential equations has the form d ~X(t, ω) = ~f (t, ~X(t, ω)) dt + g(t, ~X(t, ω)) d ~W (t, ω).

In component form, the system is Xi(t) = Xi(0) +

Z t 0

fi(s, ~X(s)) ds +

m

X

j=1

Z t 0

gi,j(s, ~X(s)) dWj(s) for i = 1, 2, . . . , d.

IT ˆO’s FORMULA FOR SYSTEMS Let

F : [0, T ] × R~ d → Rk and let ~Y (t, ω) = ~F (t, ~X(t, ω)).

Then the pth component of ~Y (t, ω) satisfies:

dYp(t) =

EXAMPLE: IT ˆO’s FORMULA FOR A PROBLEM WITH d = 1 AND m = 2 Consider the SDE:

 dX(t) = t2X(t) dt + t dW1(t) + X(t) dW2(t), 0 ≤ t ≤ T X(0) = 1,

where d = 1 and m = 2.

For this problem, f1 = t2X, g1,1 = t, and g1,2 = X.

Consider using Itˆo’s formula to find the SDE for F = X2. Applying Itˆo’s formula,

 d(X2(t)) = 2t2X2(t) + t2 + X2(t) dt + 2tX(t) dW1(t) + 2X2(t) dW2(t) X2(0) = 1.

EULER’S AND MILSTEIN’S METHODS FOR SYSTEMS Euler’s method for systems has the form

X~n+1(ω) = ~Xn(ω) + ~f (tn, ~Xn(ω))∆t + g(tn, ~Xn(ω))∆ ~Wn(ω)

for n = 0, 1, 2, . . . , N , where ~Xn(ω) ≈ ~X(tn, ω), ∆t = T /N , ∆ ~Wn = ~W (tn+1)− ~W (tn).

In component form, Euler’s method is:

Xi,n+1(ω) = Xi,n(ω) + fi(tn, ~Xn(ω))∆t +

m

X

j=1

gi,j(tn, ~Xn(ω))∆Wj,n(ω) for i = 1, 2, . . . , d, where ∆Wj,n ∼ N(0, ∆t).

Milstein’s method for multidimensional SDEs involves the double stochastic integral

In(j1, j2) =

Z tn+∆t tn

Z s tn

dWj1(r) dWj2(s).

Milstein’s method has the componentwise form Xi,n+1(ω) = Xi,n(ω) + fi(tn, ~Xn(ω))∆t +

m

X

j=1

gi,j(tn, ~Xn(ω))∆Wj,n(ω)

+

m

X

j1=1 m

X

j2=1 d

X

l=1

gl,j1∂gi,j2

∂xl In(j1, j2) for i = 1, 2, . . . , d.

EXAMPLE: APPROXIMATION OF AN SDE WITH d = 1 AND m = 2 Consider the SDE;

 dX(t) = t2X(t)dt + tdW1(t) + X(t)dW2(t), 0 ≤ t ≤ T X(0) = 1,

where d = 1 and m = 2.

For this problem, Euler’s method has the form

 Xn+1 = Xn + t2nXn∆t + tn∆W1,n + Xn∆W2,n

X0 = 1,

for n = 0, 1, 2, . . . , where ∆W1,n, ∆W2,n ∼ N(0, ∆t) and tn = n∆t.

Milstein’s method has the form

 Xn+1 = Xn + t2nXn∆t + tn∆W1,n + Xn∆W2,n + tnIn(1, 2) + XnIn(2, 2) X0 = 1,

for n = 0, 1, 2, . . . .

It is useful to note that In(j1, j1) =

Z tn+∆t tn

Z s tn

dWj1(r) dWj1(s) = 1

2 (∆Wj1,n)2 − ∆t

but In(j1, j2) for j1 6= j2 does not have an analytical form and must be ap-proximated.

This multiple integral can be approximated by a Fourier series expansion.

Also, if [tn, tn+1] is divided into M equal intervals with tj,n = tn + j∆t/M for j = 0, 1, . . . , M, then

In(j1, j2) ≈ ˜In(j1, j2) =

M −1X

j=0

[Wj1(tj,n)− Wj1(t0,n)][Wj2(tj+1,n) − Wj2(tj,n)].

It can be shown that E|In − ˜In|2 = (∆t)2/(2M ).

FORWARD KOLMOGOROV (FOKKER-PLANCK) EQUATION

The probability distribution of solutions to a discrete-valued continuous stochastic process satisfies a system of differential equations called the for-ward Kolmogorov equations. An analogous result holds for the probability distribution of solutions to an SDE.

Consider the stochastic differential equation

dX(t) = f (t, X(t)) dt + g(t, X(t)) dW (t)

Let p(t, x) be the probability density for solutions to the SDE. The previous

Integrating by parts the right-hand side yields the relation Z

As the above integral holds for every function F ∈ C0 (R), this implies that

∂p(t, x)

This equation is the forward Kolmogorov equation or Fokker-Planck equa-tion for the probability distribuequa-tion of soluequa-tions to the SDE.

Also, the forward Kolmogorov equation for a system of SDEs is:

∂p(t, ~x)

EXAMPLE: SOLUTION OF A FORWARD KOLMOGOROV EQUATION Consider the stochastic differential equation

 dX(t) = a dt + b dW (t) X(0) = x0.

The probability density of the solutions satisfies the forward Kolmogorov equation

∂p(t, x)

∂t = −∂(ap(t, x))

∂x + b2 2

2(p(t, x))

2x p(0, x) = δ(x − x0).

The solution to this partial differential equation is p(t, x) = 1

√2πb2t exp −(x − at − x0)2 2b2t

 .

STABILITY

There are several kinds of stability questions and several ways to define stability for SDEs. To introduce this topic, it is useful to first review stability concepts for ODEs. Consider the initial-value problem:

d~y(t)

dt = ~f (~y(t)) for t > 0

~y(0) = ~a

where ~y : R → Rn and ~f : Rn → Rn. Suppose that ~z(t) satisfies the same differential equation as ~y(t) but with a different initial condition, i.e.,

d~z(t)

dt = ~f (~z) for t > 0

~z(0) 6= ~a.

Suppose that ~a = ~γ is a critical point, i.e., ~f (~γ) = ~0. Then the solution satisfies ~y(t) = ~γ for t ≥ 0. The initial-value problem is said to be stable at

~γ if given ǫ > 0 there is a δ > 0 such that

k~z(t) − ~γk < ǫ for t ≥ 0 whenever k~z(0) − ~γk < δ.

That is, small changes in the initial condition do not produce large changes in the solution for t ≥ 0.

In numerical solution of initial-value problems for ODEs, there are two common numerical stability concepts. Suppose that a single-step method for solving has the form:

 ~yk+1 = ~yk + h ~φ(h, ~yk) for k = 0, 1, 2, . . . , N − 1

~y0 = ~a.

where h = T /N is the step length, tk = kh, and ~yk ≈ ~y(tk) for each 0 ≤ k ≤ N.

The method is numerically stable if small changes in the initial condition do not produce large changes in the computational solution.

Specifically, if ~zk for k = 0, 1, . . . , N satisfies the method but with a different initial condition ~z0 6= ~y0, then the numerical scheme is numerically stable provided that

k~yk − ~zkk ≤ cǫ for 0 ≤ k ≤ N when k~y0 − ~z0k < ǫ.

If ~φ satisfies an appropriate Lipschitz condition, then the numerical scheme can be shown to be stable. However, the constant c can be extremely large, especially for stiff systems, which motivates another concept of numerical stability.

To study stability of stiff systems, the following scalar test problem is

stud-ied: 

dy(t)

dt = λy for t > 0 y(0) = a

where λ is a constant. Clearly, if a 6= 0, then y(t) → 0 as t → ∞ if and only if Re(λ) < 0.

Consider, for example, applying Euler’s method to this test problem. Then

 yk+1 = (1 + hλ)yk for k = 0, 1, 2, . . . , y0 = a.

and yk → 0 as k → ∞, if and only if −2 < Re(λh) < 0. The region of absolute stability of Euler’s method is −2 < Re(λh) < 0. The region of absolute stability gives a condition on the step length. If the method satisfies this condition, then the numerical solution does not “blow up” but decreases to zero behaving like the solution to the initial-value problem. For Euler’s method to behave similarly to the solution for a large negative value of λ, the step length h must be selected to be small.

However, for the backward Euler method, which for the test problem has the form:

 yk+1 = yk + hλyk+1 for k = 0, 1, 2, . . . , y0 = a,

the region of absolute stability is the entire left-half of the complex plane, i.e. −∞ < Re(λh) < 0.

To see this, consider

yk+1 = yk/(1 − hλ) for k = 0, 1, 2, . . . .

For this method, the step length h need not be chosen very small for the numerical solution to perform similarly to the actual solution even for an initial-value problem that involves a large negative value of λ.

The concept of absolute stability is useful when considering numerical so-lution of systems. For the test initial-value system

d~y(t)

dt = A~y for 0 ≤ t ≤ T

~y(0) = ~a.

where A is an n × n matrix, then ~y(t) → ~0 as t → ∞ provided that Re(λi) < 0 for each eigenvalue λi for 1 ≤ i ≤ n. For this problem, Euler’s method is

 ~yk+1 = (I + Ah)~yk for k = 0, 1, 2, . . . ,

~y0 = ~a.

The eigenvalues of I + Ah are 1 + λih for i = 1, 2, . . . , n and ~yk → ~0 as k →

∞ provided that −2 < Re(λi)h < 0 for each eigenvalue λi for 1 ≤ i ≤ n.

Hence, the step length h is forced to satisfy a condition determined by the eigenvalues with large negative real parts.

Now consider stability for SDEs. First, stability of a steady solution to an SDE is studied then numerical stability of an approximation is studied.

Consider stability of a steady solution for the SDE

 dX(t) = f(X(t)) dt + g(X(t)) dW (t) for 0 ≤ t ≤ T X(0) = a.

It is supposed that f (0) = g(0) = 0 so that X(t) ≡ 0 is a steady solution.

There are many ways to define stochastic stability for a steady solution of an SDE. Two ways are considered here, asymptotic stochastic stability and mean-square stability. It is assumed that X(0) 6= 0.

If lim

t→∞|X(t)| = 0 with probability 1, then X(t) ≡ 0 is said to be asymptotically stochastically stable.

If lim

t→∞E(|X(t)|2) = 0, then X(t) ≡ 0 is said to be mean-square stable.

It is interesting that some SDEs may be both asymptotically stochastically stable and mean-square stable while others may be asymptotically stochas-tically stable but not mean-square stable.

To illustrate this behavior, stability is analyzed for an SDE with f (X) = λX and g(X) = µX. In this case,

 dX(t) = λX(t) dt + µX(t) dW (t) for 0 ≤ t ≤ T X(0) = a

and E(X(t)) = X(0) exp(λt).

Using Itˆo’s formula, X2(t) satisfies the SDE

 d(X2(t)) = 2λX2(t) + µ2X2(t) dt + 2µX2(t) dW (t) for t > 0 X2(0) = a2.

It follows that E(X2(t)) satisfies the SDE

 d(E(X2(t))) = 2λE(X2(t)) + µ2E(X2(t)) dt for t > 0 E(X2(0)) = a2.

and the solution E(X2(t)) is found to be

E(X2(t)) = X2(0) exp((2λ + µ2)t).

This solution implies that the steady solution X(t) = 0 is mean-square stable if and only if λ + µ2/2 < 0.

Now consider Itˆo’s formula applied to ln(X(t)). Then,

 d(ln(X(t)) = (λ − µ2/2) dt + µ dW (t) for t > 0 ln(X(0)) = ln(a).

Let ∆t be a given interval width and let ti = i∆t for t = 0, 1, 2, . . . . This SDE can be exactly integrated from ti to ti+1 to yield:

ln(X(ti+1)) − ln(X(ti)) = (λ − µ2/2) (ti+1 − ti) + µηi p(ti+1 − ti)

. By the Law of Large Numbers, Sn

But, letting t = tn, 1 n∆t

Xn−1 i=0

ln X(ti+1) X(ti)



= 1

n∆t ln X(tn) X(0)



= 1

t ln X(t) X(0)



→ (λ − µ2/2) w.p.1 as t → ∞.

Therefore,

X(t) → X(0) exp((λ − µ2/2)t) w.p.1 as t → ∞.

This result implies that the steady solution X(t) = 0 is asymptotically sto-chastically stable if and only if λ − µ2/2 < 0.

Hence, for example, if λ = µ2/4 in the SDE, then X(t) → 0 with probability 1 as t → ∞ while E(X(t)) → ∞ and E(X2(t)) → ∞ under the same condition.

Now, numerical stability of SDEs is considered, in particular, with respect to stiff stochastic problems with additive noise and then, more briefly, with respect to multiplicative noise.

The test problem for additive noise has the form

 dX(t) = λX(t) dt + µ dW (t) for t > 0 X(0) = a.

Two kinds of numerical stochastic stability are numerical asymptotic sto-chastic stability and numerical mean-square stability. Let Xk and ˜Xk be two approximations with the same numerical method but with different initial values.

If lim

k→∞|Xk − ˜Xk| = 0 with probability 1, then the approximation is said to be asymptotically stochastically stable.

If lim

k→∞E(|Xk − ˜Xk|2) = 0, then the approximation is said to be mean-square stable.

Consider first Euler’s method for solution of test problem:

 Xk+1 = Xk + λXkh + µ ηk

h for k = 0, 1, . . . X0 = a

where Xk ≈ X(kh), ηk ∼ N(0, 1) for each k, and h is the step length. Fur-thermore, let ˜Xk be another numerical approximation but with a different initial approximation ˜X0 = ˜a. Let Zk = Xk − ˜Xk. Then, Zk satisfies

 Zk+1 = Zk + λh Zk for k = 0, 1, . . . Z0 = a − ˜a

and therefore,

|Xk − ˜Xk| = |Zk| = |1 + λh|k |Z0| for k = 0, 1, . . . .

Thus, Euler’s method is asymptotically and mean square stable for the test problem provided that −2 < λh < 0. An analogous result holds for stability of Euler’s method for stiff systems with additive noise. Specifically, Euler’s method is numerically stable for a stochastic system with additive noise

( d ~X(t) = A ~X(t) dt + µ d ~W (t) for t > 0 X(0) = ~a.~

provided that −2 < Re(λi)h < 0 for each eigenvalue λi of A.

相關文件