HungChen MonteCarlomethodsforStatisticalInference:GenerationofRandomNumbers

(1)

Monte Carlo methods for Statistical Inference:

Generation of Random Numbers

Hung Chen

hchen@math.ntu.edu.tw Department of Mathematics

National Taiwan University

25th February 2004

Meet at NS 104 On Wednesday from 9:10 to 12.

(2)

Outline

Introduction

{ Inverse CDF Method

{ Generate Discrete Random Variates { Limitation

Classical Uniform Variate Generator

{ Lehmer congruential generator (LCG)

{ Tausworthe feedback shift register generator { Combination of random number generators

Non-uniform Variate Generation { Simulating Stochastic Processes

Poisson process

(3)

Brownian motion

{ Acceptance/Rejection Methods { Bayesian Analysis

{ Simulating Multivariate Random Variates

Gibbs Sampling and MCMC

{ Brief Introduction on Markov Chain { MCMC

{ Gibbs sampling

(4)

Classical Uniform Variate Generator

Simulation is used heavily when analytical study of a statistical procedure becomes intractable.

Simulation of random variables and random processes using computers is among the fastest growing areas of computational statistics.

Many statistical techniques rely on simulating random variables.

{ One traditional area is the use of random numbers to sample from a population.

{ More recent applications include simulation of high- dimensional, complex stochastic systems that are beyond analytical studies.

(5)

{ In many practical situations the probability distributions are far too complicated to analyze and often it is easier to simulate these distributions on computers and the resulting samples can be analyzed instead.

The study of a random variable through simulations is becoming a powerful tool in the hands of the statisticians.

Monte Carlo experimentation is the use of simulated random numbers to estimate some functional of a probability distribution.

Building block in any simulation study is non-uniform variate generation.

{ Many algorithms are available.

(6)

{ Example: Generate normal random variable.

Box-Muller method (Polar method)

If X and Y are independent and standard normal random variables then for

= tan ¹

Y X

; R = p

X² + Y ²

is uniform in [0; 2] and R² is exponential with mean 2.

(0) U₁; U₂ iid U(0; 1)

(1) X₁ = ( 2 ln U₁)¹⁼² cos(2U₂) (2) X₂ = ( 2 ln U₁)¹⁼² sin(2U₂) (3) X₁; X₂ iid N(0; 1)

Inverse method

If X F , then F (X) U(0; 1).

(7)

{ In the above methods, it assumes that we can produce an endless ow of a iid uniform random variate generators.

On the computer, we generally settle for pseudorandom numbers, that is, numbers that appear to be random but actually deterministic.

CDF transformation method X = F (U), U U(0; 1) where

F (u) = inffx j F (x) ug is the generalized inverse of the cdf F .

{ For a standard exponential random variable, the transformation

X = log(U)

yields one exponential for each uniform variable.

(8)

{ How to simulate the process of ipping a coin with probability of head p?

For a discrete random variable, although the inverse of the cdf does not exist, the inverse cdf method can still be used.

{ The value of the discrete random variable is cho- sen as the smallest value within its countable range such that the cdf is no less than the value of the uniform variate.

For a multivariate random variable, the inverse cdf method yields a level curve in the range of the random variable; hence, the method is not directly useful for multivariate random variable.

{ Multivariate random variates can be generated us-

(9)

ing the inverse cdf method rst on a univariate marginal and then on a sequence of univariate conditionals.

(10)

Discrete Random Variables

A discrete random variable takes only a countable number of values with pre-dened probabilities.

A discrete random variable is characterized by its probability mass function dened as P (x₁) = p₁; P (x₂) = p₂; : : : ; P (x_n) = p_n; : : : such that such that for all i, 0 p_i 1, and P

i p_i = 1.

Commonly used discrete random variables are binomial, Poisson, geometric and negative-binomial. As an

How do we generate a Poisson random variable with parameter ?

(11)

The probability mass function is given by:

p_i = exp( )ⁱ

i! ; i = 0; 1; 2; : : : : Note that

P (X = i + 1)

P (X = i) = i + 1:

F_X(i + 1) can be written in the following interative form:

F_X(i + 1) = F_X(i) + P (X = i) i + 1: The algorithm is

i. Generate U according to U[0; 1].

ii. Set i = 0, p = exp( ), and F = p.

iii. If U < F , set X = i and stop.

(12)

iv. Set p = p=(i + 1), F = F + p, i = i + 1 v. Go to Step (iii).

Denition For a given random variable, with a specied probability mass function f(x_i; p_i); i = 0; 1; 2; : : :g, the process of selecting a value x_i with probability p_i is called Simulation. If this selection is performed many times, generating a sequence fX_jg, then

1 n

Xn j=1

I_X_j(fx_ig) ! p_i:

(13)

Uniform Random Number Generation

Use algebraic methods to generate sequences of numbers that mimic the behavior of a uniform random variable.

{ These numbers are called pseudorandom numbers.

{ A uniform pseudorandom number generator is a mapping f that, starting from an initial value x₀, generates a sequence

x₀; f(x₀); f(f(x₀)); f(f(f(x₀))); : : : :

Since f is computed on a computer (without the use of random number generator!!), it is a deterministic mapping. That is, given x₀ the remaining

(14)

sequence is xed everytime the sequence is computed.

The elements of such a sequence should have the following properties:

1. The patterns between the numbers the appearing in a sequence should be minimized.

2. The correlation between the neighboring elements should be reasonably small.

3. The values should be distributed nearly uniformly over the whole the range of possible values.

4. The sequences should have large periods, where a period is dened to be duration after which a sequence repeats itself.

5. There exist a set of goodness of t tests for test-

(15)

ing the probability distributions associated with the observed random variables. The elements of a pseudorandom sequence should provide a reason- able performance in these goodness of t tests.

No random number generator is capable of generating (a) uniform and (b) independent variates.

Slight defect in RNG may have dramatic eect on whole simulation study.

{ Deng and Chhikara (1992) { If U₁; U₂; : : : ; U_n iid U(0; 1),

Z_n =

P_n

i=1 U_i ⁿ₂

pn=12 N(0; 1):

What if the assumption of iid and/or \U(0; 1)"

fail?

(16)

Classical uniform variate generator

Linear congruential generator [Lehmer (1951)]

{ X_i = BX_{i 1} + A mod n.

{ U_i = X_i=m

{ LCG has been used in almost all computer systems and packages.

{ Popular LCG (e.g., IMSL, SAS) (a) B = 16807, A = 0, m = 2³¹ 1.

(b) Its period is m 1 2:1 10⁹. { Comments

Period ( m) depends on B; A; m; X₀.

The period is too short by today's standard.

Large-scale simulation study is more and more common.

(17)

uniformity in 1-dimensional space

LCG cannot generate set of all lattice points in k space, S_k, for k 2.

Consider S₂ = f(I; J) j 0 I; J mg and do plots of (U_i; U_i+1), i = 0; 1; 2; : : :

Insert p7 and 8 of Deng's note.

Feedback shift register [Tausworthe (1965)]

{ a_j = P_k

i=1 c_ia_{j i}(mod 2) where a_i; c_i 2 f0; 1g, c_k = 1

{ The mth random variate is the d bits binary num-

(18)

ber

0:a₀a₁ : : : a_{d 1} base 2

0:a_da_d+1 : : : a_{2d 1} base 2 ...

0:a_mda_md+1 : : : a_{md+d 1} base 2 : : :

It can have an extremely long period, 2^k 1, (if c_i's are properly selected) for a large k,

good theoretical k-space uniformity

Poor empirical performance

Combination generators:

Wichmann and Hill (1982): Add three LCGs and take its fractional part.

{ X_i = AX_{i 1} mod m₁

(19)

{ Y_i = BY_{i 1} mod m₂ { Z_i = CZ_{i 1} mod m₃

{ U_i = X_i=m₁ + Y_i=m₂ + Z_i=m₃ mod 1 Comments:

{ Period is LCM(m₁ 1; m₂ 1; m₃ 1).

For m₁ = 30269; m₂ = 30307; m₃ = 30323, its period is 6:95 10¹².

{ About 3000 times longer period than LCG-16807.

{ About three times slower than LCG.

{ No theoretical justication for uniformity provided.

Statistical justication given in Deng and George (1990)

{ Suppose that X₁ and X₂ are independent r.v. over

(20)

[0; 1] with pdfs f₁(x₁) and f₂(x₂) respectively.

{ j f₁(x₁) 1 j ₁, j f₂(x₂) 1 j ₂

{ Let Y = X₁+X₂ mod 1 and denote its pdf by f(y).

{ Conclusion: j f(y) 1 j ₁₂. In general, Y = P_n

i=1 X_i mod 1 and denote its pdf by f(y). Then

j f(y) 1 j

Yn i=1

_i:

(21)

Exponential and Poisson RVs

The exponential density function is dened by f(x) =

exp( x); if 0 x < 1,

0; otherwise.

Here is any positive constant, depending on the ex- periment.

The exponential density is often used describe ex- periments involving a question of the form: How long until something happens?

For example, the exponential density is often used to study the time between emissions of particles from a radioactive source.

\Memoryless" property:

Let T be an exponentially distributed random vari-

(22)

able with parameter .

It says that P (T > r + s j T > r) = P (T > s).

There is a very important relationship between the exponential density and the Poisson distribution.

Dene X₁; X₂; : : : to be a sequence of independent exponentially distributed random variables with parameter .

Think of X_i as denoting the amount of time between the ith and (i + 1)st emissions of a particle by a radioactive source.

Consider a time interval of length t, and we let Y denote the random variable which counts the number of emissions that occur in this time interval.

Find the distribution function of Y (clearly, Y is a

(23)

discrete random variable).

Let S_n denote the sum X₁ + X₂ + : : : + X_n, then it is easy to see that

P (Y = n) = P (S_n t and S_n+1 > t)

= P (S_n t) P (S_n+1 t):

The density of S_n is given by the following formula:

g_n(x) =

( ^(x)_{(n 1)!}^{n 1} exp( x); if x > 0;

0; otherwise.

It is a gamma density with parameters and n.

It is easy to show by induction on n that the cumu-

(24)

lative distribution function of S_n is given by:

G_n(x) = 8<

: 1 exp( x)

1 + ^x_1! + + ^(x)_{(n 1)!}^{n 1}

; if x > 0;

0; otherwise.

We recognize easily that it is the probability of taking on the value n by a Poisson-distributed random variable, with parameter t.

The above relationship will allow us to simulate a Poisson distribution, once we have found a way to simulate an exponential density.

To simulate a Poisson random variable W with parameter , we

{ Generate a sequence of values of an exponentially distributed random variable with the same param-

(25)

eter.

{ Keep track of the subtotals S_k of these values.

{ We stop generating the sequence when the subto- tal rst exceeds .

{ Assume that we nd that S_n < S_n+1. Then the value n is returned as a simulated value for W .

(26)

Simulating Poisson Processes

A point process consisting of randomly occurring points in the plane is said to be a two-dimensional Poisson process having rate , if

1. the number of points in any given region of area A is Poisson distributed with mean A; and

2. the number of points in disjoint regions are independent.

Let O be the origin in R² and R_i be the ith nearest Poisson point to O, i 1 (R₀ = O).

It can be shown that

(R²_i R²_{i 1}) are exponentially distributed with rate

.

(27)

By symmetry, the respective angles of the Poisson points are independent and uniform [0; 2].

The following algorithm simulates a two-dimensional Poisson process in a ball of radius r centered at O, C(r).

1. Generate independent exponentials X₁; X₂; : : : with rate 1, stopping at

N = min

n : X₁ + X₂ + + X_n

> r²

2. if N = 1, stop, there are no points in C(r). Other- wise, for i = 1; 2; : : : ; N 1, set

R_i = p

(X₁ + X₂ + + X_i)=:

3. Generate independent uniform [0; 1] random variables U₁; U₂; : : : ; U_{N 1}.

(28)

4. Return the N 1 Poisson points in C(r) whose polar coordinates are (R_i; 2U_i); i = 1; : : : ; N 1.

(29)

Brownian motion Finance Application:

As you may know something about the celebrated Black- Scholes formula of nance. The problem addressed by the formula is determining how much an \option"

should cost. This option is called the \call" options.

A call option on a certain stock is the right to buy a share of the stock at a certain xed price (the strike price) at a certain xed time in the future (the maturity date).

If I buy a call option from you, I am paying you a certain amount of money in return for the right to force you to sell me a share of the stock, if I want it, at the strike price, K, on the maturity date, t₁.

(30)

The problem is, what is the right amount of money for me to pay for this right?

{ The meaning of the term right here relates to the economic term arbitrage.

{ An arbitrage opportunity is the opportunity to make money instantly and without risk. That is, you get some money for sure, right now.

{ Such free lunches are not supposed to exist, or at least should be rare and short-lived.

The basic reason for believing this is that many people are looking for such opportunities to make money.

If the price of commodity A were so low, for example, that some clever nancial transaction in-

(31)

volving buying commodity A and perhaps selling some others were guaranteed to make an instan- taneous prot, then many eager arbitrage seek- ers would try to perform the transaction many times.

The resulting increased demand for commodity A would cause its price to increase, thereby destroying the arbitrage opportunity.

It assume that there is a nancial instrument called bond such that its \interest rate" or the \riskless"

rate of return be r, that is, $1 in a riskless invest- ment today becomes $exp(rt) at time t.

dB(t)

dt = rB(t);

where B(t) is the bond price at time t.

(32)

Let the stock price at time t be X(t).

A little thought shows that the value of the option at time t₁ is the random variable (X(t₁) K)₊, since it makes sense for me to exercise the option if and only if X(t₁) > K.

Let Y (t) denote the magic, no-arbitrage price for the option that we are seeking.

Assume that Y (t) may be expressed as some function f(X(t); t) of X(t) and t; our goal is to determine the function f.

Assume a simple probabilistic model for the evolu- tion of the stock price: suppose X is the geometric Brownian motion having stochastic dierential

dX = Xdt + XdW:

(33)

Thus, X is the exponential of a Brownian motion with drift.

Note that the riskless investments change as exp(linear function), and stocks change as exp(Brownian motion).

What we are really assuming is that returns, that is, proportional changes in the stock price, are station- ary and independent over dierent time intervals.

The formulation of this process was inspired by the physical phenomenon of Brownian motion, which is the irregular jiggling sort of movement exhibited by a small particle suspended in a uid, named after the botanist Robert Brown who observed and studied it in 1827.

A physical explanation of Brownian motion was given

(34)

by Einstein, who analyzed Brownian motion as the cumulative eect of innumerable collisions of the suspended particle with the molecules of the uid.

Einstein's analysis provided historically important support for the atomic theory of matter, which was still a matter of controversy at the time-shortly after 1900.

The mathematical theory of Brownian motion was given a rm foundation by Norbert Wiener in 1923;

the mathematical model we will study is also known as the \Wiener process."

Brownian motion and diusions are used all the time in models in all sorts of elds, such as nance (in mod- eling the prices of stocks, for example), economics,

(35)

queueing theory, engineering, and biology.

Just as a pollen particle is continually bueted by collisions with water molecules, the price of a stock is bueted by the actions of many individual in- vestors.

Construction of Brownian motion on the time interval [0; 1]:

Connect-the-dots approach: At each stage of the construction we obtain a more and more detailed picture of a sample path.

W (0) = 0

For W (1), we generate a N(0; 1) random variable Z₁ and take Z₁ to be W (1) since W (1) N(0; 1).

(36)

Given that the path passes through the two points (0; 0) and (1; Z₁), the conditional expectation is the linear interpolation X⁽⁰⁾(t) = Z₁t.

This will be our rst crude approximation to a sample path.

Next let's simulate a value for W (1=2).

{ Given the values we have already generated for

W (0) and W (1), we know that W (1=2) N(Z₁=2; (1=2)(1=2)).

{ Generate another independent standard random variable Z₂ and take W (1=2) to be X⁽⁰⁾(1=2) + (1=2)Z₂.

{ Dene the approximation X⁽¹⁾ to be the piecewise linear path joining the three points (0; 0), (1=2; W (1=2)), and (1; W (1)).

(37)

Simulate W (1=4) and W (3=4).

{ E(W (t) j W (0); W (1=2); W (1)) = X⁽¹⁾(t)

{ Conditional variance of both W (1=4) and W (3=4) is (1=4)(1=4)=(1=2) = 1=8.

{ Generate two more independent standard random variables Z₃ and Z₄, and dene

W (1=4) = X⁽¹⁾(1=4) + p1

8Z₃; W (3=4) = X⁽¹⁾(3=4) + p1

8Z₄:

{ The approximation X⁽²⁾ to be the piecewise linear interpolation of the simulated values we have obtained for the times 0, 1=4, 1=2, 3=4, and 1.

In general, to get from X⁽ⁿ⁾ to X⁽ⁿ⁺¹⁾, we generate

(38)

2ⁿ new standard normal random variables Z₂ⁿ₊₁; Z₂ⁿ₊₂; : : : ; Z₂n+1, multiply these by the appropriate conditional stan-

dard deviation p

2 ^{n 2} = 2 ^{(n=2) 1}, and add to the values X⁽ⁿ⁾(1=2ⁿ⁺¹); X⁽ⁿ⁾(3=2ⁿ⁺¹); : : : ; X⁽n)(1 1=2ⁿ⁺¹)

to get the new values X⁽ⁿ⁺¹⁾(1=2ⁿ⁺¹); X⁽ⁿ⁺¹⁾(3=2ⁿ⁺¹); : : : ; X⁽n+

1)(1 1=2ⁿ⁺¹).

Claim. With probability 1, the sequence of functions X⁽¹⁾; X⁽²⁾; : : : converges uniformly over the interval [0; 1].

{ The limit of a uniformly convergent sequence of continuous functions is a continuous function.

{ To appreciate the need for uniformity of conver- gence in order to be guaranteed that the limit function is continuous, recall the following stan-

(39)

dard example.

For n = 1; 2; : : :, consider the function tⁿ for t 2 [0; 1]. Then as n ! 1, this converges to 0 for all t < 1 whereas it converges to 1 for t = 1, so that the limit is not a continuous function.

{ Dene the maximum dierence M_n between X⁽ⁿ⁺¹⁾ and X⁽ⁿ⁾ by

M_n = max

t2[0;1] j X⁽ⁿ⁺¹⁾(t) X⁽ⁿ⁾(t) j : { Note that if P

M_n < 1, then the sequence of functions X⁽¹⁾; X⁽²⁾; : : : converges uniformly over [0; 1].

{ It is sucient to show that P fP

M_n < 1g = 1.

(40)

{ Observe that

M_n = 2 ^{n=2 1} maxfj Z₂ⁿ₊₁ j; j Z₂ⁿ₊₂ j; : : : ; j Z₂n+1 jg:

{ Note that X1

n=1

P fj Z_n j> p

c log ng 2p1 2

X1 n=1

e (1=2)c log n

pc log n

= p2 c

X1 n=1

1

n^c=2(log n)¹⁼² which is nite for c > 2.

{ By the Borel-Cantelli lemma, P fj Z_n j> p

c log n innitely ofteng = 0:

{ Taking c > 2, the fact implies that with probability 1,

M_n 2 ^{n=2 1}q

c log(2ⁿ⁺¹)

(41)

holds for all suciently large n.

We have P

M_n < 1 with probability 1, which completes the proof of the above claim.

(42)

Acceptance/Rejection Method

This method assumes that we have a method for simulating from some density function g and our task is utilize samples from g to simulate from a given density function f.

g can be fairly arbitrary except for one condition men- tioned below.

The basic idea is to simulate from g and accept the samples with probability proportional to the ratio f=g.

{ Requirement: Let C be a constant such that f(Y )

g(Y ) C; for all Y :

Simulation procedure:

(43)

(1) Simulate Y from the density g and simulate U from uniform [0; 1].

(2) If U f(Y )=[Cg(Y )] then X = Y else go to step 1.

(44)

Validity of Acceptance/Rejection Method

Let X be the value obtained and n be the number of iterations required to reach this value.

P (X x) = P (Y_n x) = P

Y x j U f(Y_n) Cg(Y_n)

= P

Y x; U _Cg(Y^f(Yⁿ⁾

n)

P

Y x; U _Cg(Y^f(Yⁿ⁾

n)

=

R _x

1

R _f(y)=Cg(y)

0 1dug_Y (y)dudy R ₁

1

R _f(y)=Cg(y)

0 1dug_Y (y)dudy

=

R _x

1 f(y)

Cg(y)g_Y (y)dy R ₁

1 f(y)

Cg(y)g_Y (y)dy;

(45)

since Y and U are independent random variables.

(Their joint density function is the product of the marginals g(y) 1)

As x ! 1, the left side goes to 1 and the integral on the right side also goes to 1.

Therefore,

C

Z ₁

1

f(y)

Cg(y)g_Y (y)dy = 1 and P (X x) = R _x

1 f(Y )dY . We conclude that X is random with probability density f.

Eciency: For a given value of Y we accept Y by generating a uniform U and comparing U with f(Y )=Cg(Y ).

Accept Y with probability f(Y )=Cg(Y ).

(46)

Each iteration in the loop involves independent real- izations, we can compute the probability of accept- ing Y as X according to

P

U f(Y ) Cg(Y )

= K = 1 C:

If C is large then the process, of generating samples from f using this method, will be slow.

What is the distribution of n?

Illustration: Use acceptance/rejection method to generate sample from standard normal density function.

Find g with support on ( 1; 1).

{ Sampling from the standard exponential density function (g(x) = exp( x)) can be done quickly.

(47)

{ Note that the support of g is on [0; 1) and f is symmetric at 0. Convert the problem to the generation of half-normal variate.

Generate X =j Z j with density function f(x) = p2

2 exp x² 2

!

; x 0:

Determine C.

{ The bound on the ratio of f to g:

f(x) g(x) =

r2e

exp (x 1)² 2

!

r2e

= C:

{ f(x)=Cg(x) = exp( (x 1)²=2).

Algorithm 1

(48)

1. Generate Y , an exponential random variable with mean 1, and U, a uniform [0; 1] random variable.

2. If U exp( (Y 1)²=2) set X = Y , otherwise return to (1).

Algorithm 2

Observe that log(U) (Y 1)²=2 and log(U) is exponential with rate 1.

1. Generate Y₁ and Y₂, two samples from exponential random variable with mean 1.

2. If Y₂ (Y₁ 1)²=2 set X = Y₁, otherwise return to (1).

Having generated a random variable which is the ab- solute value of a standard normal, we can generate sample from standard normal.

(49)

1. Generate U a uniform random variable the algorithm described above.

2. If U 2 (0; 1=2] set Z = X, else set Z = X.

Example on R-programming:

Generate deviates from a beta distribution with parameters and .

f(x) = 1

B(; )x¹(1 x)¹:

It has a nite support [0; 1].

Choose g as a uniform.

Need to nd the mode f. Solve 1

x

1

1 x = 0

and obtain xmode = ( 1)=( + 2).

(50)

C = (xmode)¹(1 xmode)¹ ( + )=( () ()) R-program

alpha<- 2; beta<- 7; nsimu<- 1000

xmode<- (alpha -1)/(alpha+ beta -2)

dmax<- xmode^(alpha -1)*(1-xmode)^(beta-1)*

gamma(alpha+beta)/(gamma(alpha)*gamma(beta))

y<- runif(nsimu)

x<- na.omit(ifelse(runif(nsimu)<=dbeta(y,alpha, beta)/dmax,y,NA))

Note that dmax 3:18 in this case, we expect to get around 1000=3:18 deviates.

Remarks

No clear rule to nd g.

(51)

{ g(y) should be similar and dominate f(y).

The constant C maynot be easy to nd.

{ As an example, how do we determine C for the posterior distribution

p( j y) / (2 + )¹²⁵(1 )³⁸³⁴

If it is hard to apply rejection method, what can be used?

(52)

Simulating Multivariate Random Variates

With multivariate distributions, one is often faced with enormous problems for random variate generation.

Von Neumann's rejection method [von Neumann 1963] requires a case-by-case study.

{ It is dicult to determine a usable majorizing density.

The conditional method (generate one random variate; generate the next one conditional on the rst one, and so forth) requires often dicult-to-compute marginal densities.

Consider the generation of multivariate normal with mean 0 and variance-covariance matrix = (_ij)_pp.

(53)

{ X_p1 has a multivariate normal distribution i X can be written as

X = + AZ

where _pp, A are constant and X = (X₁; : : : ; X_p)^T where the Z_j are independent standard normal variables.

{ = AA^T

A is nonsingular i is positive denite.

{ By the spectral decomposition theorem, there ex- ists P orthogonal such that = P^TDP.

Here D is the diagonal matrix whose diagonal en- tries are nonnegative eigenvalues of .

{ If rank() = p, ¹⁼² = PD¹⁼²P^T. Useful R-command:

(54)

solve: Solve a system of equations.

eigen: Computes eigenvalues and eigenvectors.

backsolve: Solve an upper or lower Triangular System.

chol: Compute the Cholesky factorization of a real symmetric positive-denite square matrix.

qr: The QR decomposition of a matrix

{ Write X = ((X⁽¹⁾)^T; (X⁽²⁾)^T)^T, the conditional distribution of X⁽²⁾ given X⁽¹⁾ = x⁽¹⁾ is normal with mean ⁽²⁾ + ₂₁₁₁¹(x⁽¹⁾ ⁽¹⁾) and variance ₂₂

₂₁₁₁¹₂₁.

{ X₁ is generated as N(₁; ₁₁),

{ X₂ is generated as N(₂+₁₂X₁=₁₁; ₂₂ ₁₂² =₁₁),

(55)

and so on,

Generate multivariate random variates by use of ei- ther iid univariates followed by a transformation.