Stochastic Models for Disease Natural History

Chapter 2 Literature Review

2.3 Stochastic Models for Disease Natural History

A sequence of random variables {ܺ_ఈ,Ƚ= 0,1,…} is called a Markov chain if, for every

collection of integers, ߙ_଴ ൏ ߙ_ଵǡ ൏ ڮ ൏ ߙ_௡ ൏ ߚ, the conditional distributions of

ܺ_ఉsatisfy the relation:

ܲ_௥൛ܺ_ఉ ൌ ݅_ఉหܺ_ఈ_బǡ ǥ ǡ ܺ_ఈ_೙ൟ ൌ ܲ_௥൛ܺ_ఉ ൌ ݅_ఉหܺ_ఈ_೙ൟ, for ݅_ఉ

The outcome in the future (ܺ_ఉൌ ݅_ఉሻ݅ݏ݊݋݈݋݊݃݁ݎ݀݁݌݁݊݀݁݊ݐݑ݌݋݊ݐ݄݁݌ܽݏݐݏݐܽݐ݁

ov mmmmmoododododelelelelelelelele

me me me me

mes tththththththththatatatatatatatatPPPPPPPPD D D D D D D D isisisisisisisisaaaaaaaa

sed ddddasasasasasa ssssstatatatagegeggg VVVVVVViiiiisss s s

(ܺ_ఈ_బǡ ǥ ܺ_ఈ_೙షభሻ

For each ܺ_ఈ, the absolute probability is denoted by ܲ_௥ሼܺ_ఈ ൌ ݅_ఈሽ ൌ ܽ_௜_ഀ

For every pair of random variables, _ఈandܺ_ఉ, the conditional probability is denoted by

ܲ_௥ሼܺ_ఉ ൌ ݅_ఉŽܺ_ఈ ൌ ݅_ఈሽ ൌ ܲ_௜_ഀ_Ǥ௜_ഁġ

The joint probabilities of ܺ_ఈǡ ܺ_ఉǡ ܺ_ఊ, for Ƚ ൏ Ⱦ ൏ ɀ, are given by

ܲ_௥൛ܺ_ఈൌ ݅_ఈǡ ܺ_ఉ ൌ ݅_ఉǡ ܺ_ఊൌ ݅_ఊൟ ൌ ܽ_௜_ഀܲ_௜_ഀ_ǡ௜_ഁܲ_௜_ഁ_ǡ௜_ംǡ ܽ݊݀ ܲ_௥൛ܺ_ఈൌ ݅_ఈǡ ܺ_ఉ ൌ ݅_ఉൟ ൌ ܽ_௜_ഀܲ_௜_ഀ_ǡ௜_ഁ

Therefore, for any collection of integers Ƚ ൏ Ⱦ ൏ ڮ ൏ Ɂ ൏ ɂ, the joint probabilities are

ܲ_௥൛ܺ_ఈൌ ݅_ఈǡ ܺ_ఉ ൌ ݅_ఉǡ ǥ ǡ ܺ_ఋൌ ݅_ఋǡ ܺ_ఌ ൌ ݅_ఌൟ ൌ ܽ_௜_ഀܲ_௜_ഀ_ǡ௜_ഁǥ ܲ_௜_ഃ_ǡ௜_ച

A Markov chain with state space being the set of all the non-negative integers is

completely determined by the initial absolute probability distribution

ܲ_௥ሼܺ_଴ ൌ ݅_଴ሽ ൌ ܽ_௜_బǡ݅_଴ ൌ ͳǡʹǡ… and the transition probabilities

ܲ_௥ሼܺ_ఈାଵ ൌ ݅_ఈାଵȁܺ_ఈൌ ݅_ఈሽ ൌ ܲ_௜_ഀ_ǡ௜_ഀశభ , ݅_ఈǡ ݅_ఈାଵ ൌ ͳǡʹǡ ǥ for Ƚ=0,1,…

The transition probabilities of a time homogeneous chain is denoted by

ܲ_௥ሼܺ_ఈାଵ ൌ ݆ȁܺ_ఈ ൌ ݅ሽ ൌ ܲ_௜௝

The transition probability ܲ_௜௝ for a three-state Markov model can be arranged in the form

of a matrix

P=൭

ܲ_଴଴ ܲ_଴ଵ ܲ_଴ଶ

ܲ_ଵ଴ ܲ_ଵଵ ܲ_ଵଶ

ܲ_ଶ଴ ܲ_ଶଵ ܲ_ଶଶ൱

ഀ

bilititititity y y y yy isisisisisddddenenotototttttedededededbbbbbbyyyyy

2.3.2 Three-state Homogeneous Markov Model for Disease Natural History

Chen et al applied a three-state Markov model to estimate sojourn time in chronic

disease screening without data of interval cases.⁴³They model the disease with a

continuous-time Markov process in which X(t), the state of an individual at time t, is a random variable with a state space Ω={0,1,2}, where 0 represents no disease, 1 represents

preclinical screen detective disease (PCDP) and 2 represents clinical phase (CP). The

clinical phase in this model is an absorbing state in Markov processes language because

the natural history cannot be estimated beyond diagnosis due to the effect of therapy. They

also assume this is a progressive model.

The transition rates in the three-state model can be expressed as an intensity matrix,

൭െߣ_ଵ

ߣ_ଵ represents the transition rate from no disease to the PCDP, ߣ_ଶ represents the transition

rate from the PCDP to the clinical phase.

Given the transition intensity matrix above, transition probabilities for a three-state model

can be expressed as

The likelihood function based on the prevalent screen in a cohort with N individuals is

ܮ

_ଵ

ሺǤ ሻ ൌ ෑ ൬ ܲ

_଴ଵ

ሺݒ

_௠

ሻ

ݒ_௠ represents age at fist screen for mth subject

ݔ_௠ ൌ ͳ when the mth subject is detected as a positive case

ݔ_௠ ൌ Ͳ otherwise.

However, as the previous mention above, the Markov model used to assume a

homogeneous process that a constant hazard rate with time for progression for state to

state. This may be unrealistic in medicine and biology.

2.3.3 Three-state Model with Weibull Distribution

In order to deal with the non-constant hazard in the stochastic model, Chen et al

propose a non-homogeneous three-state model for the disease natural history of oral

cancer.⁴⁴They model the time of transitions from normal to leukoplakia and leukoplakia

to invasive carcinoma with two Weibull distributions. The transition probabilities for

staying in a no disease state (state 0), transitions from normal to leukoplakia (state 1) (2 (2 (2 (2 ( --3)3)3)3)3)

and from normal to invasive carcinoma (state 2) in a given time interval [t1, t2] are

expression as follows:

ܲ_଴଴ሺݐ_ଵǡ ݐ_ଶሻ ൌ ͳ െ න ݂_ଵ

௧_మ ௧భ

ሺݑሻ݀ݑ

ܲ_଴ଵሺݐ_ଵǡ ݐ_ଶሻ ൌ ׬ ݂_௧^௧^మ _ଵ

భ ሺݑሻ ቀͳ െ ׬ ݂௧_మ ଶ

௨ ሺݒሻݒቁ ݀ݑ(2-4)

ܲ_଴ଶሺݐ_ଵǡ ݐ_ଶሻ ൌ න ݂^௧^మ _ଵ

௧భ

ሺݑሻ න ݂^௧^మ _ଶ

௨

ሺݒሻ݀ݒ݀ݑ

f1(t) and f2(t) are the probability density function of Weibull distributions for time of

transition from states 0 to 1 and from state 1 to 2. The two Weibull distributions are

denoted as W1(ߣ_ଵ଴,ߛ_ଵ)and W2(ߣ_ଶ଴, ߛ_ଶ). ߣ_ଵ଴ andߣ_ଶ଴ are scale parameters and ߛ_ଵ and ߛ_ଶ are shape parameters for the two corresponding transitions. The transition rates as a

function of time are expressed as follows:

ߣ_௜ ൌ ߣ_௜଴ߛ_௜ݐ^ఊ^೔^ିଵ where i=1 or 2

The probability of remaining in state i-1 in time t is

ܵ_௜ሺݐሻ ൌ ቄെ ׬ ߣ_଴^௧ ௜଴ߛ_௜ݑ^ఊ^೔^ିଵݑቅ ൌ ሺെߣ_௜଴ݐ^ఊ^೔ሻ (2-5)

The corresponding probability density function is

݂_௜ሺሻ ൌ ߣ_௜଴ߛ_௜ݑ^ఊ^೔^ିଵሺെߣ_௜଴ݐ^ఊ^೔ሻ

The transition probabilities for staying in state 1 and state 2 were also denoted as

follows:

ܲ_ଵଵሺݐ_ଵǡ ݐ_ଶሻ ൌ ͳ െ ׬ ݂௧మ ଶ ௧_భ ሺݑሻݑ

ܲ_ଵଶሺݐ_ଵǡ ݐ_ଶሻ ൌ ׬ ݂௧మ ଶ

௧_భ ሺݑሻݑ (2-6) al [[[[[ttttt11111, tttt ]22222]]]]]]]]ararararararararareeeeeeee

The natural history from state 1 (leukoplakia) to state 2 (invasive carcinoma) is usually

unobservable due to the interruption of medical treatment. We can only estimate

parameters via equation (1), P00, P01and P02.

2.3.4 Incorporation of patient specific covariates

The effect of patient specific covariates, say x, on the three-state stochastic model was

assessed by the exponential regression model that treats scale parameter in the Weibull

distribution as a function of patient-specific covariates. It is expressed as follows:

ߣ_௜଴^௠ ൌ ߣ_௜଴଴ሺߚ_௜଴߯^௠ሻ

ߣ_௜଴଴ : the scale parameter of Weibull distribution for state i

߯^௠ : a vector of covariates for subject m

ߚ_௜଴ : corresponding regression coefficient

2.3.5 Bayesian inversion for a non-standard case-cohort design

For an n-state disease natural history, n sets of random samples for each transition were

selected in case-cohort study design in Chen et al. Let S denoted an indicator of whether

a subject was sampled (S=1). For individual i, let ߨ_௝^௧^೔ be sampling fractions for state j

at time ti . ߨ_௝^௧^೔ was denoted as follows:

ߨ_௝^௧^೔ ൌ ሺ ൌ ͳȁͲ ՜ ݆Ǣ ݐ_௜ሻ

nomommmma)aa)a)a iiiiiiss s ss sssuususususususususuauauauauauauauaualllllllllllllllllly y yyyyyyy

y y y

yeeeeestiimimimimimimimimatatatatatatatateee e e e

The sampling fractions for state j can be expressed as ߨ_௝ if we assume that sampling

fractions are independent of the individual. Using Bayesian inversion, the probability of

transition of being state j at time tigiven a subject was sampled is P(0՜ ݆Ǣ ݐ_௜ȁܵ ൌ ͳሻ

The transition probabilities P0j(ti) are derived from equation (1).

Likelihood function, parameter estimation and model validation

The data on the first oral examination were used to estimate the parameters relate to the

disease natural history. This yields three possible observed transitions before the first

examination: staying in normal (state 0 Æ 0), normal to leukoplakia (state 0Æ 1) and

normal to invasive carcinoma (state 0 Æ 2). According to the above equation, P(0՜ ݆Ǣ ݐ_௜ȁܵ ൌ ͳሻ

The likelihood function for the normal-leukoplakia-invasive carcinoma cohort with

three covariates is

age i of the first examination.

thahahahahattttt sasaaaaaampmpmpmpmpmpmpmpmplililililililililingngngngngngngngng

2.3.6 Five-state non-homogeneous stochastic model

Chen et al further extended the three-state model to the k-state model.⁴⁵They use

normal-adenoma-carcinoma for colorectal cancer for the example. The natural history of the colorectal cancer is classified by adenoma size. The state space Ω={0,1,2,3,4},

where state 0 represent normal, state 1 represent diminutive adenoma, state 2 represent

small adenoma, state 3 represent large adenoma, and state 4 represent invasive

carcinoma. They apply the hazard rate from normal (state 0) to diminutive adenoma

(state 1) change with time and denoted as ߣ_ଵሺݐሻ with Weibull distribution. The Markov

property was assumed for the remaining transition rate of ߣ_ଶ to ߣ_ସdue to the

complexity of algebra increases if each transition rate is modelled by the Weibull

distribution. The natural history of the above process is divided into two parts: 1.

Non-homogeneous Markov property for the hazard rate for normal to diminutive adenoma. 2.

Homogeneous Markov property for the remaining transitions. The transition matrix is as

follows:

The time of transition from states 0 to 1 is modeled byߣ_ଵሺݐሻ with Weibull distribution.

The remaining transition matrix M is as below:

d de de de

dell.⁴⁵⁴⁵⁴⁵⁴⁵⁴⁵⁴⁵⁴⁵⁴⁵ThThThThThThThTheyeyeyeyeyeyeyeyuuuuuuuusesesesesesesese

nattturururururralalalalalhhhhisistotoryryryryryr

As the non-homogeneous part that models the hazard rate of the onset of diminutive

adenoma with a Weibull distribution, the transition probabilities from state 0 (normal) to

state 1-4 can be derived as follows.

The probabilities for subjects staying as normal during [t1, t2] is

ܲ_଴଴ሺݐ_ଵǡ ݐ_ଶሻ ൌ ͳ െ ׬ ݂_௧^௧^మ ଵሺݑሻ݀ݑ

భ (2-10)

݂_ଵሺݐሻ : the probability density functions of Weibull distribution for the transition from

state 0 to 1

The probabilities for an individual progressing from state 0 to state j during [t1, t2] is

ܲ_଴௝ሺݐ_ଵǡ ݐ_ଶሻ ൌ ׬ ݂_௧^௧^మ ଵሺݑሻ ൈ ܲ_ଵ௝^ெሺݑǡ ݐ_ଶሻ݀ݑ

భ (2-11)

j=1,2,3,4; ܲ_ଵ௝^ெ(.): transition probabilities derived from ܲ_௜௝^ெሺܽǡ ܾሻ

According to the equation as below, P(0՜ ݆Ǣ ݐ_௜ȁܵ ൌ ͳሻ

The likelihood function for adenoma-carcinoma is ς ൬_σ^గ^బ^௉_గ^బబ^ሺ௧^೔^ሻ

2.3.7 Semi-Markov Model

To consider death as an absorbing state, the five-state Markov model (Figure 4-2) is

extended to the following model.

As the transition from the current sate to the next state, particularly absorbing state i.e.

death, is highly dependent on how long they stay in the current stat, a six-state

semi-Markov model will be proposed to model the temporal natural history of H-Y based PD.

State space Ω, Ω={0,1,2,3,4,5} is defined similarly as above. Let X={X0, X1,,…, Xn}

denote n observed successive transitions for an individual during a period of time t, where

X0is the initial state and Xnis the X final state after n transitions. We assume the total number of transition is finite and XאΩ. As a six-state semi-Markov process will be

applied, in addition to X, which is said to form an embedded Markov chain, we still Free of PD

require sojourn time distribution to depict the time spent in the current state before

transition to the next state. In parallel with X, T= {T0, T1, …Tn} is denoted to represent the

entry time into state Xnafter n transitions. According to X and T, a semi-Markov process

can be formed by transition probabilities (Pij) and distribution of sojourn time (Fij(t))

expressed by

ܲ_௜௝ ൌ ሺܺ_௡ାଵ ൌ ݆ȁܺ_௡ ൌ ݅ሻ (4-7)

ܲ_௜௝ is a homogeneous process

ܨ_௜௝ሺሻ ൌ ሺܶ_௡ାଵെ ܶ_௡ ൑ ݐȁܺ_௡ାଵൌ ݆ǡ ܺ_௡ ൌ ݅ሻ

For example, the transition from SD early H-Y stage (I&II) (j=1) to death (j=5) is

determined by the transition probability (P15) and also the distribution for the time spent in

early SD H-Y stage F15(t).

Fij(t) is specified by a generalized Weibull distribution expressed by ܨ_௜௝ሺሻ ൌ ͳ െ ൬െ ൬_ఙ^௧

೔ೕ൰^ఔ^೔ೕ൰ (4-8)

The parameters of ɐ and ɋ can change with time.

ߥ_௜௝ and ߪ_௜௝are estimated using the maximum likelihood method.

Suppose we have N individual (m=1,…..N) and the subject m had nmsuccessive

transition. The observed sequence is denoted as {߯_଴^௠ǡ ǥ ߯_௡^௠ሽ and the corresponding entry

times into state X is denoted by {ܶ_଴^௠ǡ ܶ_ଵ^௠ǡ ǥ ǡ ܶ_௡^௠_೘శభሽ.

The likelihood function

statatatatateee e e bebebeeeeeeefofofofofofofofoorererererererere

ot ot ot ot

otededededddddttttttto o oooooorererererererereprprprprprprprpresesesesesesesesenenenenenenenentt t tt tt ttthththththththehe

i-MaMaMaMaMaarkrkrkrkrkovovovovo pppproooooocececececesssssssssss

ሺɐǡ ɋሻ ൌ ς^ே_௠ୀଵሼς^௡_௟ୀଵ^೘ሺܲ_௫_೗షభ^೘ _௫_೗^೘݂_௫_೗షభ^೘ _௫_೗^೘ሺܶ_௟^௠െ ܶ_௟ିଵ^௠ ሻ ൈ σ_௃ஷ௫_೙೘^೘ ܲ_௫_೙೘^೘ _௝ܵ_௫_೙೘^೘ _௝ሺܶ_௡^௠_೘శభെ

ܶ_௡^௠_೘ሻ^ఋ^೙೘శభ^೘ (4-9)

The latter part is related to right censoring with censoring indicator of Ɂ ߜ_௡^௠_೘శభ ൌ ͳ if ܺ_௡^௠_೘ is not final state

ߜ_௡^௠_೘శభ ൌ Ͳ otherwise

2.4 Covariates associated with the progression of

在文檔中隨機過程應用於預防巴金森氏症Hoehn-Yahr 分類疾病進展之實證評估 (頁 29-40)