國
立
交
通
大
學
統計學研究所
碩
碩
碩
碩
士
士
士
士
論
論
論
論
文
文
文
文
臨床試驗之連續性滿足點 Phase II/III 調適性
設計
An Alternative Phase II/III Design in Clinical Trials for
Continuous Endpoint
研 究 生:劉佳佩
指導教授:蕭金福 教授
中
中
中
中
華
華
華
華
民
民
民
民
國
國
國
國
九
九
九
九
十
十
十
十
九
九
九
九
年
年
年
年
六
六
六
六
月
月
月
月
臨床試驗之連續性滿足點 Phase II/III 調適性設計
An Alternative Phase II/III Design in Clinical Trials for
Continuous Endpoint
研 究 生:劉佳佩 Student:Chia-Pei Liu
指導教授:蕭金福 Advisor:Chin-Fu Hsiao
國 立 交 通 大 學
統計學研究所
碩 士 論 文
A ThesisSubmitted to Institute of Statistics College of Science
National Chiao Tung University in partial Fulfillment of the Requirements
for the Degree of Master
in Statistics June 2010
Hsinchu, Taiwan, Republic of China
臨床試驗之連續性滿足點
臨床試驗之連續性滿足點
臨床試驗之連續性滿足點
臨床試驗之連續性滿足點 Phase II/III 調適性
調適性
調適性
調適性設計
設計
設計
設計
研究生:劉佳佩 指導教授:蕭金福 博士 國立交通大學統計學研究所 中文摘要 中文摘要 中文摘要 中文摘要 醫藥研發是非常具有風險、複雜、高成本又費時的過程。其中大部分的時間 和經費是用於臨床試驗。因此,我們需要更有效率且可靠的臨床試驗方法,來分 析資料及評估藥物的風險和效用,以減少病人樣本數,縮短發展期間,進而降低 藥物開發成本。在本文中,我們發展一個連續性滿足點的 phase II / III 試驗設計, 在 phase II 試驗中評估不同劑量的試驗藥物與對照組兩兩之間比較。在 phase II 試驗,隨機分派病人接受一個試驗藥物劑量或對照組。如果一個或一些劑量有統 計顯著的療效優於對照組,這些劑量將被選中進入 phase III 試驗。此外,病人在 被選中的劑量組與對照組將繼續進入 phase III 試驗。同時也將招募新的病人,隨 機分配至所選的藥物劑量組或對照組。我們會計算每階段的檢定統計量之臨界 值,以確定各藥物劑量在試驗過程中是否應被淘汰或被選擇,並計算合適的樣本 大小以方便招募病人。在我們的設計中,由於將傳統的 phase II 試驗及 phase III 試驗合併成單一的試驗,同時,在 phase II 試驗收集的資料也會進入 phase III 試 驗做最後的分析,因此可以減少樣本數量和節省試驗時間。
關鍵詞 關鍵詞 關鍵詞
An Alternative Phase II/III Design in Clinical Trials for Continuous
Endpoint
Student:Chia-Pei Liu Advisor:Dr. Chin-Fu Hsiao
Institute of Statistics National Chiao Tung University
Abstract
The pharmaceutical development is very risky, complex, costly, and time-consuming. Much of time and costs were spent in clinical trials. Hence, we need the methods which can be more efficient and reliable to minimize sample size, shorten the period of development duration, and thus reduce the cost for drug development. In this paper, we demonstrate the phase II/III design for continuous endpoint based on pariwise comparisons at the phase II stage when evaluating the efficacy of drugs. For the phase II stage, patients are randomly assigned to receive either one of the several doses of the test drug or to the control group. If one or some doses are declared to have a statistically significantly superior efficacy over control, these doses will be selected for the phase III trial. In addition, the patients in the selected doses and control groups will be continued to the phase III stage. Also new patients will be recruited and randomized to receive either the selected doses of the test drug or to the control group. We will find the critical value at each stage to determine whether the treatment should be dropped out or selected in the process of the trials, and compute the required sample size for facilitating recruitment of patients. In our design, since we integrate the traditional phase II and III trials into a single trial, and the data collected from phase II stage will also be included into the final analysis, sample size reduction and trial time saving may be possible.
誌謝
誌謝
誌謝
誌謝
能完成這篇論文,我十分感謝指導教授蕭金福老師給予的細心教導,老師在 百忙之中總是不厭其煩地給我寶貴的意見與方向,才使這篇論文能充實嚴謹的順 利完成。同時謝謝鄒小蕙老師,在我發生困惑時,協助我解決問題;感謝翁賢學 長提供的經驗和想法;在做此論文的過程中,謝謝敏琪陪伴我一起成長努力;以 及感謝口試委員們的建議,使得論文更加完善。 在念研究所的這些日子,感謝所上所有老師們的認真教學,謝謝郭姊的幫忙 還有處理所上的事務。最後謝謝家人和朋友們的鼓勵與支持,讓我更有信心完成 這篇論文,謝謝同學們在這段期的照顧,不論是課業上或是生活上幫助我很多事 情,雖然相處的時間不長,但我會永遠想念這兩年來和大家一起努力、一起相聚 的美好回憶。 劉佳佩 謹誌于 國立交通大學統計研究所 中華民國九十九年六月
Content
中文摘要﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ i Abstract ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ ii 誌謝﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ iii Content ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ iv 1. Introduction ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ 12. A Seamless Adaptive Phase II/III Design ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ 4
3. Results ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ 9 4. Discussion ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ 12 List of Tables ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ 15 List of Figures ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ 21 Appendix ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ 24 References ﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒﹒ 27
1. Introduction
The pharmaceutical development is very risky, complex, costly, and time-consuming. Much of time and costs were spent in clinical trials. Even if there is a better understanding of disease etiology and higher technology in medical production, the success rate of drug development has been low. One of the probable reasons is that the current methods used for developing new drugs may not be practicable. Hence, we need the methods which can be more efficient and reliable to minimize sample size, shorten the period of development duration, and thus reduce the cost for drug development.
In the traditional phase II and phase III design, there exists a lead period between phase II and phase III trials. In addition, the data collected in the phase II trial are not used in the phase III trial. Adaptive seamless phase II/III designs have been regarded as a feasible way in which shortening the trial time may be possible. More specifically, an adaptive seamless phase II/III design combines the traditional separated trials (phase II and phase III trials) into a single trial, and use data from patients enrolled before and after the adaptation in the final analysis (Maca et al. 2006). In a seamless phase II/III design, the phase II stage corresponds to the learning phase to evaluate whether a new medication is effective and choose the best doses for confirmatory phase, and the phase III stage then corresponds to the confirmatory phase to confirm its effectiveness and evaluate its safety. Since the data collected from phase II stage will also be included into the final analysis, sample size reduction and trial time saving may be possible.
have been proposed. Simon (1989) and Tsou et al. (2008) developed two-stage screening designs for phase II which can minimize the expected sample size and stop early if the new regimen has low activity subject to constraints on the size of the type I error rate and power for discrete and continuous efficacy endpoints respectively. Schaid et al. (1990) proposed the phase II/III designs based on pariwise comparisons for survival endpoints for cancer drugs. The design allows multiple treatments to be tested at the same time and bases the determination to proceed from the phase II trial on the same clinical endpoint evaluated in the same population as the phase III trial. A concurrent control group is also treated. This design may offer a substantial saving when the hazard rate is large relative to the patient accrual rate, a situation often encountered in clinical trials in advanced cancer. Scher et al. (2002) used this design for castrate metastatic prostate cancer for which multiple regimens appear to have similar activity at this time. Posch et al. (2005) described a general formulation of the adaptive testing procedure in the context of treatment selection. In addition, they proposed multiplicity adjusted p-values, introduced simultaneous confidence bounds, and investigated the statistical properties of point estimates. Bischoff et al. (2005) developed a two-stage adaptive design with a minimal number of patients that controls the type I error rate, and achieves a required power to detect a given clinically relevant difference in means, and controls the probability of wrong selection. The information received about the variance is used to determine the number of patients on the selected treatment and control for the second stage. Maca et al. (2006) have proposed the concept of adaptive seamless phase II/III designs and describe the statistical methodologies related to adaptive seamless designs. They also describe the decision process involved with seamless designs and present some illustrative examples.
An adaptive seamless phase II/III design is to conduct the learning phase trial (phase II stage) and the confirmatory phase trial (phase III stage) simultaneously under the same protocol and the same study population. Therefore, it can eliminate the lead time between the separated trials, and thus reduce the sample size required and shorten the drug development time. In this paper, we demonstrate the phase II/III design for continuous endpoint based on pariwise comparisons at the phase II stage when evaluating the efficacy of drugs. The goal is to find the critical value at each stage to determine whether the treatment should be dropped out or selected in the process of the trials, and compute the required sample size for facilitating recruitment of patients. In our design, both the data from the learning phase and the confirmatory phase will be used in the final analysis. We also compare our design with the traditional separated phase II and phase III design. In this paper, a seamless adaptive phase II/III design based on continuous efficacy endpoint is described in Section 2. Some numerical results of our proposed design are provided in Section 3. Final remarks and discussion are given in Section 4.
2. A Seamless Adaptive Phase II/III Design
Our main goal is to compare a test drug for several different doses with a control group based on some continuous efficacy endpoint. Here, we assume that the total number of doses is K. For the phase II stage, patients are randomly assigned to receive either one of the K doses of the test drug or to the control group. We aim to compare each dose of the test drug with the control group, and the Bonferroni correction is employed to adjust p-values for multiple comparisons. If one or some doses are declared to have a statistically significantly superior efficacy over control, these doses will be selected for the phase III trial. In addition, the patients in the selected doses and control groups will be continued to the phase III stage. Also new patients will be recruited and randomized to receive either the selected doses of the test drug or to the control group. The final analysis includes the data of the selected doses and control groups from both phase II and phase III stages.
Let n2 be the number of patients recruited for each of K+1 groups at the phase
II stage. LetY0jand Yir be the efficacy responses for patient j receiving control group
and patient r receiving the ith dose group respectively, i=1,2,K,K, j=1,2,K,n2, 2
2 1, , ,n
r= K . We assume that the efficacy responses are continuous and
(
2)
0 0 ~N µ ,σ Y iid j and(
)
2 , ~ µi σ iid ir N Y ,where N
(
µ,σ2)
represents the normal distribution with mean µ and varianceσ2. Here we also assume σ2is known at the design stage. The selection of the doses is based on the following hypothesis:The observed sample means for the control group and the ith experimental group based on n2 patients can be respectively derived by
∑
= = 2 1 2 0 II 0 n j j n Y Y and∑
= = 2 1 2 II n j ij i n Y Y , i=1,2,K,K. Let ∆i =µi−µ0 and II ˆ i∆ be the estimate of ∆ . Then i
II 0 II II ˆ Yi Y i = − ∆ and ∆ ∆ 2 2 II 2 , ~ ˆ n N i i σ ,i=1,2,K,K. Let 2 2 II II 2 ˆ n T i i σ ∆ = . It follows that ∆ 1 , 2 ~ 2 2 II n N T i i σ ,i=1,2,K,K. (2)
At the phase II stage, if TiII is less than a given value C1, for all i=1,2,K,K, or any one of TiII is greater than C2, we will stop the trial and further clinical development will not be considered. The former indicates that all of the doses for test product are futile, whereas the latter says that there exists at least one experimental treatment regimen to have overwhelming advantage. On the other hand, we will continue to recruit patients to the control group and all experimental groups for whichC1≤TiII ≤C2for the phase III stage.
Let n3 be the number of patients required for each of groups selected for the phase III stage. Similarly, the observed sample means based on n3 patients for the control group and the ith experimental group selected for the phase III stage can
respectively be expressed as
∑
+ + = = 3 2 2 1 3 0 III 0 n n n j j n Y Y and∑
+ + = = 3 2 2 1 3 III n n n j ij i n Y Y , for some i, i=1,2,K,K.In addition, the overall sample means based on n2+n3 patients for the control group and the ith experimental group selected the phase III stage are respectively given by
∑
+ = + = 3 2 1 2 3 0 0 n n j j n n Y Y and∑
+ = + = 3 2 1 2 3 n n j ij i n n Y Y , for some i, i=1,2,K,K. Let 3 2 III 0 III III 2 n Y Y Ti i σ −= . Let Ti be the test statistic for the final analysis. Then
, 2 3 2 2 0 n n Y Y Ti i + − = σ and we have . 3 2 III 3 II 2 n n T n T n Ti i i + + =
The ith dose is declared to be superior to the control group if Ti > C3.
In our design, each dose for the test product is compared to the control group, and thus we define α to be the pairwise alpha-error for each comparisons and Kα to be the Bonferroni approximation of the overall alpha-error. Consequently, the overall probability of “accepting” the ith dose in both stages with the true parameters∆ is a i function of σ,∆i,n2,n3,C1,C2,andC3, and it is given as
(
)
(
)
( )
(
)
, , , , , , , II 3 2 II 3 2 1 3 2 2 1 II xP T C T x dx f C T P C C C n n i i C C T i i i i i > + > = = ∆ ∆ ∆∫
σ φ (3) where II( )
⋅ i Tpower 1-β given ∆i = ∆ and the pairwise alpha-error α can be written as
1−β=φ
(
σ ,∆,n2,n3 ,C1 ,C2 ,C3)
, (4) andα =φ
(
σ ,0 ,n2 ,n3 ,C1 ,C2,C3)
(5) respectively. The equation (3) can be re-expressed as(
)
(
)
. 2 2 2 exp 2 1 2 1 , , , , , , 3 2 3 2 3 3 2 3 2 2 2 2 2 2 3 2 1 3 2 2 1 dx n x n n n n n C n x n C C C C n n i C C i i i ∆ + + + − Φ ∆ − − + ∆ − Φ − = ∆∫
σ σ π σ σ φFurthermore, the expected total sample size, E(N), under the null hypothesis, can be evaluated by ( ) ( 1)
{
(
)(
1)
(
)
}
, 1 2 2 3 0 2 j K j p j K n j n n p n K N E∑
= − + + + + + = (6)where p0 denotes the probability of stopping accrual at the phase II stage, andpj
( j=1,2,K,K) is the probability that accrual will continue for the standard treatment and j of the experimental treatments. The derivations of p0 and pj can be found in
the Appendix.
Under the specification of design parameters ∆, σ, α, β, and C1, the proposed
constraints of pairwise type I and II error rates given in (4) and (5) and to minimize the expected total sample size (6). In our design, C1 should be pre-specified. The
determination of C1 should meet the minimal clinically meaningful requirement that
an investigator would need to observe before continuing accrual onto the phase III stage. Furthermore, for convention,C3 is chosen as 3 =Φ−1
(
1−α)
.C In other words,
we consider C3 as if one-stage design was conducted and the Bonferroni procedure was applied to the K multiple treatment comparisons.
3. Results
In this section, we are at the position to give some examples. Given (Kα ,1−β) = (0.05, 0.8), (Kα ,1−β) = (0.025, 0.8), Tables 1-6 illustrate the seamless adaptive design for different combinations of design parameters with K=1, K=2, and K=3, respectively. For each K, we consider various combinations of values for C1=0,
1
C =0.5, σ =13, 15, and 17, ∆=5, 6, and 7. The tabulated results include the early stopping upper boundary for concluding efficacy for the test drug at the phase II stage (C2), the critical value that would reject the test drug at the phase III stage (C3), the required sample size per group for the phase II stage (n2), the required sample size per group for the phase III stage (n3), the expected total sample size (E(N)), numbers of sample sizes required for the traditional phase II and phase III trials (n2' , and n3', respectively), and the ratios of the maximum of the total sample size for our phase II/III design vs. the maximum of the total sample size for the traditional designs (ratio). Here the total sample sizes per group required for traditional phase II and phase III design are derived by
(
)
, 2 2 2 ' 2 ∆ + = σ Zα Zβ n and(
)
2 2 ' 3 2 ∆ + = σ Zα Zβ n respectively.For instances, Table 1 displays the results for Kα=0.05, 1−β =0.8 and K=1. The first line considers the case of C1=0, σ=13, and ∆=5. That is, only one dose
group for the test product is included in the drug development. In this case, the phase II stage needs to recruit 26 patients for the control group and the experimental group. When the study is completed at the phase II stage, if the observed value of
II 1
T exceeds 2.37, the trial is terminated and we declare that the experimental treatment regimen to have overwhelming advantage. If T1II is less than 0, the trial is also terminated and we conclude the test product is futile. On the other hand, if 0≤ II ≤
1
T 2.37, the trial continues to the phase III stage to enroll additional 66 patients for the control group and the experimental group. After the recruitment of the patients at the phase III stage is completed and if the observed overall value of T1 based on the cumulative data of n1 + n2 obtained at the end of the trial does not exceed 1.64,
then the lack of efficacy of the test drug is concluded. However, on the other hand, if the observed overall value of T1 is greater than 1.64, the test drug is declared to be superior to the control group. In addition, the numbers of sample sizes required for the traditional phase II trial and phase III trial are respectively 84 and 84 per group. Subsequently, it can be seen from Table 1 that the total sample size required for our phase II/III design can be reduced by around 45% compared with the traditional design.
As noted from Tables 1-6 that, for fixed K, C1, and σ, as ∆ increases, the corresponding required sample size decreases. This makes intuitive sense since the larger the effect size (∆/σ ), the smaller the required sample size. It can be seen that as C1 increases, both C2 and the expected total sample size decrease. This phenomenon occurs because we will spend more type I error rate and power for early stopping for efficacy, and thus the value of C2 is reduced. Also the more stringent (large) the C1 is, the larger sample size required for each stage. On the other hand, larger C will cause that the ineffective doses will be quickly eliminated during the
phase II stage. Doing so will increase the probability of early termination for futility at the phase II stage and subsequently reduce the expected total sample size.
Given (Kα ,1−β) = (0.05, 0.8), Figures 1-3 show the simulations comparing our proposed design with the traditional design in terms of success rates for K=1,
σ = 13, 15, and 17, ∆=5 ,6,and7, C1 =0, and 0.5 respectively. For example, given 0
1 =
C , σ =13, and ∆=5, we can derive that n2 =26, n3= 66, C2 = 2.37, = 3 C 1.64, ' = 2 n 84, and ' = 3
n 84. We assume that the true values of ∆ are 0, 0.25, 0.5, 0.75, …, 9 respectively. The success rates are obtained from simulations of 10000 replicates, and the simulation results are presented in Figure 1. From Figure 1, when∆=5, the success rate of our proposed design is close to the desired power 0.8. Also it can also be seen that our proposed design performs better than the traditional design. Other simulation results with different combinations of design parameters can be seen from Figures 2 and 3. All figures show the same phenomenon as Figure 1.
4. Discussion
In this paper, we developed a seamless phase II/III design based on continuous endpoint to evaluate the efficacy of a test drug. More specifically, we integrate the traditional phase II and III trials into a single trial. One attractive feature in our design is that the phase II and phase III trials are conducted in the same protocol with the same inclusion/exclusion criteria, the same study design, the same control, the same methods for evaluation, and the same efficacy/safety endpoints. This can avoid the difficulties arising from the current phase II and III paradigm including different patient populations recruited for phase II and phase III trials and possible different primary efficacy endpoints used in the phase II and III trials.
Another attractive feature is that our proposed design can shorten the time of clinical development because there is no lead time between the learning phase (phase II) and the confirmatory phase (phase III). Also data collected at the phase II stage are combined with those data obtained at the phase III stage for final analysis. As a result, the total sample size might be reduced and thus considerably valuable resource and cost can be saved.
Another point we wish to make is that choosing C1 is rather critical. In fact, the
determination of C1 should meet the minimal clinically meaningful requirement that
an investigator would need to observe before continuing accrual onto the phase III stage. Also the value of C1 should be stringent enough that ineffective doses should be
quickly eliminated during the phase II stage. It can be expected that larger value of C1
will lead to higher probability of early stopping. Also larger value of C1 can also
Note that when both powers in the traditional phase II and phase III designs are assumed to be 0.8, then the actual power is 0.64 (0.8×0.8) if phase II and III trials are conducted separately. However the power in our proposed phase II/III design is maintained at 0.8. Consequently, our proposed phase II/III design can derive more power than the traditional design. In the same way, suppose the type I error rates in the traditional phase II and phase III designs are both assumed to be 0.05, then the actual type I error rate is 0.0025 (0.05×0.05) if phase II and III trials are conducted separately. But the type I error rate in our proposed phase II/III design is controlled at 0.05 level. Hence, conducting phase II and III trials separately is more conservative than our proposed phase II/III design. We can clearly see the phenomenon in the Figures 1-3.
Although our proposed design has many advantages, not all clinical development can be conducted by such designs. Maca et al. (2006) outline some criteria for determining the feasibility for this type of design: endpoints and enrollment, and clinical development time. In our phase II/III design, prior to the interim analysis at which the dose to be continued will be chosen, there will be a period of the study during which some patients have been randomized but have not yet been followed long enough to reach the endpoint for evaluation. When the time needed to reach this endpoint is short relative to the total enrollment time of the study, enrollment can still continue uninterrupted with relatively few patients enrolled during this “transition” period. Even though patients enrolled during this period and randomized to doses that will not be continued will not be providing direct evidence for the comparison of the selected dose vs. the control at the phase III stage, they can be used to understand better the dose response and safety profile. If the endpoint
duration is too long, it could cause inefficiency because many patients will need to be randomized during this lead period between phase II stage and phase III stage. But if we suspend to recruit patients at the lead period, it may cause the trial to disrupt. In such case, a surrogate marker might be considered. They suggested using well-established and well-understood endpoints or surrogate marker when executing adaptive seamless designs. Also, an adaptive seamless design is not suitable if the target of the phase II is to decide a primary endpoint into phase III. If the seamless trial is the only pivotal trial, it is clear that the development time can be reduced. If the phase II/III trial is one of two required pivotal trials, the second pivotal trial should be completed close to the time the seamless study is completed. That is, the second pivotal trial should be started right after the phase II interim analysis in the phase II/III design. There will be more time needed for planning, development, and health authority review for such a design. Consequently, this additional time must be included into the evaluation of overall development time.
List of Tables
Table1. K=1, (Kα ,1−β) = (0.05, 0.8) 1 C σ ∆ C2 C3 n2 n3 E(N) ' 2 n n3' ratio 0 13 5 2.37 1.64 26 66 116.82 84 84 0.55 6 2.37 1.64 18 46 81.19 59 59 0.54 7 2.37 1.64 14 34 61.40 43 43 0.56 15 5 2.36 1.64 35 88 156.41 112 112 0.55 6 2.37 1.64 24 61 107.91 78 78 0.54 7 2.37 1.64 18 45 80.21 57 57 0.55 17 5 2.36 1.64 45 113 200.93 143 143 0.55 6 2.37 1.64 31 79 139.58 100 100 0.55 7 2.37 1.64 23 58 102.97 73 73 0.55 0.5 13 5 2.11 1.64 35 70 110.77 84 84 0.63 6 2.11 1.64 24 48 75.96 59 59 0.61 7 2.11 1.64 18 36 56.97 43 43 0.63 15 5 2.11 1.64 47 92 147.56 112 112 0.62 6 2.11 1.64 32 64 101.27 78 78 0.62 7 2.11 1.64 24 47 75.38 57 57 0.62 17 5 2.11 1.64 60 119 189.25 143 143 0.63 6 2.11 1.64 42 82 131.75 100 100 0.62 7 2.11 1.64 31 61 97.53 73 73 0.63 ' 2 n and ' 3n denotes the total sample sizes per group required for traditional phase II and phase III design respectively.
Table2. K=2, (Kα ,1−β) = (0.05, 0.8) 1 C σ ∆ C2 C3 n2 n3 E(N) ' 2 n n3' ratio 0 13 5 2.72 1.96 30 84 228.43 107 107 0.53 6 2.72 1.96 21 58 158.60 74 74 0.53 7 2.73 1.96 15 43 115.88 55 55 0.53 15 5 2.71 1.96 40 111 302.88 142 142 0.53 6 2.72 1.96 28 77 210.90 99 99 0.53 7 2.72 1.96 21 57 156.95 73 73 0.53 17 5 2.70 1.96 51 143 388.55 182 182 0.53 6 2.71 1.96 36 99 271.13 127 127 0.53 7 2.72 1.96 26 73 198.31 93 93 0.53 0.5 13 5 2.48 1.96 38 87 204.07 107 107 0.58 6 2.48 1.96 27 61 144.17 74 74 0.59 7 2.48 1.96 20 45 106.61 55 55 0.59 15 5 2.47 1.96 51 116 273.05 142 142 0.59 6 2.48 1.96 35 81 188.86 99 99 0.59 7 2.48 1.96 26 59 139.10 73 73 0.58 17 5 2.47 1.96 65 149 349.16 182 182 0.59 6 2.47 1.96 45 104 242.65 127 127 0.59 7 2.48 1.96 33 76 177.69 93 93 0.59 ' 2
n and n3' denotes the total sample sizes per group required for traditional phase II and phase III design respectively.
Table3. K=3, (Kα ,1−β) = (0.05, 0.8)
1 C σ ∆ C2 C3 n2 n3 E(N) ' 2 n n3' ratio 0 13 5 2.88 2.13 31 96 337.89 120 120 0.53 6 2.89 2.13 22 67 237.31 83 83 0.54 7 2.89 2.13 16 49 173.21 61 61 0.53 15 5 2.87 2.13 41 127 446.89 159 159 0.53 6 2.88 2.13 29 89 314.31 111 111 0.53 7 2.89 2.13 21 65 228.86 81 81 0.53 17 5 2.86 2.13 53 163 574.98 204 204 0.53 6 2.87 2.13 37 113 399.73 142 142 0.53 7 2.88 2.13 27 84 295.17 105 105 0.53 0.5 13 5 2.65 2.13 39 100 298.64 120 120 0.58 6 2.66 2.13 27 69 206.45 83 83 0.58 7 2.66 2.13 20 51 152.78 61 61 0.58 15 5 2.65 2.13 52 132 396.23 159 159 0.58 6 2.65 2.13 36 92 275.24 111 111 0.58 7 2.66 2.13 27 68 205.03 81 81 0.59 17 5 2.65 2.13 67 170 510.33 204 204 0.58 6 2.65 2.13 47 118 356.29 142 142 0.58 7 2.66 2.13 34 87 260.12 105 105 0.58 ' 2
n and n3' denotes the total sample sizes per group required for traditional phase II and phase III design respectively.
Table4. K=1, (Kα ,1−β) = (0.025, 0.8)
1 C σ ∆ C2 C3 n2 n3 E(N) ' 2 n n3' ratio 0 13 5 2.67 1.96 28 88 143.34 107 107 0.54 6 2.68 1.96 20 61 100.56 74 74 0.55 7 2.69 1.96 15 45 74.68 55 55 0.55 15 5 2.67 1.96 38 117 192.10 142 142 0.55 6 2.68 1.96 26 81 132.40 99 99 0.54 7 2.68 1.96 19 60 97.56 73 73 0.54 17 5 2.65 1.96 48 149 243.82 182 182 0.54 6 2.67 1.96 34 104 171.21 127 127 0.54 7 2.68 1.96 25 77 126.43 93 93 0.55 0.5 13 5 2.45 1.96 37 93 130.06 107 107 0.61 6 2.45 1.96 26 64 90.59 74 74 0.61 7 2.46 1.96 19 47 66.34 55 55 0.60 15 5 2.44 1.96 49 123 172.12 142 142 0.61 6 2.45 1.96 34 86 119.84 99 99 0.61 7 2.45 1.96 25 63 87.99 73 73 0.60 17 5 2.44 1.96 63 158 221.17 182 182 0.61 6 2.45 1.96 44 110 154.29 127 127 0.61 7 2.45 1.96 32 81 112.83 93 93 0.61 ' 2
n and n3' denotes the total sample sizes per group required for traditional phase II and phase III design respectively.
Table5. K=2, (Kα ,1−β) = (0.025, 0.8)
1 C σ ∆ C2 C3 n2 n3 E(N) ' 2 n n3' ratio 0 13 5 2.97 2.24 32 105 270.09 129 129 0.53 6 2.98 2.24 22 73 187.06 90 90 0.53 7 2.99 2.24 16 54 137.56 66 66 0.53 15 5 2.95 2.24 42 139 356.40 172 172 0.53 6 2.97 2.24 30 97 250.83 119 119 0.53 7 2.98 2.24 22 71 183.74 88 88 0.53 17 5 2.94 2.24 54 178 456.97 220 220 0.53 6 2.96 2.24 38 124 319.56 153 153 0.53 7 2.97 2.24 28 91 234.89 113 113 0.53 0.5 13 5 2.76 2.24 40 110 236.03 129 129 0.58 6 2.76 2.24 28 76 164.19 90 90 0.58 7 2.77 2.24 20 56 119.10 66 66 0.58 15 5 2.75 2.24 53 146 312.95 172 172 0.58 6 2.76 2.24 37 101 217.54 119 119 0.58 7 2.77 2.24 27 75 160.13 88 88 0.58 17 5 2.74 2.24 68 186 400.06 220 220 0.58 6 2.75 2.24 47 130 278.10 153 153 0.58 7 2.76 2.24 35 96 206.27 113 113 0.58 ' 2
n and n3' denotes the total sample sizes per group required for traditional phase II and phase III design respectively.
Table6. K=3, (Kα ,1−β) = (0.025, 0.8)
1 C σ ∆ C2 C3 n2 n3 E(N) ' 2 n n3' ratio 0 13 5 3.10 2.39 33 117 393.99 142 142 0.53 6 3.12 2.39 23 81 273.42 99 99 0.53 7 3.13 2.39 17 60 202.41 73 73 0.53 15 5 3.09 2.39 44 155 522.98 189 189 0.53 6 3.11 2.39 30 108 361.85 131 131 0.53 7 3.12 2.39 22 79 264.95 97 97 0.52 17 5 3.07 2.39 56 198 667.10 243 243 0.52 6 3.09 2.39 39 138 464.96 169 169 0.52 7 3.11 2.39 29 102 344.42 124 124 0.53 0.5 13 5 2.91 2.39 41 121 339.19 142 142 0.57 6 2.92 2.39 29 85 239.11 99 99 0.58 7 2.92 2.39 21 62 173.82 73 73 0.57 15 5 2.90 2.39 54 161 449.02 189 189 0.57 6 2.91 2.39 38 112 314.17 131 131 0.57 7 2.92 2.39 28 83 232.22 97 97 0.57 17 5 2.89 2.39 69 207 575.48 243 243 0.57 6 2.90 2.39 48 144 400.45 169 169 0.57 7 2.91 2.39 36 106 297.50 124 124 0.57 ' 2
n and n3' denotes the total sample sizes per group required for traditional phase II and phase III design respectively.
List of Figures
Figure1. Simulated success rates for the case of K=1,σ =13, (Kα,1−β) = (0.05, 0.8).
Figure2. Simulated success rates for the case of K=1,σ =15, (Kα,1−β) = (0.05, 0.8).
Figure3. Simulated success rates for the case of K=1,σ =17, (Kα,1−β) = (0.05, 0.8).
Appendix
Since ≡ = , 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 , ~ 2 2 2 2 3 1 0 II II 1 II 0 II K K K K n N n N Y Y Y I µ Y σ σ µ µ µ L M O M L M M L L M M ,and from (2), we can derive the joint distribution of
[
T T TK]
TII II 2 II 1 K : . , ~ 1 0 0 1 0 1 0 1 0 0 1 1 2 1 2 1 T 2 2 II II II 1 II 0 2 2 II 0 II II 0 II 2 II 0 II 1 2 2 II II 2 II 1 = − − − = − − − = AA Aµ Y A n N Y Y Y n Y Y Y Y Y Y n T T T K K K K σ σ σ M L M O M M M L L M M We can write . 1 2 1 2 1 2 1 2 1 1 2 1 2 1 2 1 2 1 1 2 1 2 1 2 1 2 1 1 , 2 1 2 1 1 1 1 2 1 1 1 1 2 1 1 1 1 2 2 1 , 2 1 1 0 0 0 1 0 0 0 1 1 1 1 1 0 0 1 0 1 0 1 0 0 1 1 2 1 , 2 1 , 0 0 2 0 1 2 2 0 0 2 0 1 2 2 2 2 2 2 0 0 2 0 1 2 2 T 2 2 − − − ≡ − − − ≡ − − − − − − − − − ≡ L M O M L M M L L M L M O M L M M L L M L M O M M L L L L M O M M M L L M µ µ µ µ µ µ σ µ µ µ µ µ µ σ σ σ µ µ µ µ µ µ σ σ K K K K K K K n N n N n n n N n N Aµ AA
Under the null hypothesis, ∆i =0,
(
,)
. 1 2 1 2 1 2 1 2 1 1 2 1 2 1 2 1 2 1 1 2 1 2 1 2 1 2 1 1 , 0 0 0 ~ 0 H II II 2 II 1 Σ 0 K K K N N T T T ≡ L M O M L M M L L M MFrom above, the correlation coefficient ρ is 2
1 . The joint probability density
function for
[
T T TK]
T II II 2 II 1 K is( )
( )
− = − 2 exp det 2 1 ) , , ( 1 2 1 x Σ x Σ T K K x x f π K .Under H , the probability of keeping on recruiting after the phase II stage for 0 control group and j of the experimental group(s) is
, 2 2 exp 1 1 1 ) , , ( 2 1 1 2 1 1 2 1 2 1 1 1 dy y y C y C y C j K dx dx x x f j K p j K j K K j C C C C j K C C j π ρ ρ ρ ρ ρ ρ − − − Φ − − Φ − − − Φ = = − ∞ ∞ − − ∞ − ∞ −
∫
∫
∫
∫
∫
K K 3 2 1 L 3 2 1 Lwhere j=1,2,K,K. The probability of stopping to recruit after the phase II stage is
. 2 2 exp 1 1 2 2 exp 1 ) , , ( 1 ) , , ( 2 2 2 1 1 1 1 1 0 2 2 1 1 dy y y C dy y y C dx dx x x f dx dx x x f p K K K K K C C K K K C C
∫
∫
∫
∫
∫
∫
∞ ∞ − ∞ ∞ − ∞ − ∞ − ∞ − ∞ − − − − Φ − + − − − Φ = − + = π ρ ρ π ρ ρ K K 3 2 1 L K K 3 2 1 LThe results ofp0 and pj are derived by Gupta (1963). And we have
∑
= = + K j j p p 1 0 1 .The expected total sample size, E(N), under the null hypothesis, is derived as
(
)(
)
(
)
{
}
(
)
j K j j K j p j n n K p j K n j n n p n K N E∑
∑
= = + + + = − + + + + + = 1 3 2 1 2 2 3 0 2 1 1) ( 1 1) ( ) (References
1. Bischoff W, and Miller F. Adaptive two-stage test procedures to find the best treatment in clinical trials. Biometrika. 2005;92:197-212.
2. Gupta SS. Probability integrals of multivariate normal and multivariate t. Ann.
Math. Statist. 1963;34:792-828.
3. Maca J, Bhattacharya S, Dragalin V, Gallo P, and Krams M. Adaptive seamless Phase II/III designs—Background, operational aspects, and examples. Drug
Information Journal. 2006;40:463-473.
4. Posch M, Koenig F, Branson M, Brannath W, Dunger-Baldauf C, and Bauer P. Testing and estimation in flexible group sequential designs with adaptive treatment selection. Stat Med. 2005;24:3697-3714.
5. Schaid DJ, Wieand S, and Therneau TM. Optimal two-stage screening designs for survival comparisons. Biometrika. 1990;77:507-13.
6. Scher HI, and Heller G.. Picking the Winners in a Sea of Plenty. Clinical Cancer
Research. 2002;8:400-404.
7. Simon R. Optimal two-stage designs for phase II clinical trials. Controlled
Clinical Trials. 1989;10:1-10.
8. Tsou HH, Hsiao CF, Chow SC, and Liu JP. A two-stage design for drug screening trials based on continuous endpoints. Drug Information Journal. 2008;42:253-262.