• 沒有找到結果。

Optimal Two-Level Regular Fractional Factorial Block and Split-Plot Designs

N/A
N/A
Protected

Academic year: 2022

Share "Optimal Two-Level Regular Fractional Factorial Block and Split-Plot Designs"

Copied!
21
0
0

加載中.... (立即查看全文)

全文

(1)

Optimal Two-Level Regular Fractional Factorial Block and Split-Plot Designs

BYCHING-SHUICHENG

Department of Statistics, University of California, Berkeley, California 94720, U.S.A.

cheng@stat.berkeley.edu

ANDPI-WENTSAI

Department of Mathematics, National Taiwan Normal University, Taipei 11677, Taiwan pwtsai@math.ntnu.edu.tw

SUMMARY

We propose a general and unified approach to the selection of regular fractional factorial de- signs which can be applied to experiments that are unblocked, blocked, with random or fixed block effects, or have a split-plot structure. Our criterion is derived as a good surrogate for the model-robustness criterion of information capacity. In the case of random block effects, it takes the ratio of intra- and interblock variances into account. In most of the cases, up to 32 runs, we have examined, there exist designs that are optimal for all values of that ratio. Examples of op- timal designs that depend on the ratio are provided. We also demonstrate that our criterion can further discriminate designs that cannot be distinguished by the existing minimum-aberration criteria.

(2)

Some key words: Alias set; Estimation capacity; Information capacity; Minimum aberration; Model robustness;

Wordlength pattern.

1. INTRODUCTION

There has been considerable recent research in the selection and construction of good frac- tional factorial designs under the minimum-aberration criterion proposed by Fries & Hunter (1980). Cheng et al. (1999) provided a statistical justification of this popular criterion and demon- strated that it is a good surrogate for robustness against model uncertainty.

When running an experiment, the experimenters are advised to randomize the experimen- tal runs. Complete randomization is performed when the experimental units have no particular structure. On the other hand, a constrained randomization that respects the structure of the exper- imental units is desirable if such a structure exists. A good example is when the method of block- ing is used to control variations among the experimental units. Proper randomization in this case leads to two error terms, or strata, in Nelder’s (1965) language. Two different variances, called inter- and intrablock variances, respectively, are associated with the two strata. When blocking is successfully done, one expects the intrablock variance to be smaller than the interblock vari- ance. Then a factorial effect, main effect or interaction contrast, that is confounded with a block contrast is estimated with a larger variance than one that is orthogonal to block contrasts. This complicates the issue of selecting a design. One would expect the decision to depend in some way on prior knowledge about the ratio of the two variances if it is available.

Extensions of the minimum-aberration criterion to blocked fractional factorial designs have been proposed by several authors (Sitter et al., 1997; Chen & Cheng, 1999; Cheng & Wu, 2002) under the assumption that the underlying model has fixed block effects. One objective of this article is to propose a general approach that covers the cases of random as well as fixed block

(3)

effects. Following Cheng et al. (1999), we use a measure of model robustness to motivate a criterion for selecting blocked fractional factorial designs. It suffices to say at this point that for models with fixed block effects, one version of our criterion reduces to what was proposed by Chen & Cheng (1999), and it is the same as the usual minimum-aberration criterion for unblocked designs when the inter- and intrablock variances are equal.

When designing a blocked fractional factorial experiment, typically we try to avoid confound- ing main effects with block contrasts. However, for practical considerations, sometimes one must confound certain main effect contrasts with blocks. Such designs, called split-plot designs, arise, for example, when it is difficult or expensive to change the levels of certain factors. Then each of these factors, called whole-plot factors, keeps the same level in each block, called a whole-plot, while the other factors, called subplot factors, are allowed to take different levels in the units, called subplots, within the same whole-plot.

Perhaps because the research on minimum-aberration criteria for blocked fractional factorial designs has been focused on the case of fixed block effects, while for split-plot designs one must assume that the whole-plot effects are random, these two lines of research have been rather dis- connected, even though split-plot designs are a special kind of blocked factorial designs. Huang et al. (1998) and Bingham & Sitter (1999a, b) applied the usual minimum-aberration criterion, for unblocked factorials, to rank fractional factorial split-plot designs. This approach ignores the effect of the block structure on the precision of the estimators of factorial effects. In many situations the usual minimum-aberration criterion cannot distinguish several different split-plot designs even though some are clearly better than the others. Bingham & Sitter (2001) and Muk- erjee & Fang (2002) used additional secondary criteria further to discriminate such designs.

The second objective of this article is to apply the same approach we propose for blocked frac- tional factorial designs to split-plot designs. Our unified approach can also be applied to more

(4)

complicated experiments with more than two strata, including row-column designs, split-split- plot designs, blocked split-plot designs (McLeod & Brewster, 2004), strip-plot designs (Miller, 1997) and more generally those with the block structures studied by Nelder (1965). These will be treated elsewhere.

Only two-level regular fractional factorial designs with orthogonal blocking are considered in this article; the case in which the number of levels is a prime power can be treated similarly.

We make the hierarchical assumption that lower-order effects are more important than higher- order effects and effects of the same order are equally important, a situation in which minimum aberration is a suitable criterion. For simplicity, we further assume that three-factor and higher- order interactions are negligible.

2. RANDOMIZED BLOCK DESIGNS AND EXISTING CRITERIA

A consequence of randomization for a block design is that the vector of observations y can be assumed to have a simple covariance matrix V in which all the observations have the same variance and there are two possibly different covariances depending on whether or not two ob- servations are in the same block (Bailey, 1981). Let b be the number of blocks and let s be the block size. Then V has three eigenvalues, ξ0, ξ1 and ξ2, where ξ0 is the common row sum of V , and under the assumption of no treatment effects, i.e., E(y) is constant, ξ1 = E{(b − 1)−1Pb

i=1s(yi.− y..)2}, and ξ2 = E[{b(s − 1)}−1Pb i=1

Ps

j=1(yij− yi.)2], where yij is the observation on the jth unit in the ith block, yi.=Ps

j=1yij/s, and y..=Pb i=1

Ps

j=1yij/bs. It is clear from these equations that ξ1and ξ2are measures of between-block and within-block vari- ability, and are called inter- and intrablock variance, respectively. Consider contrasts l0yof the observations such that all the observations with the same factor-level combination have the same coefficient. We note that var(l0y) = ξ1l0lif l falls in the eigenspace associated with ξ1, in which

(5)

case we say that the treatment contrast E(l0y) is confounded with blocks; on the other hand, var(l0y) = ξ2l0lif l falls in the eigenspace associated with ξ2, in which case E(l0y) is orthogonal to the blocks. We denote the ratio ξ21by r.

We shall assume that r ≤ 1, i.e., ξ1≥ ξ2. The inequality ξ1> ξ2holds whenever two different observations from the same block have a larger covariance than those from different blocks. One example is when the error term εij associated with the observation on the jth unit in the ith block can be decomposed as εij = βi+ eij, where {βi} and {eij} are mutually uncorrelated random variables with E(βi) = E(eij) = 0, and var(βi) = σB2, var(eij) = σ2e; that is, we have a model with random block effects. In this case, the covariances are σB2 or 0 for two observations in the same or different blocks, respectively. Thus the condition σ2B> σe2, as is often assumed in the literature, is not necessary.

Note that r can be thought of as the efficiency of the estimator of a contrast confounded with blocks relative to that of one orthogonal to blocks. Under models with fixed-block effects, such efficiency is zero since contrasts confounded with blocks are not estimable. Therefore, models with fixed block effects can be thought to have r = 0. On the other hand, r = 1 corresponds to when the inter- and intrablock variances are equal, i.e., blocking does not affect precision of the estimators.

A regular 2n−pfractional factorial design with n two-level factors in 2n−pruns is determined by p independent treatment defining effects. Under such a design, the defining contrast subgroup generated by the p independent effects contains 2p− 1 treatment factorial effects, called defining effects, defining contrasts or defining words. Let Ai(d) be the number of defining effects of length i of a design d. The sequence {A1(d), A2(d), . . .} is called the wordlength pattern of d. A design is said to have minimum aberration if it sequentially minimizes A1(d), A2(d), A3(d), . . ., and so on.

(6)

To divide the 2n−p treatment combinations into 2q blocks each of size 2n−p−q, where q <

n − p, one needs another set of q independent defining effects, called block defining effects, which are independent of the treatment defining effects. It is convenient to think of such a blocked fractional factorial design as a 2n+q−(p+q) design, where the n + q factors are the n treatment factors and q block factors defined by the q additional independent block defining contrasts.

However, the wordlength pattern of this 2n+q−(p+q)design as defined in the previous paragraph is not enough to summarize the properties of a block design. We define Ai,j(d) as the number of defining words in the 2n+q−(p+q)design that contain i treatment factors and j block factors, and let Bi(d) =P

j:j≥1Ai,j(d). All the existing minimum-aberration criteria for blocked fractional factorial designs are based on the two sequences {Ai,0(d)} and {Bi(d)}; see Sitter et al. (1997), Chen & Cheng (1999) and Cheng & Wu (2002). Two criteria relevant to our study, the WCC- criterion proposed by Chen & Cheng (1999) and the W1-criterion proposed by Cheng & Wu (2002), are based on the sequential minimization of components of the following wordlength patterns:

WCC = (3A3,0+ B2, A4,0, 10A5,0+ B3, A6,0, . . .), W1 = (A3,0, A4,0, B2, A5,0, A6,0, B3, . . .).

Note that there are two assumptions underlying these criteria: the block effects are assumed to be fixed, and A1,0 = A2,0 = B1 = 0. When three-factor and higher order interactions are negli- gible, only the first three terms in W1and the first two terms in WCCare relevant. In particular, we call a design W1-optimal if it sequentially minimizes A3,0, A4,0and B2, and WCC-optimal if it sequentially minimizes 3A3,0+ B2and A4,0.

(7)

3. NEW CRITERIA FOR BLOCKED FRACTIONAL FACTORIAL DESIGNS

The minimum-aberration criterion based on WCCwas proposed as a surrogate for the model robustness criterion of maximum estimation capacity (Cheng et al., 1999). For any 1 ≤ k ≤

1

2n(n − 1), define Ek(d) as the number of models containing all the main effects and k two- factor interactions that can be estimated by a design d. Here k can be thought of as the number of active two-factor interactions. Following Cheng et al. (1999), Chen & Cheng (1999) showed that the Ek’s are large if the number of two-factor interactions that are neither aliased with main effects nor confounded with blocks, i.e., 12n(n − 1) − (3A3,0+ B2) is maximized and that these interactions are distributed very uniformly over the alias sets. Once the former is maximized, i.e., 3A3,0+ B2is minimized, the latter can be achieved by making A4,0small. This leads to the first two components of WCC.

Under a regular fractional factorial design with orthogonal blocking, a model with fixed block effects is either not estimable, and thus has a singular information matrix with zero determinant or all its parameters can be estimated with the highest efficiency. Let δkbe the number of candidate models containing all the main effects and k two-factor interactions. Therefore Ek(d)/δk can be regarded as the average efficiency of d over the candidate models. On the other hand, when the block effects are random, or under a randomization model, different estimable models may be estimated with different efficiencies depending on how many factorial effects in the model are confounded with or are orthogonal to the blocks. We use the average efficiency over all the candidate models, called the information capacity by D. X. Sun, in his unpublished 1993 Ph.D thesis from The University of Waterloo, and Li & Nachtsheim (2000), to measure model robustness of a design. The use of the average efficiency over a set of candidate models to select model-robust factorial designs has also been proposed by Tsai et al. (2000) in a different context.

(8)

We assume that no main effect is either a defining effect or confounded with blocks, and that no two main effects are aliased with each other. Then they are estimated with the same preci- sion. Thus we shall concentrate on the efficiencies of the estimators of two-factor interactions, measured by D1/k, where D is the determinant of the information matrix for the two-factor inter- actions, and k is the number of two-factor interactions in the model. For simplicity we normalize the interaction contrasts so that their estimators are of the form l0ywith l0l= 1. For a given de- sign d, we define the information capacity Ik(d) as the average of D1/k over the set of models containing all the main effects and k two-factor interactions. We seek a design that has large Ik. Clearly, under models with fixed block effects, maximizing Ik(d) is equivalent to maximizing Ek(d) as considered in Chen & Cheng (1999).

The 2n− 2ptreatment factorial effects other than those in the defining contrast subgroup are divided into g ≡ 2n−p− 1 alias sets each of size 2p. The effects in 2q− 1 of these alias sets are confounded with blocks. Let f = g − n. Without loss of generality, suppose that the effects in each of the first h alias sets are confounded with blocks but are not aliased with main effects, that those in the next f − h alias sets are neither confounded with blocks nor aliased with main effects, and that each of the last n alias sets contains one main effect. When no main effect is confounded with blocks, as assumed in this section, we have h = 2q− 1. For each i = 1, . . . , g, let mi be the number of two-factor interactions contained in the ith alias set. In the Appendix, we show that Ikis large if

(i) r1/kPh

i=1mi+Pf

i=h+1miis as large as possible, and

(ii) r1/k m1, . . . , r1/kmh, mh+1, . . . , mf are as equal as possible.

(1)

Let u = 12n(n − 1) −Pf

i=h+1miand v =Ph

i=1mi. Then u is the number of two-factor inter- actions that are aliased with main effects or are confounded with blocks, and v is the number of

(9)

two-factor interactions that are not aliased with main effects but are confounded with blocks. To make r1/kPh

i=1mi+Pf

i=h+1mi large is the same as to make u − r1/kv small. On the other hand, a good surrogate for (ii) is to have small r2/kPh

i=1m2i +Pf

i=h+1m2i.

Under the assumption that no main effect is confounded with blocks, we have u = 3A3,0+ B2

and v = B2; therefore, u − r1/kv = 3A3,0+ (1 − r1/k)B2. Since A4,0 = 16{Pg

i=1m2i − n(n − 1)/2}, see (2.2) of Cheng, et al. (1999),Pg

i=1m2i is small if A4,0is small. Then r2/kPh

i=1m2i + Pf

i=h+1m2i would also be small. Thus a good and simple surrogate for maximizing Ik is to minimize 3A3,0+ (1 − r1/k)B2, followed by the minimization of A4,0. We call this two-stage procedure, which replaces B2 in WCC with (1 − r1/k)B2, the Wkr-criterion. The weight 1 − r1/k is imposed to penalize confounding with blocks. The heaviest penalty is imposed when the block effects are fixed, r = 0, since in this case confounding with blocks leads to total loss of information. Indeed Wk0-optimality is the same as WCC-optimality. The penalty decreases as r increases. For r = 1, Wk1 successively minimizes A3,0 and A4,0, as in the usual minimum- aberration criterion for unblocked factorial designs. We shall denote this criterion by WMA, partly to emphasize that it does not depend on k. The following result provides a useful tool for studying Wkr-optimal designs.

THEOREM1. If a design is both Wk0- andWk1-optimal, i.e., it isWCC- andWMA-optimal, then it isWkr-optimal for allk and all 0 ≤ r ≤ 1. The same conclusion also holds for a design that is bothWCC- andW1- optimal.

Proof. Suppose d is both Wk0- and Wk1-optimal. For any other design d0, we need to show that 3A3,0(d) + (1 − r1/k)B2(d) ≤ 3A3,0(d0) + (1 − r1/k)B2(d0), and, if 3A3,0(d) +(1 − r1/k)B2(d) = 3A3,0(d0) + (1 − r1/k)B2(d0), then A4,0(d) ≤ A4,0(d0).

We have 3A3,0(d0) + (1 − r1/k)B2(d0) − {3A3,0(d) + (1 − r1/k)B2(d)} = (1 − r1/k) [{3A3,0(d0) + B2(d0)} − {3A3,0(d) + B2(d)}] + 3r1/k{A3,0(d0) − A3,0(d)}, which

(10)

is nonnegative since both {3A3,0(d0) + B2(d0)} − {3A3,0(d) + B2(d)} and A3,0(d0) − A3,0(d) are nonnegative. Now suppose the above difference is zero. If 0 ≤ r < 1, then 3A3,0(d0) + B2(d0) must be equal to 3A3,0(d) + B2(d), and, for r = 1, we have A3,0(d0) = A3,0(d). In both cases it follows from the Wk0- and Wk1-optimality of d that

A4,0(d) ≤ A4,0(d0). 

Minimizing A4,0is a good, albeit not the best, surrogate for part (ii) of (1), since the role played by r is ignored. A more accurate surrogate for maximizing Ik is to maximize r1/kPh

i=1mi+ Pf

i=h+1mi, followed by the minimization of r2/kPh

i=1m2i +Pf

i=h+1m2i. We call this the W˜kr-criterion. We note that r2/kPh

i=1m2i +Pf

i=h+1m2i can be expressed in terms of some modified and more complicated versions of wordlengths; the details are omitted since the mi

values can be determined in a straightforward manner. One can use ˜Wkr as a primary criterion or as a secondary criterion when Wkr cannot produce a unique optimal design. The advantage of Wkris its simplicity, and, as discussed earlier, it is nicely related to the existing criteria. Our experience indicates that Wkrworks well at least for up to 64-run designs.

The following result similar to Theorem 1 can easily be established.

THEOREM2. If a design is both ˜Wk0- and ˜Wk1-optimal, then it is ˜Wkr-optimal for allk and all 0 ≤ r ≤ 1.

We note that both ˜Wk0 and ˜Wk1 do not depend on k. By Theorem 2, to show that a design is W˜kr-optimal for all k and all 0 ≤ r ≤ 1, it suffices to verify that (i) it maximizesPf

i=1mi, and minimizesPf

i=1m2i among those which maximizePf

i=1mi, and (ii) it maximizesPf

i=h+1mi, and minimizesPf

i=h+1m2i among those which maximizePf

i=h+1mi. Requirement (i) ensures that the design has the largest number of two-factor interactions that are not aliased with main effects and that these two-factor interactions are very uniformly distributed over the alias sets.

(11)

When there are two strata, (ii) further requires that a large number of the two-factor interactions that are not aliased with main effects be estimated in the more precise intrablock stratum, and that these two-factor interactions be also very uniformly distributed over the alias sets.

The practical significance of Theorems 1 and 2 is that one can establish a strong optimality result that holds for all k and r by checking two simple cases that do not involve k. Furthermore, one can use WCC and W1, or WCC and WMA, or ˜Wk0 and ˜Wk1, together to screen out inferior designs. If a design d1 is at least as good as another design d2under both criteria, and is better than d2 under at least one of the two criteria, then we say that d1 dominates d2 with respect to the two criteria. In this case d2 can be dropped from consideration, and we say that it is inadmissible. Typically there may be only a small number of admissible designs. Then we can carry out the simple task of comparing them directly for given k and r. When comparing the admissible designs, we suggest using the Ik(d)-criterion directly. This is because the range of r values for a design to be better than another under Ik(d) may not coincide with that determined by simple surrogates such as Wkr.

Example1. Consider the case n = 13, p = 8 and q = 3, i.e., 32-run designs for 13 factors with the 32 runs grouped into 8 blocks of size 4 each. Let d1, d2and d3be three designs with the following sets of independent defining words:

d1 :{1236, 1247, 1348, 2349, 125t10, 135t11, 235t12, 145t13, 12b1, 13b2, 14b3}, d2 :{1236, 1247, 1348, 2349, 125t10, 135t11, 235t12, 145t13, 13b1, 14b2, 15b3}, d3 :{126, 137, 148, 2349, 1234t10, 235t11, 245t12, 345t13, 23b1, 24b2, 15b3},

where t10, . . . , t13 are treatment factors 10, . . . , 13, and b1, b2, b3 are block factors. Then A3,0(d1) = 0, A4,0(d1) = 55, B2(d1) = 38; A3,0(d2) = 0, A4,0(d2) = 55, B2(d2) = 36;

A3,0(d3) = 4, A4,0(d3) = 39, B2(d3) = 22.

(12)

In view of these wordlengths, d1is worse than d2with respect to WCCand the two designs are tied under WMA. We note that d1is also worse than d2with respect to ˜Wk0and the two designs are tied under ˜Wk1. By the proofs of Theorems 1 and 2, d1is worse than d2with respect to Wkrand W˜krfor all k and all 0 ≤ r < 1, and they are tied for r = 1. These two designs do have identical information capacities when r = 1, and, for r < 1, Ik(d1) < Ik(d2) for all the k and r values we have examined. On the other hand, d2is better than d3with respect to WMAand is worse than d3

with respect to WCC. Therefore d2is better than d3 with respect to Wkr except when r is small.

Likewise, d2 is better than d3 with respect to ˜Wk1 and is worse than d3 with respect to ˜Wk0. A complete search of all the designs with n = 13, p = 8, q = 3 shows that d2 and d3 are the only two admissible designs with respect to WMAand WCC. They are also the only admissible designs with respect to W1 and WCC as well as ˜Wk0and ˜Wk1. It follows from Theorems 1 and 2 that d3 is Wkr- and ˜Wkr-optimal for small r values and d2 is Wkr- and ˜Wkr-optimal for larger r values.

A comparison of the information capacities of these two designs shows that Ik(d2) > Ik(d3) except when r is small.

Xu (2006) and Xu & Lau (2006) constructed all minimum-aberration blocked two-level frac- tional factorial designs with 8, 16, 32 runs and those with 64 runs and up to 32 factors with respect to several criteria. They found that in most cases these criteria lead to the same minimum- aberration design. In particular, for 16-run designs, except for (n, p, q) = (5, 1, 1) and (5, 1, 2), W1 and WCCproduce the same minimum-aberration designs. These designs are Wkr- and ˜Wkr- optimal for all k and all 0 ≤ r ≤ 1. For both of the two cases (n, p, q) = (5, 1, 1) and (5, 1, 2), the WCC-optimal design is Wkr- and ˜Wkr-optimal for small r values, and the W1-optimal design is Wkr- and ˜Wkr-optimal for larger r values, and these two designs are the only admissible designs with respect to W1 and WCC as well as ˜Wk0 and ˜Wk1. For 32-runs designs, W1 and WCC pro- duce the same minimum-aberration designs except when (n, p, q) = (6, 1, 2), (7, 2, 2), (8, 3, 3),

(13)

(9, 4, 2), (9, 4, 3), (10, 5, 1), (10, 5, 2), (10, 5, 3), (11, 6, 3), (12, 7, 3), (13, 8, 3), (21, 16, 1) and (21, 16, 2). These are also the only cases of 32-run designs where there are more than one ad- missible design with respect to W1 and WCCas well as ˜Wk0and ˜Wk1. In all the twelve cases ex- cept (n, p, q) = (9, 4, 2), the W1- and WCC-optimal designs are the only admissible designs. For (n, p, q) = (9, 4, 2), there is a third admissible design with independent defining effects 12456, 2359, 2578, 2689, 14b1and 15b2.

The numerical computations we have done demonstrate that the criteria introduced in this ar- ticle are good surrogates for maximum information capacity, but we should keep in mind that they are only surrogates. We have examined all 16-run designs to study the consistency of rank- ings under ˜Wk0, ˜Wk1, W1and WCC. Treating those which perform equally well under all the four criteria as the same design, we examined 388 pairs of 16-run designs with the same numbers of treatment factors and blocks. They fall into the following four categories: (i) the two designs perform equally well under both ˜Wk0 and ˜Wk1, but one of them dominates the other with respect to W1 and WCC, two pairs; (ii) one design dominates the other with respect to ˜Wk0 and ˜Wk1, but neither dominates the other with respect to W1 and WCC, three pairs; (iii) one design dominates the other with respect to ˜Wk0 and ˜Wk1 as well as W1 and WCC, 374 pairs; (iv) neither design dominates the other with respect to ˜Wk0and ˜Wk1or W1 and WCC, nine pairs. In category (i), the two designs have identical information capacities for all r and k, and in category (ii), the better design under ˜Wk0and ˜Wk1dominates the other with respect to the information capacity criterion for all the k and r values that we have examined. These results confirm our expectation that ˜Wk0 and ˜Wk1give more accurate comparisons than W1 and WCC. In 336 of the 374 pairs in category (iii), the better design with respect to ˜Wk0 and ˜Wk1, or W1 and WCC, dominates the other under the information capacity criterion for all the k and r values that we have examined. In the 38 exceptional pairs, the better design under ˜Wk0 and ˜Wk1 fails to have larger Ik(d) only for larger

(14)

k’s. When k is large, the information capacity criterion may favour designs with smaller A4. This is because smaller A4 implies fewer pairs of aliased two-factor interactions, and thus possibly more estimable models containing a larger number of two-factor interactions. In category (iv), neither design dominates the other with respect to the information capacity criterion for all r and k.

4. SPLIT-PLOT DESIGNS

The principle we used to derive Wkr and ˜Wkr is applicable to split-plot designs since they are a special kind of blocked fractional factorial design. Again we assume that the two-factor interactions are equally important. In this case the split-plot structure arises purely from some practical needs; for example, it is difficult to change the levels of certain factors or some factors require larger experimental units. This is different from the situation in which there are what Cox (1958, p.?? ) called classification factors, which are included in the experiment primarily to examine their possible interactions with other factors, such as in robust product experiments. A different approach needs to be developed for such experiments and will not be treated here.

Suppose that there are n1 whole-plot factors and n2 subplot factors, each at two levels, and that there are 2n1−p1 whole plots, each containing 2n2−p2 subplots. It is convenient to consider the design as a 2n−pdesign with n = n1+ n2and p = p1+ p2. We use capital letters A, B, . . . to denote whole-plot factors and use lower case letters p, q, . . . to denote subplot factors. We also assume that p1 ≥ 0, but our approach can also be applied to the case where the whole-plot factors are replicated, as discussed in Bingham et al. (2004).

Under a split-plot design, main effects of the whole-plot factors are estimated with the same precision, and main effects of the subplot factors are also estimated with the same precision.

It follows that all the designs have the same overall efficiency for the estimators of the main

(15)

effects of treatment factors. We can therefore concentrate on the efficiencies of the estimators of two-factor interactions, and define the information capacity Ik and derive its surrogates in the same way as in §3 and the Appendix. As in the paragraph following (1), the first step of a good surrogate for maximizing Ikis to minimize u − r1/kv, where in the present context u is the number of two-factor interactions that are aliased with main effects or are estimated in the whole- plot stratum, and v is the number of two-factor interactions that are not aliased with main effects and are estimated in the whole-plot stratum. Since two-factor interactions that are estimated in the whole-plot stratum may be aliased with main effects, the computation of u and v cannot be done as in §3, and a correct form of the Wkr-criterion can no longer be expressed purely in terms of the wordlengths. On the other hand, the ˜Wkr-criterion can be carried over without any change, and is a more accurate surrogate than Wkrfor the information capacity criterion. In fact, there are seven cases of 32-run split-plot designs in which Wkr-optimal designs are not unique but can be discriminated by ˜Wkr. To save space, in this section we shall concentrate on ˜Wkr.

As in §3, the ˜Wkr-criterion first maximizes r1/kPh

i=1mi+Pf

i=h+1mi, and then minimizes r2/kPh

i=1m2i+Pf

i=h+1m2i. Here the first sumPh

i=1is over the alias sets of effects which are not aliased with main effects and are estimated in the whole-plot stratum, and the second sum Pf

i=h+1 is over those which are not aliased with main effects and are estimated in the subplot stratum. Furthermore, Theorem 2 still holds. Therefore, as discussed in the paragraph preceding Example 1, we can use ˜Wk1and ˜Wk0to screen out inadmissible designs.

Example2. Consider the case n1= 5, n2 = 2, p1= p2= 1. Table 4 of Bingham & Sitter (2001) listed two designs which are the best and are equally good according to their approach:

d1 :I = ABCDE = ABpq = CDEpq, d2 :I = ABCE = ABDpq = CDEpq.

(16)

We point out that these two designs also cannot be distinguished by the proper form of Wkr for split-plot designs. As to ˜Wkr, under both d1 and d2, there are 10 alias sets of effects which are estimated in the whole-plot stratum and are not aliased with main effects; the numbers of two- factor interactions in these alias sets are nine 1’s and one 2 for d1, and three 2’s, five 1’s and two 0’s for d2. For the 14 alias sets of effects that are estimated in the subplot stratum and are not aliased with main effects, we have two 2’s, six 1’s and six 0’s for d1, and ten 1’s and four 0’s for d2. The values ofPf

i=1mi,Pf

i=h+1mi,Pf

i=1m2i andPf

i=h+1m2i for d1are 21, 10, 27 and 14, respectively, and those for d2 are 21, 10, 27 and 10. It follows from Theorem 2 that d2

is ˜Wkr-better than d1for all k and all 0 ≤ r < 1, and ties with d1for r = 1. Essentially d2beats d1 because it has a more uniform distribution, over the alias sets, of the two-factor interactions that are not aliased with main effects and are estimated in the more precise subplot stratum. In fact, it can be verified that, for n1 = 5, n2 = 2 and p1= p2= 1, d2is ˜Wkr-optimal for all k and all 0 ≤ r ≤ 1. One can also see that d2 is superior in the sense that all the aliasing between two- factor interactions occurs in the less precise whole-plot stratum. Comparison of the information capacities of d1and d2confirms that d2 is indeed a better design.

Bingham & Sitter (2001) used the usual minimum-aberration criterion for unblocked designs to compare split-plot designs, but since it often leads to more than one nonisomorphic minimum- aberration design, for further discrimination, they proposed the secondary criterion of minimiz- ing the number of subplot two-factor interactions that are estimated in the whole-plot stratum.

This approach leads to a unique optimal design in each of the 16-run and all but 12 of the 32-run cases they had studied. The results are tabulated in Table 3, for 16-run designs, and Table 4, for 32-run designs, of their paper.

Except for one 16-run case, with (n1, n2, p1, p2) = (4, 2, 1, 1), and two 32-run cases, with (n1, n2, p1, p2) = (7, 3, 3, 2) and (8, 2, 4, 1), the best design or one of the best de-

(17)

signs Bingham & Sitter (2001) found is W˜kr-optimal for all k and all 0 ≤ r ≤ 1. In each of the exceptional cases, our approach found another admissible design, with re- spect to ˜Wk1 and ˜Wk0, which has larger Ik when r is small or k is large. These ad- ditional admissible designs have the following independent defining effects: ABD, ACpq for (n1, n2, p1, p2) = (4, 2, 1, 1); ABE, ACF, BCDG, BCpq, ADpr for (n1, n2, p1, p2) = (7, 3, 3, 2); and ABE, ACF, ADG, BCDH, ABCDpq for (n1, n2, p1, p2) = (8, 2, 4, 1).

Implicitly, our criteria also take into account the number of subplot two-factor interactions that are estimated in the whole-plot stratum, but those which are aliased with main effects are excluded. This is because, once a two-factor interaction is aliased with main effects, accord- ing to the hierarchical principle, it cannot be estimated in a model in which all the main ef- fects need to be estimated. Consequently such a two-factor interaction does not play a role in the computation of information capacities. For example, for (n1, n2, p1, p2) = (1, 8, 0, 5), the first design listed in Bingham & Sitter’s (2001) Table 3 has one subplot two-factor interaction that is estimated in the whole-plot stratum, while the second design has four such two-factor interactions. However, since all these two-factor interactions are aliased with whole-plot main effects, both designs have the same number, 0, of subplot two-factor interactions that are not aliased with main effects and are estimated in the whole-plot stratum. It follows that they per- form equally well under the ˜Wkr-criteria and have exactly the same information capacities for all k and r. In addition to (n1, n2, p1, p2) = (1, 8, 0, 5), there are three other 16-run cases, with (n1, n2, p1, p2) = (1, 9, 0, 6), (1, 10, 0, 7), (3, 10, 1, 8), where the two top designs listed in Bing- ham & Sitter’s (2001) Table 3 are ˜Wkr-optimal, and have identical information capacities, for all k and r.

However, this phenomenon does not occur in the case of 32-run designs. Furthermore, in seven of the twelve 32-run cases, including the one discussed in Example 2, where Bingham &

(18)

Table 1. Seven optimal 32-run split-plot designs

n1.n2.p1.p2 Independent defining words

3.4.0.2 ABpr, ACpqs 5.2.1.1 ABCE, ABDpq 3.5.0.3 ABpr, ABqs, ACpqt 4.4.0.3 ABpq, ACDpr, BCDps 5.3.1.2 ABCE, ABpq, ACDpr 3.6.0.4 ABpr, ABqs, ACpqt, BCpqu 5.4.1.3 ABCE, ABpq, ACDpr, BCDps

Sitter’s (2001) secondary criterion, and the counterpart of the Wkrcriterion for split-plot designs, failed to produce unique optimal designs, we are able to find a unique ˜Wkr-optimal design. These seven cases and the corresponding optimal designs are provided in Table 1. These designs are better than the other designs that are also optimal under Bingham & Sitter’s (2001) secondary criterion, because they have more uniform distributions, over the alias sets, of the two-factor interactions that are not aliased with main effects and are estimated in the subplot stratum. In each of the other five cases, namely (n1, n2, p1, p2) = (1, 6, 0, 2), (2, 5, 0, 2), (1, 7, 0, 3), (2, 6, 0, 3) and (1, 8, 0, 4), there are two optimal designs which have identical information capacities for all k and r.

ACKNOWLEDGEMENT

Ching-Shui Cheng’s research was supported by a grant from the U.S. National Science Foun- dation and Pi-Wen Tsai’s research was supported by a grant from the Taiwan National Science Council. The authors would like to thank the referees for their helpful comments.

(19)

APPENDIX

Derivation of a good surrogate for maximizingIk(d)

When a model is not estimable, the determinant of the information matrix is zero. Therefore, if k > f , then, since not all the k two-factor interactions can be estimated, Ik(d) must be equal to zero. For k ≤ f , since a normalized estimable contrast is estimated with variance ξ2if it is orthogonal to blocks and with variance ξ1= ξ2/r if it is confounded with, blocks, for each estimable model containing all the main effects and k two-factor interactions, if t of these two-factor interactions are confounded with blocks, then the determinant of the information matrix for the two-factor interactions is equal to rt. The total number of models in which t two-factor interactions are confounded with blocks and the other k − t are orthogonal to blocks is given by

X

1≤i1<···<it≤h t

Y

u=1

miu

X

h+1≤j1<···<jk−t≤f k−t

Y

v=1

mjv

. Thus Ik(d) is equal to

min(k,h)

X

t=max(0,k−f +h)

rt/k·

X

1≤i1<···<it≤h t

Y

u=1

miu

X

h+1≤j1<···<jk−t≤f k−t

Y

v=1

mjv

n(n − 1)/2 k



=

min(k,h)

X

t=max(0,k−f +h)

X

1≤i1<···<it≤h t

Y

u=1

(r1/kmiu)

X

h+1≤j1<···<jk−t≤f k−t

Y

v=1

mjv

n(n − 1)/2 k



= Ek(r1/km1, . . . , r1/kmh, mh+1, . . . , mf)n(n − 1)/2 k



, (A1)

where

Ek(x1, . . . , xf) = X

1≤i1<···<ik≤f k

Y

j=1

xij,

as in (3.1) of Cheng et al. (1999). It follows from the same argument as was employed there that the quantity in (A1) is large ifPh

i=1r1/kmi+Pf

i=h+1miis large and r1/km1, . . . , r1/kmh, mh+1, . . . , mf

are as equal as possible.

REFERENCES

BAILEY, R. A. (1981). A unified approach to design of experiments. J. R. Statist. Soc. A 144, 214–23.

(20)

BINGHAM, D. & SITTER, R. R. (1999a). Minimum aberration two-level fractional factorial split-plot designs.

Technometrics41, 62–70.

BINGHAM, D. & SITTER, R. R. (1999b). Some theoretical results for fractional factorial split-plot designs. Ann.

Statist.27, 1240–55.

BINGHAM, D. R., SCHOEN, E. D. & SITTER, R. R. (2004). Designing fractional factorial split-plot experiments with few whole-plot factors. Appl. Statist. 53, 325–39.

CHEN, H. & CHENG, C. S. (1999). Theory of optimal blocking of 2n−mdesigns. Ann. Statist. 27, 1948–73.

CHENG, C. S., STEINBERG, D. M. & SUN, D. X. (1999). Minimum aberration and model robustness for two-level factorial designs. J. R. Statist. Soc. B 61, 85–93.

CHENG, S. W. & WU, C. F. J. (2002). Choice of optimal blocking schemes in two-level and three-level designs.

Technometrics44, 269–77.

COX, D. R. (1958). Planning of Experiments. New York: Wiley.

FRIES, A. & HUNTER, W. G. (1980). Minimum aberration 2k−pdesigns. Technometrics 22, 601–8.

HUANG, P., CHEN, D. & VOELKEL, J. O. (1998). Minimum-aberration two-level split-plot designs. Technometrics 40, 314–26.

LI, W. & NACHTSHEIM, C. J. (2000). Model-robust factorial designs. Technometrics 42, 245–52.

MCLEOD, R. G. & BREWSTER, J. F. (2004). The design of blocked fractional factorial split-plot experiments.

Technometrics46, 135–46.

MILLER, A. (1997). Strip-plot configurations of fractional factorials. Technometrics 39, 153–61.

MUKERJEE, R. & FANG, K. T. (2002). Fractional factorial split-plot designs with minimum aberration and maximum estimation capacity. Statist. Sinica 12, 885–903.

NELDER, J. A. (1965). The analysis of randomized experiments with orthogonal block structure. I. block structure and the null analysis of variance. Proc. R. Soc. Lond. A 283, 147–62.

SITTER, R. R., CHEN, J. & FEDER, M. (1997). Fractional resolution and minimum aberration in blocking factorial designs. Technometrics 39, 382–90.

TSAI, P. W., GILMOUR, S. G. & MEAD, R. (2000). Projective three-level main effects designs robust to model uncertainty. Biometrika 87, 467–75.

XU, H. (2006). Blocked regular fractional factorial designs with minimum aberration. Ann. Statist. 34, 2534–53.

XU, H. & LAU, S. (2006). Minimum aberration blocking schemes for two- and three-level fractional factorial designs. J. Statist. Plan. Infer. 136, 4088–118.

(21)

參考文獻

相關文件

Other advantages of our ProjPSO algorithm over current methods are (1) our experience is that the time required to generate the optimal design is gen- erally a lot faster than many

After teaching the use and importance of rhyme and rhythm in chants, an English teacher designs a choice board for students to create a new verse about transport based on the chant

Simulation conditions are introduced first and various characteristics in three defect designs, such as single mode laser wavelength shift and laser mode change, are analyzed.

In addition, to incorporate the prior knowledge into design process, we generalise the Q(Γ (k) ) criterion and propose a new criterion exploiting prior information about

(2007) demonstrated that the minimum β-aberration design tends to be Q B -optimal if there is more weight on linear effects and the prior information leads to a model of small size;

The packed comparison instructions compare the destination (second) operand to the source (first) oper- and to test for equality or greater than.. These instructions compare eight

Two cross pieces at bottom of the stand to make a firm base with stays fixed diagonally to posts. Sliding metal buckles for adjustment of height. Measures accumulated split times.

The min-max and the max-min k-split problem are defined similarly except that the objectives are to minimize the maximum subgraph, and to maximize the minimum subgraph respectively..