Optimal balanced measurement designs when errors are correlated

(1)

www.elsevier.com/locate/jspi

Optimal balanced measurement designs when errors are

correlated

C.T. Liao

a

_{, C.H. Taylor}

b

_{, H.K. Iyer}

c;∗ a_{Department of Agronomy, National Taiwan University, Taipei, Taiwan}

b_{Proctor and Gamble, Cincinnatti, OH, USA}

c_{Department of Statistics, Colorado State University, Ft. Collins, CO 80523, USA} Received 1 October 1995; received in revised form 10 April 1998; accepted 8 May 1999

Abstract

We consider situations in which some characteristic is to be measured on each of several specimens. For instance, the characteristic may be the concentration of lead in soil samples and a laboratory may routinely analyze samples from dierent customers. A measuring device is used to obtain the responses and the measurement errors generally exhibit serial correlation and=or drift. To help adjust the instrument readings for this serial correlation, standards are in-terspersed among the specimens at appropriate intervals. In this paper we present the results of an investigation of balanced measurement designs that minimize the uncertainty in the ad-justed measurements under an AR(1) error structure. Examples are given to illustrate the results.

c

Keywords: Calibration problems; AR(1) process; Systematic errors; Random errors; BLUE; A-optimality

1. Introduction

A measurement process is frequently subject to errors which are generally classied as systematic, or random, or a combination of both. The random errors are dened to have a zero expected value, and the systematic errors are dened to be due to biases in the measurement process. Thus, the measured (observed) value of an unknown specimen can be described by the following additive model:

yi= i+ i+ i (1.1)

where i; i and i denote the systematic error, the true value, and the random error, respectively, corresponding to the specimen being measured. If the errors can be esti-mated suciently precisely, then it may be possible to process the raw measurement

∗_{Corresponding author. Tel.: +1-970-491-6870; fax: +1-970-491-7895.}

E-mail address: [email protected] (H.K. Iyer)

(2)

to produce an estimate of the true value which is “better” than the raw observation itself.

Since “standards” have known true values, the errors associated with the measure-ment process are “observed” whenever a standard is measured. Typically, random errors are assumed to be independent, but it is often more realistic to acknowledge that the measurement process is serially correlated. For example, consider an automated system which measures the concentrations of a toxic material in water samples contained in each of several test tubes. These measurements are made sequentially in time. If the time period between these measurements is relatively small, then it is quite conceivable that a substantial correlation may exist between consecutive random errors. Suppose the correlation structure can be modeled suciently well so that it may be assumed known. Then a measurement experiment consisting of several observations of each of m dierent unknown specimens and of a known standard may utilize the “known” correlation structure to give better estimates of the unknowns. The following example is used to illustrate how this occurs.

Example 1.1. Consider the simple situation where there are only random errors, i.e., the systematic errors are assumed to be zero. Let y1 represent an observation of a standard and y2 represent an observation of an unknown, and 1 and 2 represent the true values of the standard and the unknown, respectively. The relationship between the observed values and the true values may be written as

y1 y2 = 1 2 + 1 2 :

Also suppose the random errors have the distribution 1 2 ∼ N 0 0 ; 1:0 0:9 0:9 1:0 :

Since 1 is observable, 2 may be estimated by E(2| 1) = 0:91:

Consequently, the unknown quantity may be estimated as ˆ2= y2− 0:91:

The variance of this estimate is Var( ˆ2) = 0:19

which is less than 1, the variance of y2.

The precision in estimating the true values of unknown specimens also depends on the arrangement (ordering) of the observations of the unknowns and the standard. The following example illustrates this phenomenon.

Example 1.2. Suppose a single measurement of one unknown is to be made along with two observations of the standard. The systematic errors are assumed to be zero,

(3)

and the random errors of observations 1, 2, and 3 are normally distributed with mean zero and the covariance matrix



1:00 0:50 0:25_{0:50 1:00 0:50} 0:25 0:50 1:00

  :

If the observations are made in the order USS where U denotes the unknown specimen and S denotes the standard, then the unknown quantity may be estimated as

ˆ1= y1− E(1| 2; 3) = y1− 0:52: The variance of this estimate is

Var( ˆ1) = 0:75:

If the observations are made in the order SUS, then the unknown quantity may be estimated as

ˆ2= y2− E(2| 1; 3) = y2− 0:41− 0:43: The variance of this estimate is

Var( ˆ2) = 0:60:

So the order SUS is preferable to the order USS.

Literature pertaining to calibration or measurement problems involving correlated errors appears to begin with Pepper (1973). He considered the following model for the measurement process:

yt= t+ (bt+ t)

where yt represents the measured value and t represents the corresponding true value, for t = 1; 2; : : : ; n. The quantity (bt+ t) represents random errors, where t are inde-pendent N(0; 2

) random variables, and bt arise from a random walk of the form bt= bt−1+ t

with t independent N(0; 2) random varibles which are also independent of t. Pepper (1973) provided three dierent measurement policies. For each of the policies, he gave approximate maximum likelihood estimators for the unknowns. In addition, he gave suggestions for ordering the observations based on enumeration of all possible designs in situations where the total number of observations is “small”.

By a suitable reparameterization, it is easy to see that the problem addressed in this paper is equivalent to the problem of comparing a set of treatments with a control using an unblocked experimental design, that is, using only one block of size equal to the total number of runs. Considerable attention has been devoted in the literature to the problem of comparing treatments with a control, but nearly all of the work is under the assumption of uncorrelated errors. Cutler (1993) does consider this problem under a circular AR(1) error structure as well as a linear AR(1) error structure and demonstrates the A-optimality of certain classes of binary block designs. Other authors have discussed

(4)

block designs with an AR(1) error structure but they concern themselves with the estimation of all pairwise treatment dierences rather than comparing treatments with a control. See, for instance, Berenblut and Webb (1974), Cheng (1983, 1988), Ipinyomi (1986), Kiefer (1960), Kiefer and Wynn (1981, 1983, 1984), Kunert (1985a,b, 1987), Martin (1982), Martin and Eccleston (1991), Russell and Eccleston (1987a, b), and Williams (1949, 1985, 1986). Another useful source of information on this topic is the review article by Hedayat et al. (1988). Optimal designs for the comparision of treatments to a control using a model with uncorrelated errors are treated by Hedayat et al. (1988), Pigeon and Raghavarao (1987) and Majumdar (1988). None of these articles directly address the problem under consideration here. Although Cutler (1993) considers the same error structure as we do, his focus is on binary block designs whereas we use single block designs that are nonbinary.

In the next section we describe our problem and point out its equivalence with the problem of comparing all treatments to a control. Section 3 provides optimal designs, within the class of “balanced” designs, when there are three or more unknowns to be measured. For situations where at most two unknowns are to be measured we can actually nd optimal designs in the class of all designs but we do not present the results here. Instead the reader is referred to Taylor (1989). Section 4 presents an algorithm to generate the optimal balanced designs. Proofs of the theroems in Section 3 are given in the appendix.

2. Preliminaries 2.1. The problem

The specic problem treated in this paper assumes that a total of N measurements can be made some of which may be measurements on a standard while the remaining measurements are of m dierent unknown specimens. The standard as well as each of the specimens may be measured more than once if so desired. The number of mea-surements of unknown specimen i is denoted by ti; i = 1; 2; : : : ; m, and the number of measurements of the standard is denoted by b − 1 (b ¿ 1). Furthermore these obser-vations are made sequentially in time. The statistical model assumes a constant but unknown systematic error, say . Let 0 be the true value of the standard (which is, of course, known), and ij be the indicator function dened by

ij=   

1 if j = 0 and observation i is of the standard;

1 if j ∈ {1; 2; : : : ; m} and observation i is of unknown j; 0 otherwise:

Then, the model described in (1.1) can be rewritten as yi= + i00+

m P

(5)

The random errors i are now assumed to arise from a rst-order stationary autoregres-sive process (AR(1)) of the form

i= i−1+ zi (2.2)

where zi are assumed to be uncorrelated random variables with zero means and equal variances 2_{. The coecient (−1 ¡ ¡ 1) is assumed known. We use a stationary} AR(1) error sequence rather than a nite AR(1) error sequence because of the inverse of the covariance matrix has a rather simple form in the former case. We are interested in estimating 1; : : : ; m most eciently.

Let us reparameterize the model as follows. Dene 0 = and i = + i for i = 1; : : : ; m. Also let zi= yi− i00. The model equation in (2.1) becomes

zi= m P

j=0ijj+ i; i = 1; : : : ; N (2.3)

which is a one-way classication model with m + 1 treatments with means 0; : : : ; m. The parameter 0 is for the control and the dierences j− 0 are precisely the j’s for j = 1; : : : ; m.

This paper is concerned with nding optimum designs for estimating the values of the unknown specimens within the class of “balanced” measurement designs (to be dened shortly). In particular, we consider the A-optimality criterion according to which the average variance of the best linear unbiased estimates (BLUE) of the true values of the unknown specimens is minimized. The following terminology is relevant to the rest of the paper.

2.2. The class of “balanced measurement designs”

A balanced measurement design is dened here as a design for which the BLUEs of the true values of the unknown specimens have the same variance and all pairs of the estimators have the same covariance. Such a design is often desirable since inferences may be made without regard to the labelling of the specimens.

2.2.1. Batches

Designs used in this paper will be described in terms of “batches”. A “batch” is dened to be the set of all measurements of unknown specimens between two succes-sive observations of the standard, or the set of all measurements (if any) of unknowns before the rst measurement of a standard, or the set of all measurements (if any) of unknown specimens after the last measurement of a standard. A batch of measure-ments is called an “interior batch” if the corresponding measuremeasure-ments are sandwiched between measurements of the standard. If the batch of measurements forms the initial segment of the measurement process it is called the “initial batch” and if it forms the nal segment of the measurement experiment, then it is called the “nal batch”.

Empty batches are permissible and these occur when the standard is measured at two consecutive time points with no unknown in between. The number of empty batches is

(6)

denoted by b0. As a consequence of these denitions, b − 1 observations of a standard give rise to b batches. The following example is given to illustrate the above denitions. Example 2.1. Consider the following sequence of the observations.

U1U3SU1U3U3U3U2U1U2U2SS

where Ui denotes the unknown specimen i and S denotes the standard. In this case, there are b=4 batches. Of these 4 batches b0=2 are empty. The initial batch consists of two measurements, the rst being of specimen 1 and the next of specimen 3. The rst interior batch consists of the eight measurements in the order U1U3U3U3U2U1U2U2. The second interior batch is empty since we have two successive measurements of the standard with no unknown specimen in between. The nal batch is also empty since no unknowns are measured after the last measurement of the standard. Note that specimen 1 is measured t1= 3 times, specimen 2 is measured t2= 3 times, and specimen 3 is measured t3=4 times. The total number of measurements is N =(b−1)+t1+t2+t3=13. 2.3. BLUEs for the unknown true values

Since 0 is known in the model described in (2.1), we let zi=

yi if observation i is of the unknown; yi− 0 if observation i is of the standard: Thus, we may write

zi= + m P

j=1ijj+ i (2.4)

or in matrix notation, this can be represented as z = X + where X is n × (1 + m) dened as Xi;j= 1 if j = 1; i;j if j ¿ 1; and = [; ]0 _{where = [}

1; 2; : : : ; m]. We write X = [X1; X2] where X1 is a column of 1’s and X2 has i;j as its (i; j)-element.

The random errors i are assumed to arise from an AR(1) process described in (2.2). Hence, the vector of these random errors = [1; 2; : : : ; N]0 has a distribution with a zero mean and a variance–covariance matrix 2_{V, where V is a positive denite matrix} with the elements

Vi;j=

|i−j|

(7)

It is well known (see Siddiqui (1958)) that V−1 _{is given by} (V−1₎ i;j=        1 if i = j = 1 or i = j = N; 1 + 2 _{if i = j and 1 ¡ j ¡ N;} − if |i − j| = 1; 0 otherwise: The BLUE of is ˆ = [ ˆ; ˆ]0_{= (X}0_V−1_X)−1_X0_V−1_z

and its variance is

Var( ˆ) = 2_(X0_V−1_X)−1_:

From this it is easy to see that the BLUE of is given by ˆ = C−1_X0 2[V−1− V−1X1(X10V−1X1)−1X10V−1]z and that Var( ˆ) = 2_C−1 where C = X0 2[V−1− V−1X1(X10V−1X1)−1X10V−1]X2: (2.6) The matrix C will be called the information matrix for .

At this point it is useful to recast the measurement design of Example 2.1 in terms of treatments and controls and point the similarities and dierences between the problem considered in this paper and the one addressed by Cutler (1993). Example 2.2 serves this purpose.

Example 2.2. Consider the design U1U3U0U1U3U3U3U2U1U2U2U0U0

which is obtained by replacing S with U0 in the measurement sequence of Example 2.1. Here U0 represents the control and U1; U2; U3 represent the treatments. If we are inter-ested in comparing each treatment with the control, then, by virtue of the equivalence between measurement designs and treatment-control designs, the information matrix for (1; 2; 3) given in (2.6) is identical to that of C∗ in Cutler (1993, p. 121) with b = 1 where b is the number of blocks in Cutler’s designs. Hence, in principle, the optimum design problem we consider is the same as that considered in Cutler (1993). However, as mentioned earlier, Cutler (1993) only considered binary block designs whereas we consider nonbinary designs with only one block. In particular, the design used as an illustration in this example is not in the class of designs considered by Cutler (1993). 2.4. The structure of the information matrix for a general measurement design

We rst introduce some notation and then describe the structure of the information matrix C for a general measurement design.

(8)

Notation.

ei= the number of times unknown i is the rst observation plus the number of times unknown i is the last observation, i.e., ei= 1;i+ N;i. Note that Pm_i ei= 0; 1, or 2.

qi= the number of times two consecutive observations are of the unknown i, i.e., qi=PN−1k=1 k;ik+1;i.

ri;j= the number of times unknowns i and j occur as neighboring pairs in the design. The following example is given to illustrate these denitions.

Example 2.3. Consider the same sequence of observations as in Example 2.1, U1U3SU1U3U3U3U2U1U2U2SS;

e1= 1 (the rst observation is of the unknown specimen U1), e2= 0 (neither the rst nor the last observation is of U2), e3= 0 (neither the rst nor the last observation is of U3), q1= 0 (no two consecutive observations are of U1), q2= 1 (due to observations (10 and 11)),

q3= 2 (due to the pairs of observations (5 and 6) and (6 and 7)), r1; 2= 2 (due to the pairs of observations (8 and 9) and (9 and 10)), r1; 3= 2 (due to the pairs of observations (1 and 2) and (4 and 5)), r2; 3= 1 (due to observations (7 and 8)).

The parameters ei; qi; rij describe the arrangement of observations on the unknowns and the standard and there are certain interrelationships among them which need to be recognized. These interrelationships are summarized in the following Lemma.

Lemma 2.1. Let t =_m1Pm i ti; e = 1 m m P i ei; q = 1 m m P i qi; r = 1 m 2 Pm i ¡ jri;j:

For any measurement design we must have N = mt + b − 1. Also; the following relationships among the parameters must hold:

m 2 r + mq = mt − (b − b0); (2.7) maximum{2 − me; b − mt}6b06b − 1; (2.8) maximum ( 0;m − (b − b_m 0) 2 ) 6r6mt − (b − b_m 0) 2 : (2.9)

The proof is based on simple combinatorial arguments and is omitted.

The following lemma describes the elements of the information matrix C corre-sponding to an arbitrary measurement design.

(9)

Lemma 2.2. Let D denote an arbitrary measurement design described by the param-eters ei; qi; ti; b0; b; m and N. Let C denote the information matrix for under the design D. Then the elements of C = (Cij); i; j = 1; 2; : : : ; m are given by

Ci;j= gi− (1 − )s2i=h if i = j; −ri;j − (1 − )sisj=h if i 6= j; (2.10) where gi= ti(1 + 2) − 2qi − ei2; si= ti(1 − ) + ei; h = 2 + _P_m i ti+ b − 3 (1 − ):

It is interesting to note that the information matrix in Lemma 2.2 has the same structure as the one given in Kunert and Martin (1987, p. 1606) except that the value of N in our problem is mt+b−1 and in their problem it is mt. However, the information matrix C of Lemma 2.2 is in general nonsingular whereas the matrix Cd of Kunert and Martin (1987) is always singular. There appears to be no useful connection between our problem and theirs which allows us to obtain optimum designs for our problem.

The following lemma describes conditions that must be satised by the design pa-rameters ei; qi; ti, and rij in order for a measurement design to be balanced. It also gives the information matrix corresponding to balanced designs.

Lemma 2.3. A measurement design is balanced for all values of (−1 ¡ ¡ 1) if and only if; for all i and j; ti= t; qi= q; ei= e; and ri;j = r. When this holds; the matrix C is completely symmetric and is equal to xIm+ yJm; where

x = t(1 − )2₊2(b − b0)

m  + mr − e2; (2.11)

y = −r −(1 − )(t(1 − ) + e)_{2 + (N − 2)(1 − )} 2; (2.12)

and the trace of C−1 _{can be explicitly expressed as}

Trace(C−1_{) =}m − 1

x +

1

x + my: (2.13)

The proof involves straightforward algebra and is omitted.

According to the above lemma, ei must all be equal to a common value e for a balanced design. Also, the sum of the ei’s cannot be greater than 2. Therefore, when m¿3, each ei must be zero, i.e., e must equal zero. Observe that the eigenvalues of the matrix C are x + my with multiplicity one and y with multiplicity m − 1. These may be obtained by substituting b = 1 in Cutler (1993) formulas for the eigenvalues of C∗ under a rectangular AR(1) error structure.

The following lemma gives a necessary and sucient condition for the existence of a balanced measurement design corresponding to specied values of m; b, and t.

(10)

Lemma 2.4. Suppose m¿3. A balanced measurement design exists corresponding to specied values of m; b; and t; if and only if there exist nonnegative integers q and r with 06q6t − 1 such that 2 + mt − b6mq + m(m − 1)r=26mt − 1.

The necessity follows from (2.7)–(2.9) using the fact that e=0 when m¿3, and the requirement for q; r; b0 to be nonnegative integers satisfying the conditions 06q6t −1 and 26b06b − 1. The suciency follows by noting that the algorithm for construct-ing balanced designs, given in Section 4, is applicable whenever the condition of Lemma 2.4 is met.

For given values of N and m it is often possible to enumerate, by the use of Lemma 2.4, the entire collection of balanced designs. This is illustrated in Examples 2.4 and 2.5.

Example 2.4. Suppose we wish to identify the collection of balanced designs when there are m = 20 unknown specimens to be measured and the total number of observa-tions that can be made using available resources is N = 60. By Lemma 2.4, we need to determine the collection of integer solutions to the inequalities

20t − b + 2620q + 190r620t − 1 (2.14)

satisfying 20t + b − 1 = 60 and nonnegativity restrictions. The value of t must be at least 1. If we take t = 1 then b must equal 41 and the inequality in (2.14) becomes

−19620q + 190r619

which has only one solution, namely q = 0 and r = 0. If we take t = 2 then b = 21 and the condition in (2.14) becomes

21620q + 190r639

which has no feasible solutions. No other solutions are feasible, so there is only one balanced design corresponding to N = 60 and m = 20. It has parameters b = 41; t = 1; q = 0; r = 0; e = 0, and b0= 21.

Example 2.5. Now consider the case N =60 and m=4. Using the conditions in Lemma 2.4 it is easily veried that there are 204 balanced designs in all. In particular, there are 26 designs corresponding to the value t = 10. We will return to this example later in Section 3.

In some problems it will be relatively straightforward to determine the optimum balanced design, given the value of the AR(1) parameter , simply by enumerating the entire collection of balanced designs corresponding to the given values of the design parameters, computing the trace criterion for each design in the collection, and determining the best among them. There is a serious disadvantage to this approach. It requires that the value of be specied in advance. If one is interested in examining optimum balanced designs for a range of values of then the numerical optimization must be carried out over a grid of values with respect to the enumerated collection

(11)

of balanced designs. Alternatively, Theorems 3.1 and 3.2 given in the next section may be used to prune the list of balanced designs to a shorter list of candidate designs for optimality. Details of this approach will be given later in this article.

Kiefer (1975) provided a methodology which can be used to nd optimal designs for various classes of problems. A key step in his approach depends on the convexity of the trace of the inverse of a positive-denite matrix. The following result, which is a consequence of a well known convexity property of positive-denite matrices (see Theorem 1:1:12 of Fedorov (1972)), supplies the condition needed to apply Kiefer’s method.

Lemma 2.5. For a given information matrix C described in (2:10); we have Trace( C)−1_{6 Trace(C}−1₎

where C = ( Ci;j) with Ci;j=        2 m(m − 1) P k¡ lCk;l for i 6= j; 1 m m P k=1Ck; k for i = j: (2.15)

Clearly, C is of the form xIm+ yJm where Im is an identity matrix, Jm is a m × m matrix with all elements equal to 1. By virtue of Lemma 2.5 it follows that a design that is optimum within the class of balanced designs is sometimes optimum within the class of all feasible designs. This is the essence of the following corollary.

Corollary 2.6. Suppose −1 ¡ ¡ 1 and positive integers N and m (N ¿ m) are given quantities. Let t; e; q; and r be nonnegative real numbers satisfying the conditions given in (2:7)–(2:9). Let t and e be nonnegative real numbers and −1661. Dene y∗ and x∗ _by y∗_{= −r −} m(1 − ) (m − 1)[N − (N − 2))]{t(1 − ) + e}2 +_{(m − 1)[N − (N − 2)]}(1 − )3 {t2₊2 t} +_{(m−1)[N −(N −2)]}2(1 − ) {e2₊2 e}+ 2(1 − ) 2 (m−1)[N −(N −2)]{et +te} and x∗_{= t(1 +}2_{) − 2q − e}2₋ (1 − ) [N −(N −2))]{(1 − )2(t2+ 2)+2(e2+2e) + 2(1 − )(et + te)} − y∗:

Over the space of allowable values of t; e; q; r; t; e, and for which the matrix x∗_I

(12)

y∗_J

m)−1 occurs at t = t0; e = e0; r = r0; q = q0; t= 0, and e= 0, where t0; e0; q0; r0 are integers. Then, a balanced measurement design with parameters N; m; t0; q0; r0; e0 is A-optimum over the class of all measurement designs with m unknowns and N observations.

The proof follows by expressing the entries of the matrix C in terms of the param-eters for a general measurement design with m unknowns and N observations. Details are omitted. Corollary 2.6 may also be used to obtain a lower bound for the trace criterion for an arbitrary measurement design by carrying out the minimization of the trace without requiring the design parameters to be integers. This lower bound can then be used to judge the eciency of an optimum balanced design. This is illustrated in Section 3.

3. A-optimum balanced designs for three or more unknowns

The determination of an A-optimum design in the class of all balanced designs is a discrete optimization problem with inequality constraints. Dierent optimum designs are obtained for dierent ranges of values of . In this section we derive the optimum balanced designs when the values of N; m; t, and are specied. The results can be used to develop a computer algorithm for nding the best value of t and the corresponding optimum design for specied N; m, and . The computer implementation is aided by observing that, for xed N; m, and , the function

T(t) = Min

q;r;b0 Trace(C −1₎

is a convex function of t.

We rst observe that an explicit expression for Trace(C−1_{) for balanced designs}

with m¿3 is given by Trace(C−1_{) =} m − 1 t(1 − )2₊2(b−b0) m + mr + 1 t(1 − )2₊2(b−b0) m − mt 2₍₁₋₎3 2+(N−2)(1−) : (3.1)

For xed values of m, and N, the right-hand side of (3.1) may be regarded as a function of b0, t, and r. We minimize this function with respect to b0 and r for each allowable value of t, thus nding optimal balanced designs for each allowable combination of values of b and t. The global optimum balanced design is then the best design among the set of optimum balanced designs obtained for the dierent admissible values of b and t. We state the following two results, one for the case −1 ¡ ¡ 0 and the other for the case 0 ¡ ¡ 1. Proofs are given in the appendix.

Theorem 3.1. Suppose m; the number of unknown specimens; is greater than or equal to 3 and −1 ¡ ¡ 0; where is the parameter in the AR(1) model for the errors.

(13)

Suppose each unknown specimen is to be measured t times and that the total number of measurements; including those on the standard; is N. The values of b0 and r corresponding to an optimal balanced design are as follows:

(1) If b − 2¿m and m is odd; then b0= b − m; r = 0 and q = t − 1.

(2) If b − 2¿m and m is an even number greater than 2t; then b0= b − m; r = 0; and q = t − 1.

(3) If b−2¿m and m is an even number less than or equal to 2t; then the balanced optimum design is obtained from one of the following two possibilities:

(3a) b0= b − m; r = 0; and q = t − 1; or (3b)b0= b − m=2; r = 1; and q = t −1₂m. The choice is to be made by numerically comparing the average variances corre-sponding to each of these cases and the result can depend on the value of .

(4) If m=26b − 2 ¡ m and m is an even integer less than or equal to 2t; then b0= b − m=2; r = 1; and q = t −1₂m.

(5) In all other cases balanced designs do not exist.

Theorem 3.2. Suppose m; the number of unknown specimens; is greater than or equal to 3 and 0 ¡ ¡ 1 where is the AR(1) parameter in the model for the errors. Suppose each unknown specimen is to be measured t times and that the total number of measurements; including those on the standard; is N. The values of b0; q and r corresponding to an optimal balanced design are as follows:

(1) If m is even and 1

26(b−2)=m6t; then the optimum balanced design is obtained from one of the following four possibilities. The four possible designs given below must be numerically compared to pick the best among them. The result will generally depend on the value of .

(1a) Suppose the interval given by maximum

0;2(m − b + 2)_{m(m − 1)}

6r62(mt − b + 2)_{m(m − 1)} (3.2)

contains at least one even nonnegative integer. Then a candidate for an optimum balanced design has b0=b−mx0 and r =r0 where r0 is the largest even integer in the interval in (3:2) and x0 is the largest positive integer less than or equal to (b − 2)=m. Note that this case can occur only when (b − 2)=m¿1.

(1b) Suppose the interval in (3:2) contains at least one nonnegative odd integer. Then a candidate for an optimum balanced design has b0=b−mx0 and r=r0 where r0 is the largest odd integer in the interval given by (3:2); and x0 is the largest positive odd multiple of 1=2 less than or equal to (b − 2)=m.

(1c) Suppose the interval given by 2(mt − b + 2)

m(m − 1) ¡ r6

2(mt − 1)

m(m − 1) (3.3)

contains at least one nonnegative even integer. Then a candidate for an optimum balanced design has b0= b − mx0 and r = r0 where r0 is the smallest even integer in the interval given by (3:3) and

(14)

(1d) Suppose the interval in (3:3) contains at least one odd integer. Then a can-didate optimum balanced design has b0= b − mx0 and r = r0 where r0 is the smallest odd integer in the interval in (3:3) and

x0= t − (m − 1)r0=2:

(2) If m is odd and 16(b − 2)=m6t. Then the optimum design is given by one of the following two possibilities. They must be numerically compared and the better one must be chosen. The result will generally depend on the value of .

(2a) Suppose the interval in (3:2) contains at least one nonnegative integer. Then a candidate optimum balanced design has b0= b − mx0 and r = r0 where r0 is the largest integer in the interval given by (3:2); and x0 is the largest integer less than or equal to (b − 2)=m.

(2b) Suppose the interval in (3:3) contains at least one nonnegative integer. Then a candidate optimum balanced design has b0= b − mx0 and r = r0 where r0 is the smallest integer in the interval given by (3:3) and x0= t − (m − 1)r0=2.

(3) (b − 2)=m ¿ t; then the optimum balanced design has b0= b − mt and r = 0. (4) In all remaining cases a balanced design does not exist.

Example 3.1. We will use Theorems 3.1 and 3.2 to make a complete determination of optimum balanced designs over the entire range −1 ¡ ¡ 1 of values of , for the case when N = 60; m = 4; t = 10, and b = 21. This is a subclass of designs considered in Example 2.4.

First consider the case ¡ 0. Observing that b−2=19 is greater than m=4 and that m is an even number less than 2t = 20, we decide we are in case (3) of Theorem 3.1. Therefore there are two candidates for an optimum balanced design, say D1 and D2, given by cases (3a) and (3b), respectively. For design D1 we must take b0=b−m=17 and r = 0. For design 2 we must choose b0= b − m=2 = 19 and r = 1. The complete set of design parameters for the two designs is listed below.

Design D1: N = 60; m = 4; b = 21; t = 10; q = 9; r = 0; b0= 17; Design D2: N = 60; m = 4; b = 21; t = 10; q = 8; r = 1; b0= 19:

Let tr1 and tr2 denote the trace criterion for D1 and D2, respectively. We have

tr1= 2(−75 + 190 − 186 2_{+ 70}3₎ (5 − 9 + 52_{) (−50 + 115 − 111}2_{+ 45}3₎ and tr2= 8(−75 + 190 − 186 2_{+ 70}3₎ 5(2 − 3 + 22_{) (−100 + 260 − 251}2_{+ 90}3₎:

These are rational functions of and it is easy to show that tr26tr1 for all −1 ¡ ¡ 0. So design D2 is optimum for all negative values of . Fig. 1 shows the graphs of tr1and tr2as functions of in the interval −1660. The two curves appear indistinguishable in the gure.

(15)

Fig. 1. Plot of the trace criterion for designs D1 (dotted curve) and D2 (solid curve).

Next, consider the case ¿ 0. Since m = 4 is even and (b − 2)=m = 4:75 is between 1=2 and t = 10, case (1) holds. We examine the four subcases for this situation. For (1a) we note that the interval for r in (3.2) is 06r621=6, so r0= 2 is the largest even integer in this interval. Also x0= 4. Hence, b0= b − mx0= 5 and q = 3. So case (1a) leads to the design D3 with the parameters listed below.

Design D3: N = 60; m = 4; b = 21; t = 10; q = 3; r = 2; b0= 5:

Under case (1b) we have r0= 3 and x0= 9=2. So case (1b) leads to the design D4 with the parameters listed below.

Design D4: N = 60; m = 4; b = 21; t = 10; q = 1; r = 3; b0= 3:

The interval for r in (3.3) is 21=6 ¡ r639=6, so we get the following candidate designs under case (1c) and case (1d):

Design D5: N = 60; m = 4; b = 21; t = 10; q = 0; r = 4; b0= 5; Design D6: N = 60; m = 4; b = 21; t = 10; q = 0; r = 5; b0= 11:

Denote the trace criterion for designs D3; D4; D5; D6 by tr3; tr4; tr5; tr6, respectively. We have tr3= 10(−15 + 14 − 14 2_{+ 14}3₎ (5 − 2 + 52_{)(−50 + 25 − 24}2_{+ 45}3₎; tr4= 8(−75 + 40 − 41 2_{+ 70}3₎ (10 + + 102_{)(−100 + 20 − 19}2_{+ 90}3₎; tr5= 2(−75 + 40 − 41 2_{+ 70}3₎ (5 + 2 + 52_{)(−50 + 25 − 24}2_{+ 45}3₎

(16)

Fig. 2. Plot of the trace criterion for designs D3 (dotted curve), D4 (dashed curve), D5 (solid curve) and D6 (dot-dash curve). and tr6= 8(−15 + 14 − 14 2_{+ 14}3₎ 5(2 + + 22_{)(−20 + 28 − 27}2_{+ 18}3₎:

Again, since these are rational functions of , it is easy to show that tr5 is smaller than tr3, tr4 and tr6 over the interval 0 ¡ ¡ 1. Therefore D5 is the optimum design for any positive value of . Fig. 2 shows the graphs of tr3; tr4, tr5 and tr6 as functions of in the interval 0661.

The use of Theorems 3.1 and 3.2 has enabled us to determine that design D2 is optimum in the class of balanced designs with m = 4; t = 10; b = 21 (there are 26 of these designs) for every value of between −1 and 0. Likewise, we have deter-mined that design D5 is optimum in this class for every value of between 0 and 1 (when = 0, every balanced design with these parameters leads to the same trace criterion).

It may be veried that, for N = 60; m = 4, and = 0:1, the trace of (x∗_{I + y}∗_J)−1

in Corollary 2:5 has a minimum value equal to 0.559 and the corresponding value for t is 9.86 (rounded to two decimals). So a lower bound for the trace criterion for an arbitrary measurement design is equal to 0.559 when N = 60; m = 4, and = 0:1. The optimum balanced design D5 above has a trace criterion equal to 0.5698 at = 0:1 which is very close to the lower bound.

(17)

4. An algorithm to construct balanced measurement designs

For given values of m, N, and t, we gave in the previous section the values of r and b0 which minimize Trace(C−1). The parameters values obtained from Theorems 3.1 and 3.2 automatically satisfy the necessary conditions for balance, stated in Lemma 2.5. It remains to show how to construct the desired balanced designs corresponding to m, t, r and b0.

For our construction, we will need to use a special class of latin squares which were considered by Kiefer and Wynn (1981) in their study of equineighbored designs. Also see Cheng (1983). For a given integer m, consider the m × m array A whose (j; l) element (16j; l6m) is the symbol Up where

p − 1 = _P_j i=1(−1) i_{(i − 1) +}Pl i=1(−1) i_{(i − 1)} mod m : (4.1)

It is easily veried that A has the following properties:

(1) A is a latin square. Each column or each row of A contains all m dierent symbols (which represent the m unknown specimens) exactly once.

(2) When m is an odd number, A is symmetric. When m is an even number, then A is symmetric with respect to the two leading diagonals, i.e., aj;l= al;j= am+1−j;m+1−l= am+1−l;m+1−j.

(3) For m odd, each pair of symbols appear as neighbors exactly once in the rows of the rst m × (m + 1)=2 subarray.

(4) For m even, each pair of symbols appear as neighbors exactly once in the rows of the rst (m=2) × m subarray and exactly twice in the rows of the entire array.

The following arrays are given to illustrate these results.

Example 4.1. For m = 5 and 6, the arrays obtained from (4.1) are, respectively, U5 U1 U4 U2 U3 U1 U2 U5 U3 U4 U4 U5 U3 U1 U2 U2 U3 U1 U4 U5 U3 U4 U2 U5 U1 U6 U1 U5 U2 U4 U3 U1 U2 U6 U3 U5 U4 U5 U6 U4 U1 U3 U2 U2 U3 U1 U4 U6 U5 U4 U5 U3 U6 U2 U1 U3 U4 U2 U5 U1 U6

Based on the arrays produced by (4.1), we present the following algorithm to con-struct balanced measurement designs. We consider three separate cases: (A) r = 0, (B) r ¿ 0 with (m − 1)r odd, and (C) r ¿ 0 with (m − 1)r is even.

The Algorithm

Case (A): Suppose r = 0. Then mq = mt − (b − b0) and k = (b − b0)=m must be an integer. In particular q = t − k. Let t = uk + v where u and v are integers and 06v ¡ k. Construct an array D (not necessarily a rectangular array), with mk rows such that, for each i with 16i6m, there are k − v rows consisting only of u occurrences of Ui and v rows consisting only of u + 1 occurrences of Ui. To this array, add an

(18)

initial column consisting of mk standards, and also add a single standard after the last element of the last row. Concatenate all the rows of this array to produce a single row consisting of m(k − v)u + mv(u + 1) + mk + 1 = N symbols. If b0= 2, then this is the required measurement design. If b0¿ 2 then add b0− 2 additional standards to the current design sequence by placing each additional standard next to an already occurring standard. This can usually be done in many dierent ways and each one will lead to a design satisfying the requirements.

Case (B): Suppose r ¿ 0 and (m−1)r is an even integer. Thus either m is odd or m and r are both even. We must have b − b0= km for some integer k. The following steps lead to a balanced measurement design meeting the requirements and with (nearly) equal batch sizes for the nonempty batches.

Step (1): For a given m, let A be the m × m array obtained from (4.1). If m is odd, then let m1= (m + 1)=2, otherwise m1= m=2. Let A1 be the rst m × m1 subarray of A and A2 be the m × m1 array which is the “re ection” of A1, i.e., A2 is obtained by permuting the columns of A1 according to the permutation

1 2 3 · · · m1− 1 m1

m1 m1− 1 m1− 2 · · · · 2 1

:

Now we dene an operation for merging two arrays by columns. Let B = B1...b and C = c...C1 be two arrays of sizes m0× l1 and m0× l2, respectively, and b and c are column vectors. Then the operation ◦ is dened by

B ◦ C =  

B_B1...b...C1 if b = c; 1...b...c...C1 if b 6= c:

Let the array D1= A1. If r ¿ 1, then for j¿2 recursively dene Dj= Dj−1◦ A1 if j is odd and Dj= Dj−1◦ A2 if j is even. By using Properties (3) and (4) of A described previously and mathematical induction, it is easy to verify that each pair of unknown specimens appears as neighbors exactly r times in the m rows of the array Dr. Furthermore, no unknown occurs next to itself in any of the m rows. Note also that Dr is an array of size m × n1, where n1= 1 + (m − 1)r=2.

Step (2): If q is equal to 0 then go to Step (3). Suppose q is greater than 0. Let c1; : : : ; cn1 be the n1 columns of Dr. Enlarge the current array Dr by repeating selected

columns of Dr next to themselves such that the number of columns in the enlarged array is q + n1= q + 1 + (m − 1)r=2. It makes no dierence as to which columns are selected to be repeated or how many times a given column is repeated as long as the total number of new columns added is q. At the end of this step each unknown specimen occurs in the array exactly n = q + 1 + (m − 1)r=2 times. This follows from the fact that each column of Dr contains each unknown exactly once.

Step (3): If k = 1 then go to Step (4). Suppose k ¿ 1. Let k0 = k − 1. Consider integers

(19)

For each j = 1; : : : ; k0, replace column cij in Dr by the block of three columns large

cij|s|cij where s is a column of S’s representing measurements of a standard. The

resulting modied array Dr will now have n+k −1=t columns consisting of unknown specimens and k − 1 columns consisting of standards only.

Step (4): Add a column s of standards as the rst row of Dr and also add a sin-gle standard to the right of the last element of the last row. The total number of standards in this modied (nonrectangular) array is mk + 1 = (b − 1) − (b0− 2). Con-catenate the rows of this array to produce a single row consisting of standards and unknowns. If b0= 2, then this row, when viewed as a measurement design, meets all the requirements. If b0¿ 2 then the number of standards occurring in the current array Dr is less than the required number by an amount equal to b0 − 2 as is the number of empty batches. This can be rectied by including b0− 2 additional stan-dards such that each of the added standard is placed adjacent to an already occurring standard. This also creates b0− 2 additional empty batches. This nal row consisting of unknowns and standards is then a balanced measurement design meeting all the requirements.

Note: The batch sizes in this design depend on how the columns i1¡ · · · ¡ ik0 are

chosen. Let t = (k − 1)u + v where u and v are nonnegative integers and |u − v| is as small as possible. For j=1; : : : ; k0, choose ij=ju−j+1. Then the resulting measurement design will have (k −1)m batches of size u and m batches of size v and thus will have nearly equal batch sizes. Other choices of i1; : : : ; ik0 are also possible.

Case (C): Now suppose r ¿ 0 and (m − 1)r is odd. In this case b − b0= km=2, where k is an odd positive integer. The following steps will lead to a (nearly) equal batch-size design.

Step (1): Let A be the m × m array obtained from (4.1). Let A1 be the m=2 × m array of A consisting of the rst m=2 × m subarray of A. Also let A2 be the m=2 × m array that is the “re ection” of A1 as described earlier.

Let the array D1=A1. If r ¿ 1, then for j¿2 dene Dj recursively by Dj=Dj−1◦A1 if j is odd, and Dj= Dj−1◦ A2 if j is even. Each pair of unknown specimens appears as neighbors exactly r times in the m=2 rows of the array Dr. Also Dr is an array of size m=2 × n1, where n1= m + (m − 1)(r − 1).

Step (2): If q equals zero, then go to Step (3). Suppose q ¿ 0. By using Properties (1) and (2) of the array A described previously, it is easily checked that for each i, i = 1; : : : ; n1=2, the pair of columns, column i and column n1+ 1 − i, together contains all m dierent unknown specimens exactly once. Now enlarge the current array Dr by repeating next to themselves selected pairs of columns, column i and column n1+1−i for various i, such that the total number of additional pairs of columns added is q. It makes no dierence which pairs of columns are selected to be repeated or how many times a given pair of columns is repeated so long as the total number of added pairs is q.

At this point the array Dr has m=2 rows and n = n1+ 2q columns. We also note that, for each i, the unknown Ui occurs next to itself exactly q times among its rows. Moreover, each unknown occurs in the array exactly q+n1=2=q+m=2+(m−1)(r−1)=2

(20)

times. Since t = q + n1=2 + (k − 1)=2 we need (k − 1)=2 additional observations on each unknown.

Step (3): If k = 1 then go to Step (4). Suppose k ¿ 1. The current array Dr is of size m=2 × n, where n = 2q + m + (m − 1)(r − 1). Each pair of columns, i and n + 1 − i, still contains all m dierent unknown specimens, for i =1; 2; : : : ; n=2. Let k0=(k −1)=2. Suppose

16i1¡ i2¡ · · · ¡ ik0−1¡ ik06n=2:

For each j = i1; : : : ; ik0, replace columns cj and cn+1−j by a block of 3 columns, cj|s|cj

and cn+1−j|s|cn+1−j, respectively. At the end of this step each unknown specimen occurs exactly t times in the rows of the modied array Dr.

Step (4): Add a column of S’s before the rst column of the current array Dr and a single S following the element in the last row and last column. This yields a (nonrectangular) array with m rows. Concatenate the rows of this array to produce a single row of standards and unknowns. This leads to a balanced measurement design meeting all requirements except that it has b0− 2 fewer measurements on standards than what is required and also has b0− 2 fewer empty batches. If b0= 2 then we are done. If b0¿ 2, then we introduce b0− 2 additional standards in the design sequence by placing them adjacent to already occurring standards. Many dierent choices will be available here but any one of them will work. This yields the nal design.

Note: The choice of the columns i1¡ i2¡ · · · ¡ ik0 in Step (3) determines the batch

sizes in the nal design. Let 2t = (k − 1)u + v where u and v are nonnegative integers and |u−v| is as small as possible. Choose ij=ju−(j −1) for j =1; : : : ; k0. This choice leads to (k − 1)m=2 batches of size u and m=2 batches of size v, thus producing nearly equal batch sizes. Other choices of the columns are also possible leading to a dierent denition of batches.

The following three examples are given to illustrate the algorithm.

Example 4.2. Suppose m = 5, r = 0, q = 3, t = 5, b = 12, and b0= 2. A balanced measurement design with these parameters can be constructed according to Case (A) of the algorithm above. Note that the value of k here is 2 and also that t = 5 = uk + v with u = 2 and v = 1. Hence construct the following array D rst.

D = U1 U1 U1 U1 U1 U2 U2 U2 U2 U2 U3 U3 U3 U3 U3 U4 U4 U4 U4 U3 U5 U5 U5 U5 U3

(21)

After adding the standards we get D = S U1 U1 S U1 U1 U1 S U2 U2 S U2 U2 U2 S U3 U3 S U3 U3 U3 S U4 U4 S U4 U4 U3 S U5 U5 S U5 U5 U3 S

Concatenating the rows of the above array gives us the required measurement design, viz.,

S U1 U1 S U1 U1 U1 S U2 U2 S U2 U2 U2 S U3 U3 S U3 U3 U3 S U4 U4 S U4 U4 U4 S U5 U5 S U5 U5 U5 S

Example 4.3. Suppose m = 5, r = 3, q = 1, t = 9, b = 14 and b0= 4. A balanced measurement design satisfying these parameters can be obtained from Case (B) of the algorithm described above.

Step (1): From Example 4.1,

A1...A2= U5 U1 U4 ... U4 U1 U5 U1 U2 U5 ... U5 U2 U1 U4 U5 U3 ... U3 U5 U4 U2 U3 U1 ... U1 U3 U2 U3 U4 U2 ... U2 U4 U3 Since r = 3, Dr= A1◦ A2◦ A1= U5 U1 U4 U1 U5 U1 U4 U1 U2 U5 U2 U1 U2 U5 U4 U5 U3 U5 U4 U5 U3 U2 U3 U1 U3 U2 U3 U1 U3 U4 U2 U4 U3 U4 U2

Step (2): Since q = 1, repeating the last column of the current Dr gives

Dr= U5 U1 U4 U1 U5 U1 U4 U4 U1 U2 U5 U2 U1 U2 U5 U5 U4 U5 U3 U5 U4 U5 U3 U3 U2 U3 U1 U3 U2 U3 U1 U1 U3 U4 U2 U4 U3 U4 U2 U2

(22)

Step (3): b − b0= 10 = km, with k = 2. Since t = 9, we have u = 5 and v = 4. The 5th column U5 U1 U4 U2 U3

is replaced by the block of three columns U5 S U5

U1 S U1 U4 S U4 U2 S U2 U3 S U3

and we obtain the array

U5 U1 U4 U1 U5 S U5 U1 U4 U4 U1 U2 U5 U2 U1 S U1 U2 U5 U5 U4 U5 U3 U5 U4 S U4 U5 U3 U3 U2 U3 U1 U3 U2 S U2 U3 U1 U1 U3 U4 U2 U4 U3 S U3 U4 U2 U2

Step (4): Add a column of S’s before the rst column and a single S at the end of the last row and get the following nonrectangular array.

S U5 U1 U4 U1 U5 S U5 U1 U4 U4 S U1 U2 U5 U2 U1 S U1 U2 U5 U5 S U4 U5 U3 U5 U4 S U4 U5 U3 U3 S U2 U3 U1 U3 U2 S U2 U3 U1 U1 S U3 U4 U2 U4 U3 S U3 U4 U2 U2 S

Concatenating the rows of this array produces the following design:

S U5 U1 U4 U1 U5 S U5 U1 U4 U4 S U1 U2 U5 U2 U1 S U1 U2 U5 U5 S U4 U5 U3 U5 U4 S U4 U5 U3 U3 S U2 U3 U1 U3 U2 S U2 U3 U1 U1 S U3 U4 U2 U4 U3 S U3 U4 U2 U2 S

Since b0 = 4, b0 − 2 = 2, and we add two additional standards each adjacent to a previously occurring standard. The nal design is given below.

S U5 U1 U4 U1 U5 S U5 U1 U4 U4 S S U1 U2 U5 U2 U1 S U1 U2 U5 U5 S U4 U5 U3 U5 U4 S U4 U5 U3 U3 S S U2 U3 U1 U3 U2 S U2 U3 U1 U1 S U3 U4 U2 U4 U3 S U3 U4 U2 U2 S

Example 4.4. Suppose m = 6; r = 3; q = 0; t = 10; b = 17 and b0= 2. A balanced measurement design satisfying these parameters can be obtained from Case (C) of the algorithm as follows.

(23)

Step (1): From Example 4.1, A1...A2= U6 U1 U5 U2 U4 U3 ... U3 U4 U2 U5 U1 U6 U1 U2 U6 U3 U5 U4 ... U4 U5 U3 U6 U2 U1 U5 U6 U4 U1 U3 U2 ... U2 U3 U1 U4 U6 U5 Since r = 3, Dr= A1◦ A2◦ A1= U6 U1 U5 U2 U4 U3 U4 U2 U5 U1 U6 U1 U5 U2 U4 U3 U1 U2 U6 U3 U5 U4 U5 U3 U6 U2 U1 U2 U6 U3 U5 U4 U5 U6 U4 U1 U3 U2 U3 U1 U4 U6 U5 U6 U4 U1 U3 U2 Step (2): q = 0. Keep the current array. We have n1= 16.

Step (3): b − b0= 15 = km=2, with k = 5. Since t = 10, we get 2t = (k − 1)4 + 4 so u = 4 and v = 4. Each row of Dr is split at the 4th, 7th, 10th and 13th elements and this gives

U6 U1 U5 U2 S U2 U4 U3 U4 S U4 U2 U5 U1 S U1 U6 U1 U5 S U5 U2 U4 U3 U1 U2 U6 U3 S U3 U5 U4 U5 S U5 U3 U6 U2 S U2 U1 U2 U6 S U6 U3 U5 U4 U5 U6 U4 U1 S U1 U3 U2 U3 S U3 U1 U4 U6 S U6 U5 U6 U4 S U4 U1 U3 U2

Step (4): Add a column of S’s before the rst column and a single S at the end of the last row to get the following nonrectangular array:

S U6 U1 U5 U2 S U2 U4 U3 U4 S U4 U2 U5 U1 S U1 U6 U1 U5 S U5 U2 U4 U3 S U1 U2 U6 U3 S U3 U5 U4 U5 S U5 U3 U6 U2 S U2 U1 U2 U6 S U6 U3 U5 U4 S U5 U6 U4 U1 S U1 U3 U2 U3 S U3 U1 U4 U6 S U6 U5 U6 U4 S U4 U1 U3 U2 S

Concatenating the rows of the above array we get the design sequence shown below. S U6 U1 U5 U2 S U2 U4 U3 U4 S U4 U2 U5 U1 S U1 U6 U1 U5

S U5 U2 U4 U3 S U1 U2 U6 U3 S U3 U5 U4 U5 S U5 U3 U6 U2 S U2 U1 U2 U6 S U6 U3 U5 U4 S U5 U6 U4 U1 S U1 U3 U2 U3 S U3 U1 U4 U6 S U6 U5 U6 U4 S U4 U1 U3 U2 S

Since b0= 2, no more standards need to be added. 5. Concluding remarks

In this paper we have carried out a detailed analysis of exact optimum balanced mea-surement designs for three or more unknowns, assuming an AR(1) error structure. For the case of less than three unknowns the reader is referred to Taylor (1989). Although the measurement design problem presented here is equivalent to the treatment-control design problem considered by others, the optimum designs derived in this paper are not obtainable from existing results.

(24)

It is customary to prove results that guarantee that certain families of designs ex-hibiting some balance properties are optimal among all competing designs. However, designs exhibiting such balance may not always exist for user specied values of N, the total number of runs, and m, the number of unknowns. Therefore, we have focused on the actual construction of balanced optimum measurement designs with the user having complete freedom to choose the values for m and N. Special cases of optimal balanced designs may turn out to be optimum among all possible designs, but we have not pursued this problem here. We do, however, point out how lower bounds for the trace criterion may be obtained by using the well known symmetrization argument. The calculations required for obtaining optimum balanced designs and lower bounds for the trace criterion are easily implemented on a computer.

In principle the approach can be extended to higher-order autoregressive error struc-tures but an analytical treatment of exact optimum designs seems unfeasible. In these cases it may be necessary to consider approximately optimum designs. In practical applications low order AR(1) error models are often useful as approximations to the actual serial correlation structure. Thus the designs presented in this paper can provide useful information in these cases as well.

Appendix

Proof of Theorem 3.1. Suppose −1 ¡ ¡ 0. The equality in (2.7) implies that b − b0= mx for some x where x must be a positive integer when (m − 1)r is even and a positive odd multiple of 1=2 when (m − 1)r is odd. We substitute b − b0= mx in the expression for the trace in (3.1) and

Trace(C−1_{) =} m − 1 t(1 − )2₊2x m + mr + 1 t(1 − )2₊2x m − mt 2₍₁₋₎3 2+(N−2)(1−) : (A.1) We minimize this expression with respect to allowable values of x0 and r for each xed t. We consider several cases.

(1) Suppose m is odd and b − 2¿m. For a xed r, (A.1) is minimized by a small x. Due to the constraints given in (2.7)–(2.9) and since (m − 1)r is even in this case for every allowable r; x must be a positive integer in the interval

max 1 m; 1 − (m − 1)r 2 6x6min t −(m − 1)r₂ ;b − 2_m :

Hence x must be chosen to be 1. We substitute x = 1 in the expression for the trace in (A.1) and minimize the resulting expression with respect to r. Again, since is negative, we must choose the smallest allowable value for r. Since r must belong to the interval given by

max

0;2(m − b + 2)_{m(m − 1)}

6r62(mt − 1)_{m(m − 1)}

and since the lower bound for r in this case is zero, we take r =0. Therefore, we have r = 0 and b0= b − m.

(25)

(2) Suppose m is an even number greater than 2t and b − 2¿m. Here we consider two subcases – r odd and r even.

If r is odd, then (m − 1)r is odd, so x must be an odd positive multiple of 1=2. However, since m is greater than 2t, it can be easily veried that there are no such values for x and hence a balanced design does not exist in this case.

If r is even, then (m − 1)r is even, so x must be a positive integer. As in case (1) x must be chosen to be 1. Substituting x = 1 in (A.1) and minimizing with respect to r yields r = 0. So the solution in this case is given by r = 0 and b0= b − m.

(3) Suppose m is an even number less than or equal to 2t and b − 2¿m. Again we consider two subcases – r odd and r even.

If r is odd, then (m − 1)r is odd, so x must be an odd positive multiple of 1=2. The smallest allowable value for x in this case is x = 1=2. Substituting x = 1=2 in (A.1) and minimizing with respect to r gives r = 1. Therefore the solution in this subcase is r = 1 and b0= b − m=2.

If r is even, then (m − 1)r is even, so x must be a positive integer. As in case (2) x must be chosen to be 1. Substituting x = 1 in (A.1) and minimizing with respect to r yields r = 0. So the solution in this subcase is given by r = 0 and b0= b − m.

Putting the two subcases together we nd that the optimum design is the better of the two designs obtained in the two subcases.

(4) Here we have 1=26(b − 2)=m ¡ 1 and m is an even integer less than or equal to 2t. If r is an even integer we nd that the set of allowable values for x is empty. So r must be chosen to be odd. For a xed odd positive r the trace is minimized by taking x = 1=2. Substituting this in (A.1) and minimizing with respect to r gives r = 1. The solution in this cases is then r = 1 and b0= b − m=2.

(5) Clearly no balanced design exists in the remaining cases because the set of allowable values for (x; r) is empty for these cases.

Hence we have proved Theorem 3.1.

Proof of Theorem 3.2. As in the case of Theorem 3.1, we have b − b0= mx where x is an integer when (m − 1)r is even and it is an odd positive multiple of 1=2 when (m − 1)r is odd. Substituting mx for b − b0 in (3.1) gives us the expression in (A.1). We now consider each of the cases.

(1) Here we have m even and 1=26(b − 2)=m6t. We consider four subcases. Suppose r is an odd integer in the interval

max

0;2(m − b + 2)_{m(m − 1)}

6r62(mt − b + 2)_{m(m − 1)} : (A.2)

Then x must be an odd positive multiple of 1=2. The expression in (A.1) is minimized with respect to x by choosing x to be as large as possible within its range of allowable values. It is easy to verify that x cannot exceed (b−2)=m under this case, so we choose x to be the largest odd positive multiple of 1=2 that is less than or equal to (b − 2)=m. Call this value x0. We substitute x = x0 in (A.1) and minimize the resulting expression with respect to r. Since is positive, we must pick r to be as large as possible.

(26)

So r is chosen to be equal to r0, the largest odd positive integer in the interval in (A.2).

Suppose r is an even integer in the interval in (A.2). Then x must be a positive integer. The expression in (A.1) is minimized with respect to x by choosing x to be as large as possible within its range of allowable values. Under this subcase, we must then pick x to be the largest integer, say x0, less than or equal to (b − 2)=m. We substitute x = x0 in (A.1) and minimize the resulting expression with respect to r. Again, we must pick r as large as possible. So we chose r = r0, the largest even nonnegative integer in the interval in (A.2).

Suppose r is an odd integer in the interval 2(mt − b + 2)

m(m − 1) ¡ r6

2(mt − 1)

m(m − 1): (A.3)

Then x is an odd positive multiple of 1=2 within its range of allowable values. It is easily veried that we must chose x to be equal to x0 where x0 is given by x0= t − (m − 1)r=2. We then substitute x = x0 in (A.1) and minimize the resulting expression with respect to r. By examining the rst derivative of this expression with respect to r it can be veried that it is increasing for values of r in the interval in (A.3). So we choose r to be the smallest odd integer, say r0, in the interval given in (A.3). This leads to a design for which r = r0 and b0= b − mt + m(m − 1)r0=2.

Suppose r is an even integer in the interval in (A.3). Then x must be a positive integer within its range of allowable values. Again this leads to the choice x=x0 where x0 is t − (m − 1)r=2. Substituting this value for x in (A.1) and minimizing with respect to r leads to the choice r = r0 where r0 is the smallest even integer in the interval in (A.3).

By examining the best designs obtained for the above four subcases we choose the overall best design when m is an even integer with 1=26(b − 2)=m6t. This proves part (1) of Theorem 3.2.

Suppose m is odd and 16(b − 2)=m6t. Here we consider two subcases. For values of r in the interval in (A.2) x must be the largest positive integer, say x0, less than or equal to (b − 2)=m. Then r must be the largest integer, say r0, in the interval in (A.2). For values of r in the interval in (A.3) x must be equal to x0 were x0 is the value t −(m−1)r=2. Then r must be the smallest integer in the interval in (A.3) because the trace can be shown to be a nondecreasing function of r for values of r in the interval in (A.3).

Putting the two subcases together we obtain the result in part (2) of Theorem 3.2. For case (3) of Theorem 3.2, we have (b − 2)=m ¿ t. Here x must be chosen as x0= t − (m − 1)r=2. We substitute this expression for x in (A.1) and note that the resulting expression is a nondecreasing function of r for all allowable values of r. So we pick the smallest possible value of r, i.e., we choose r = 0. Hence x = t and b0= b − mt.

It is easy to verify that a balanced design does not exist in the remaining cases. This concludes the proof of Theorem 3.2.

(27)

References

Berenblut, R.E., Webb, G.I., 1974. Experimental design in the presence of autocorrelated errors. Biometrika 61, 427.

Cheng, C.S., 1983. Construction of optimal balanced incomplete block designs for correlated observations. Ann. Statist. 11, 204–209.

Cheng, C.S., 1988. A note on the optimality of semibalanced arrays. In: Dodge, Y., Fedorov, V.V., Wynn, H.P. (Eds.), Optimal Design and Analysis of Experiments. North-Holland, Amsterdam, pp. 115–122. Cutler, D.R., 1993. Ecient block designs for comparing test treatments to a control when the errors are

correlated. J. Statist. Plann. Inference 36, 107–125.

Fedorov, V.V., 1972. Theory of Optimal Experiments. Academic Press, New York.

Hedayat, A.S., Jacroux, M., Majumdar, D., 1988. Optimal designs for comparing test treatments with controls. Statist. Sci. 3, 462–476.

Ipinyomi, R.A., 1986. Equineighboured experimental designs. Aust. J. Statist. 28, 79–88. Kiefer, J., 1960. Optimum experimental designs. J. Roy. Statist. Soc. Ser. B 21, 272–319.

Kiefer, J., 1975. Construction and optimality of generalized youden designs. In: Srivastava, J.N. (Ed.), A Survey of Statistical Design and Linear Models, pp. 333–353.

Kiefer, J., Wynn, H.P., 1981. Optimal balanced block and latin square designs for correlated observations. Ann. Statist. 9, 737–757.

Kiefer, J., Wynn, H.P., 1983. Autocorrelation-robust design of experiments. In: Leonard, T., Wu, C.F. (Eds.), Scientic Inference, Data Analysis and Robustness, p. 279.

Kiefer, J., Wynn, H.P., 1984. Optimum and minimax exact treatment designs for one dimensionl autoregressive error processes. Ann. Statist. 12, 414–450.

Kunert, J., 1985a. Optimal experimental design when the errors are assumed to be correlated. Statist. Decisions. Suppl. 2, 287–298.

Kunert, J., 1985b. Optimal repeated meaurements designs for correlated observations and analysis by weighted least squares. Biometrika 72, 375–389.

Kunert, J., 1987. Neighbour balanced block designs for correlated errors. Biometrika 74, 717–724. Kunert, J., Martin, R.J., 1987. On the optimality of nite Williams II(a) designs. Ann. Statist. 15, 1604–1628. Martin, R.J., 1982. Some aspects of experimental design and analysis when errors are correlated. Biometrika

69, 597–612.

Martin, R.J., Eccleston, J.A., 1991. Optimal incomplete block designs for general dependence structures. J. Statist. Plann. Inference 28, 67–81.

Majumdar, D., 1988. Optimal repeated meaurements designs for comparing test treatments with control. Commun. Statist. — Theory Meth. 17, 3687–3703.

Pepper, M.P.G., 1973. A calibration of instruments with non-random errors. Technometrics 15, 587–599. Pigeon, G., Raghavarao, D., 1987. Crossover designs for comparing treatments with a control. Biometrika

74, 321–328.

Russell, K.G., Eccleston, J.A., 1987a. The construction of optimal balanced incomplete block designs when adjacent observations are correlated. Aust. J. Statist. 29, 84–90.

Russell, K.G., Eccleston, J.A., 1987b. The construction of optimal balanced incomplete block designs when observations within a block are correlated. Aust. J. Statist. 29, 293–302.

Siddiqui, M.M., 1958. On the inversion of the sample covariance matrix in a stationary autogressive process. Ann. Math. Statist. 29, 585–588.

Taylor, C.H., 1989. Optimal measurement designs when errors are correlated. Ph.D. Dissertation, Colorado State University.

Williams, E.J., 1949. Experimental designs balanced for the estimation of residual eects and treatments. Aust. J. Sci. Res. 2, 149–168.

Williams, E.R., 1985. A criterion for the construction of optimal neighbour designs. J. Roy. Statist. Soc. Ser. B 47, 489–497.