• 沒有找到結果。

Indices for Multivariate Processes

In this subsection, consider a multivariate process with k quality characteristics. Sup-pose X1, . . . , Xn are ni i.i.d. k × 1 random vectors of observations. ¯X is a k × 1 vector representing the sample mean of X1, . . . , Xn.

Assuming the process data X follows a multivariate normal distribution with mean µ and variance-covariance matrix Σ, denoted by X ∼ Nk(µ, Σ). Chan et al. [6] proposed an index for measuring how far the process mean µ is from the target value T as

Cepm =

µ k

E[(X − T )TΣ−1(X − T )]

1/2 .

Pearn et al. [16] introduced two multivariate PCIs, which are viewed as more nat-ural generalizations of Cpm than the one proposed by Chan et al. [6]. They defined a multivariate Cp index as

kCp2 = c2 χ2k,0.0027,

where χ2k,α is the upper α quantile of a chi-square distribution with degrees of freedom k, and c is a constant satisfying P {(X − T )TΣ−1(X − T ) ≤ c2} = 0.9973. Analogously, they defined a multivariate Cpm index by

kCpm2 = kCp2

1 + (µ − T )TΣ−1(µ − T )/k. (1) Hubele et al. [13] proposed an index vector (CP M, P V, LI)T for bivariate normal pro-cesses. The first component

CP M = area of specification

area of modified process region =

The modified process region is the smallest rectangle that can circumscribe 100(1−α)% of the process distribution (see Fig 1). The edges of the modified process region are defined as the lower and upper process limits, LP Li and UP Li, i = 1, 2. These four values can be obtained by solving the system of equations of first derivatives with respect to each xi

of

(X − µ)TΣ−1(X − µ) = χ22, α, where X = (X1, X2)T and µ = (µ1, µ2)T.

The solutions are

UP L1 = µ1+ s

χ22,αdet(Σ−11 )

det(Σ−1) , LP L1 = µ1 s

χ22,αdet(Σ−11 ) det(Σ−1) , UP L2 = µ2+

s

χ22,αdet(Σ−12 )

det(Σ−1) , LP L2 = µ2 s

χ22,αdet(Σ−12 ) det(Σ−1) ,

where Σ−1i , i = 1, 2, is the matrix obtained from Σ−1 by deleting the ith row and column.

The meaning of this component is analogous to that of Cp, measuring the variation of product characteristics relative to the specifications.

Figure 1: Explaining diagram of CP M

The second component is the p-value of testing the difference between the center of specification (target value T ) and the process mean. Let the null hypothesis H0 : µ = T , the Hotelling T2 statistic [11] is

T2 = n( ¯X − T )TΣˆ−1( ¯X − T ),

where ¯X is the sample mean and ˆΣ is the usual sample variance-covariance matrix of process data. Since n − 2

2(n − 1)T2 follows F2,n−2 distribution under null hypothesis, the p-value-based component P V is defined as

P V = P

³

T2 2(n − 1)

n − 2 F2,n−2,α

´ ,

where F2,n−2,α stands for the 100(1 − α)% percentile of F distribution with degrees of freedom 2 and n − 2. This component measures the distance of the process mean and the

target value. If the process mean is close to the target value, P V will be close to 1.

The third component LI provides the information about the location of the modified process region relative to the specification, defined as

LI = max

³

1, |U P LU SL11−U SL−LSL11|, |LP LU SL11−LSL−LSL11|, |U P LU SL22−U SL−LSL22|, |LP LU SL22−LSL−LSL22|

´ .

If this component is equal to 1, then the entire modified process region falls within or on the specification. If the component is greater than 1, then some or all modified process region falls out of the specification.

This index vector contains three components summarizing the size and location of process contour related to the specification.

Taam et al. [18] proposed an index as the ratio of two areas

Cfp = Area(R1)

Area(R2) = Area(modified specification)

Area(99.73% process region), (2)

Figure 2: Explaining diagram of fCp

where R1 is a modified specification, which is the largest ellipsoid that is centered at the target value and completely within the original specification, R2 is an elliptical region containing 99.73% of the bivariate normal distribution. This index is an extension of the univariate Cp for bivariate processes. Considering the shift of process mean from the

target value T , Taam et al. [18] further modified this index by taking an adjustment factor D into account and defined a Cpm index for two quality characteristics as follows:

MCpm = Cfp

0 < D−1 < 1 measures the closeness between the process mean and the target value. A larger value of 0 < D−1 < 1 implies that the process mean is closer to the target value.

Chen [7] proposed an index using the concept of a specification zone expressed as V (r0) = {x ∈ Rk: h(x − µ0) ≤ r0}, (3) where h(·) is a nonnegative homogeneous scalar function satisfying the condition h(tx) = th(x) for all t > 0 and r0 is a positive number. A process is considered capable if P (X ∈ V (r0)) ≥ 1 − α, where α is the allowable expected proportion of non-conforming products. Let r = min{c : P (X ∈ V (c)) ≥ 1 − α}. Then a process is considered to be capable if and only if r ≤ r0. This leads one to express an index for multivariate process in the form

MCp = r0 r .

According to Chen [7], this definition provides the following advantages: (i) allowing flexible specifications as general as given by V (r0) in (3), (ii) assuming no conditions on the underlying distribution, and (iii) permiting flexibility in setting a criterion for the capability of a process. For example, consider a rectangular specification zone

W = {x ∈ Rk: |xi− µi| ≤ ri, i = 1, . . . , k},

where µ is the process mean and ri’s are positive constants. One can derive an alternative definition of MCp as

MCp = 1 r, where r is a constant satisfying P

³

max{|Xi− µi|/ri, i = 1, . . . , k} ≤ r

´

= 1 − α. If MCp ≥ 1, the process is capable at 100(1 − α)% confidence level.

Pal [15] proposed an index defined as follows:

CP B = SR

Ap = (USL1− LSL1)(USL2− LSL2) πχ22,0.0027p

σ12σ22− σ212 ,

where SR represents the area of the specification rectangle and Ap represents the 99.73%

area of the process region. This index is, in fact, an extension of the index (2) proposed by Taam et al. [18]. It is an area ratio of a rectangular region over an elliptical region while Taam et al. [18] used an elliptical region over another elliptical region as the area ratio.

Bothe [2] proposed a multivariate Cpk index defined as MCpk = ZPT

3 ,

where ZPT is the PTth percentile of the standard normal distribution, and PT is defined as

PT = 1 − ((1 − PQC1)(1 − PQC2) · · · (1 − PQCk))k1

with PQCi, i = 1, . . . , k, being the non-conforming rate of the ith quality characteristic.

However, this index is designed only for independent process characteristics.

Wang and Du [19] proposed a method using principal component (PC) analysis to describe the performance of a process of multiple characteristics. In that paper, the pro-cedures of obtaining the indices for normal data as well as non-normal data are described in the following:

and the elements in the above expressions are given in the following.

Suppose S is a non-singular k × k sample variance-covariance matrix. LSL and U SL are k × 1 vectors of lower and upper specification limits, respectively. Using spectral de-composition, we can obtain a matrix D = UTSU , where D = diag(λ21, λ22, . . . , λ2k) with λ21

λ22 ≥ · · · ≥ λ2k being the eigenvalues of S, and the columns of U , u1, u2, . . . , uk, are the associated eigenvectors. As a result,

SP Ci = λi, ¯XP Ci = uTiX,¯

USLP Ci = uTi U SL, LSLP Ci = uTi LSL, i = 1, . . . , k, d = 1k¯ 1

n Pk i=1

Pn j=1

¯¯

¯uiTXj USLP Ci+ LSLP Ci 2

¯¯

¯ .

Here we remark that the numerators of dMCp and dMCpk seem somewhat unreasonable, since the vectors U SL and LSL no longer represent upper or lower bounds of the spec-ification region in the directions of principal components. As a result, USLP Ci− LSLP Ci

sometimes may even become negative.

Wang et al. [20] compared three process capability indices: (CP M, P V, LI)T pro-posed by Hubele et al. [13], MCpm proposed by Taam et al. [18], and MCp proposed by Chen [7]. They summarized that, in general, the multivariate indices could be obtained from (i) the area ratio of a specification region to a process region, (ii) the probability of a non-conforming product, and (iii) other approaches using loss functions or vector representation. The purpose of Wang et al. [20] is to illustrate the distinctions among the various meanings of capability in the multivariate case.

The purpose of this paper is to study yield related PCIs for multivariate processes.

As mentioned in Section 1, BCpk and BCp proposed by Castagliola and Castellanos [4]

are such indices. We shall give a more detailed review on these indices and then present how we would extend BCpk to higher dimensions and how to modify BCp to become scale-invariant in the later sections. And the last but not the least, we will provide methodologies on how to compute these indices.

3 Multivariate C

pk

Index

3.1 Yield Measuring Index for Processes with Multiple Charac-teristics

In this subsection, we first introduce the bivariate Cpk index, BCpk, proposed by Castagliola and Castellanos [4]. Then we provide the link between BCpk and yield. More-over, we extend this index to higher dimensions.

3.1.1 Alternative Definition of Cpk

Assume that the quality characteristic X of a product item is a N(µ, σ2) random vari-able. Let [LSL, USL] be the corresponding lower and upper specification limits. Equiv-alent to the definition of Kane [14], an alternative definition for Cpk was proposed by Castagliola and Castellanos [4]. This definition is based on the lower and upper propor-tions of non-conforming products pL = P (X ≤ LSL) and pU = P (X ≥ USL). Since X

∼ N(µ, σ2), pL = Φ(LSL−µσ ) and pU = Φ(−U SL+µσ ), where Φ is the cumulative distribution function (c.d.f.) of the standard normal distribution. Moreover, since the cumulative distribution function Φ is a strictly increasing function of the random variable, Cpk is equivalent to

1

3min{−Φ−1(pU), −Φ−1(pL)}. (4) Similarly, the Cp in Kane [14] is equivalent to

1 6

³

−Φ−1(pU) − Φ−1(pL)

´ .

3.1.2 Definition of BCpk

Let X1 and X2 be the quality characteristics of interests with the specification limits [LSL1, USL1] for X1 and [LSL2, U SL2] for X2. These limits define a rectangular specifi-cation area A. Assume that X = (X1, X2)T follows a bivariate normal distribution with mean µ = (µ1, µ2)T and variance-covariance matrix Σ. Applying eigenvalue-eigenvector decomposition to Σ, we obtain two eigenvalues λ21 ≥ λ22 > 0 and the associated eigenvec-tors, v1 and v2. Let R = [v1, v2], then RTR = I and Σ can be expressed as Σ = RV RT,

where V is the diagonal matrix with diagonal elements λ21 and λ22. In fact, the matrix R represents the rotation matrix that rotates the original axes to the main axes of the bivariate normal distribution (see Figure 3), v1 and v2 correspond to the main axes, and λ21 and λ22 are the variances on these main axes. More specifically, if we let Si = viTX, then Si ∼ N(viTµ, λ2i), i = 1, 2, and S1 and S2 are independent. Suppose we move the origin to the process mean µ and have the two new axes being in the directions of v1 and v2. Then the two main axes divide the plane into four regions, A1, A2, A3, and A4. Obviously, P (X ∈ Ai) = 1/4, i = 1, . . . , 4. Denoting the specification region by A and Qi = Ai∩ A, i = 1, . . . , 4. Let qi = P (X ∈ Qi), i = 1, . . . , 4. Then the probability that X is in Ai but not in the specification region is pi = 1/4 − qi (see Figure 3).

Figure 3: Explaning diagram of BCpk

By analogy to the alternative definition of Cpk given in (4), Castagliola and Castel-lanos [4] defined a bivariate Cpk as

BCpk = 1

3min(−Φ−1(2p1), −Φ−1(2p2), −Φ−1(2p3), −Φ−1(2p4)).

This definition is similar to the alternative definition of Cpk, except that 0 ≤ pi ≤ 1/4, i = 1, . . . , 4, in the bivariate case, while 0 ≤ pu(or pL) ≤ 1/2 in the univarite case. We extend this definition to higher dimensions later.

3.1.3 Non-conforming Rate Based on BCpk

According to the definition of BCpkin the last subsection, we can establish a connection between the non-conforming rate (%NC) and BCpk. First note that

BCpk = 1

3min(−Φ−1(2p1), −Φ−1(2p2), −Φ−1(2p3), −Φ−1(2p4))

= −1

3max(Φ−1(2p1), Φ−1(2p2), Φ−1(2p3), Φ−1(2p4)).

Since Φ−1(·) is a strictly increasing function, BCpk = −1

−1(2pmax),

where pmax = max(p1, p2, p3, p4). Φ−1(·) is a one-to-one function, so pmax = 1

2Φ(−3BCpk). (5)

Note that pmax ≤ %NC ≤ 4pmax. Plugging (5) into this inequality, we obtain 1

2Φ(−3BCpk) ≤ %NC ≤ 2Φ(−3BCpk). (6) Although the lower bound of (6) is quite conservative, it is a convenient bound, mean-ing once the engineer gets a BCpk value, he/she will know the bounds of non-conforming rate. The upper bound is very useful and is not a loose bound, meaning that it is reachable. Usually producers can take the upper bound of the non-conforming rate as an quality assurance to customers. For example, if the process is with BCpk=1.00, one can guarantee that there will be 2700 non-conformities in 1,000,000 product items at most.

Table 1 gives the upper and lower bounds of the non-conforming rate %NC for various values of BCpk. Figure 4 plots the bounds. We can see the bounds drop sharply as BCpk increases and soon levels off when BCpk ≥ 1.33.

The second inequality of (6) is equivalent to

2Φ(3BCpk) − 1 ≤ % yield,

providing a same lower bound for the yield as in the univariate case [3]. The lower bound gives the worst level of the yield for a given BCpk.

0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0

010000300005000070000

BCpk

NCppm

upb lwb

Figure 4: Bounds of non-conformity based on BCpk

Table 1: Bounds of non-conformity based on BCpk BCpk Non-conformities in ppm

lwb upb

0.60 17965.15956 71860.63823 0.80 4098.76796 16395.07185 1.00 674.94902 2699.79606 1.33 16.51832 66.07330

1.50 1.69884 6.79535

1.60 0.39666 1.58666

1.67 0.13608 0.54430

2.00 0.00049 0.00197

3.1.4 Extending Cpk to Higher Dimensions

Now we generalize the alternative definitions of Cpkand BCpkto multivariate processes of k > 2 characteristics. By the same notion for the bivariate case, dividing the space Rk into 2k subregions by the main axes of the k-variate distribution, we can define a multivariate Cpk index as

MCpk = 1

3min(−Φ−1(2k−1p1), −Φ−1(2k−1p2), . . . , −Φ−1(2k−1p2k))

= −1

3max(Φ−1(2k−1p1), Φ−1(2k−1p2), . . . , Φ−1(2k−1p2k))

= −1

−1(2k−1pmax),

where pi is the probability of a randomly selected sample being in the ith subregion, but not meeting the specification and pmax = max(p1, p2, . . . , p2k). Equivalently,

pmax= 1

2k−1Φ(−3MCpk).

Since pmax ≤ %NC ≤ 2kpmax, we can also get an inequality of non-conforming rate in the general multivariate case as

1

2k−1Φ(−3MCpk) ≤ %NC ≤ 2Φ(−3MCpk), which is equivalent to

1 − 1

2k−1Φ(−3MCpk) ≤ %yield ≤ 1 − 2Φ(−3MCpk), (7)

相關文件