• 沒有找到結果。

Two-stage Welsh's trimmed mean for the simultaneous equations model

N/A
N/A
Protected

Academic year: 2021

Share "Two-stage Welsh's trimmed mean for the simultaneous equations model"

Copied!
12
0
0

加載中.... (立即查看全文)

全文

(1)Aust. N. Z. J. Stat. 43(4), 2001, 481–492. TWO-STAGE WELSH’S TRIMMED MEAN FOR THE SIMULTANEOUS EQUATIONS MODEL. Lin-An Chen1∗ , Kuo-Yuan Liang2 and Chwen-Chi Liu3 National Chiao Tung University, Shih Hsin University and Feng Chia University Summary This paper discusses the large sample theory of the two-stage Welsh’s trimmed mean for the limited information simultaneous equations model. Besides having asymptotic normality, this trimmed mean, as the two-stage least squares estimator, is a generalized least squares estimator. It also acts as a robust Aitken estimator for the simultaneous equations model. Examples illustrate real data analysis and large sample inferences based on this trimmed mean. Key words: Aitken estimator; generalized least squares estimator; simultaneous equations model; trimmed mean.. 1. Introduction The conventional method of two-stage least squares is commonly used in econometrics, with simultaneous equations models. Two justifications are frequently associated with its popularity. First, from a computational perspective, it requires only the least squares technique. Second, it is well known that a two-stage least squares estimator (2SLSE) can be interpreted as an Aitken estimator (see e.g. Fomby, Hill & Johnson, 1984 p . 478; Amemiya, 1985 p . 239). More specifically, it implies that after linear transformations of the model, the 2SLSE is a generalized least squares estimator. It is also well known that the 2SLSE is highly sensitive to even a very small departure from normality and to the presence of outliers. Therefore, many robust estimators have been proposed as alternatives to the 2SLSE for simultaneous equations systems (see e.g. Amemiya, 1982; Powell, 1983; Krasker, 1985; Chen & Portnoy, 1996). In this article, we extend the Welsh’s trimmed mean (Welsh, 1987) for linear regression to the simultaneous equations model. Large sample statistical inferences based on this trimmed mean and real data analysis are also provided. We are interested in two aspects of this estimator. First, because its asymptotic distribution is independent of the choice of initial estimator, this trimmed mean can be obtained simply on the basis that an initial estimate can be easily computed. In contrast, the robust estimators above rely on the estimation of regression quantiles and so are computationally much more difficult. Second, we show that this trimmed mean can be interpreted as a robust Aitken estimator in the simultaneous equations model. We introduce the two-stage Welsh’s trimmed mean in Section 2 and develop its large sample distribution in Section 3. Its ability to serve as a generalized least squares estimator is Received March 1999; revised May 2000; accepted August 2000. ∗ Author to whom correspondence should be addressed. 1 Institute of Statistics, National Chiao Tung University, Hsinchu, Taiwan. e-mail: lachen@stat.nctu.edu.tw 2 School of Management, Shih Hsin University, Taipei, Taiwan. 3 Institute of Economics, Feng Chia University, Taichung, Taiwan. Acknowledgments. The authors thank the Technical Editor and the Associate Editor and referee whose valuable comments led to a substantial improvement of the exposition. c Australian Statistical Publishing Association Inc. 2001. Published by Blackwell Publishers Ltd,  108 Cowley Road, Oxford OX4 1JF, UK and 350 Main Street, Malden MA 02148, USA.

(2) 482. LIN-AN CHEN, KUO-YUAN LIANG AND CHWEN-CHI LIU. proved in Section 4. Examples and large sample inferences are presented in Sections 5 and 6, respectively. Finally, the proof of Theorem 4.1 is given in the appendix. 2. Two-stage Welsh’s trimmed mean Consider the simultaneous equations model y = Y 1 β 1 + Z 1 β2 + τ ,. (1). where Y = [y Y1 ] denotes an n × p0 observation matrix of p0 endogenous variables (i.e. dependent variables), Z1 denotes an n × p1 observation matrix of p1 exogenous variables (i.e. independent variables) including an intercept term, and τ denotes a vector of independent and identically distributed (iid) disturbance variables. Let β = (β1 , β2 ) denote the parameter vector which is to be estimated. Let the reduced form of the simultaneous equations model be Y = Z + V , where Z = [Z1 Z2 ] denotes the set of all exogenous variables, Z2 denotes an n × p2 matrix, and rows of V are vectors of iid random variables (v1 , . . . , vp0 ) with zero mean vector and positive definite covariance matrix. Let  = [1 2 ] and V = [V1 V2 ] be partitioned to correspond with the dimensions of [y Y1 ], so that the reduced form can be represented as [y Y1 ] = Z[1 2 ] + [V1 V2 ]. For the simultaneous equations model, the regression quantile of Koenker & Bassett (1978) can be applied, to construct the two-stage trimmed least squares estimator established by Chen & Portnoy (1996). ˆ be an estimator of  . Replacing Y by Z  ˆ and using the parameter restriction Let  2 2 1 2 τ = V γ , where γ = (1, −β1 ), the simultaneous equations model can be rewritten as ˆ −  )β , y = Dn β + V1 − Z( 2 2 1. (2). ˆ Z ]. Let y = (y , . . . , y ) and d T denote the ith row of D , i = where Dn = [Z  2 1 1 n i n ˆ −  )β as regression errors. 1, . . . , n. The two-stage estimation techniques treat V1 − Z( 2 2 1 The Welsh’s trimmed mean is defined on a Winsorized observation with its construction based on an initial estimator of β. It can also be seen that the trimmed mean developed in this section has asymptotic distribution independent of the initial estimator (see Welsh, 1987 for this property in the trimmed mean for the linear regression model). Compared with the trimmed least squares estimator of Koenker & Bassett (1978), which requires computing of the regression quantiles, this estimator has the advantage of computational ease if the initial estimator can be calculated by using the least squares estimator. ˆ −  )β in (2) as regression Let βˆ0 be an initial estimator of β, treating V1 − Z( 2 2 1 errors. The regression residuals are ei = yi − diT βˆ0. (i = 1, . . . , n).. For large sample analysis in this paper, the initial estimator βˆ0 needs to satisfy the assumption n1/2 (βˆ0 − β) = Op (1) (assumption (A5) in the next section). Choices of βˆ0 include consistent root n estimators such as the 2SLSE, the two-stage 1 -norm estimator, minimizing a sum c Australian Statistical Publishing Association Inc. 2001 .

(3) TWO-STAGE WELSH’S TRIMMED MEAN FOR SIMULTANEOUS EQUATIONS MODEL. 483. of absolute values of residual terms that depend upon an initial estimator, and the Koenker– Bassett two-stage trimmed least squares estimator βˆKB . Amemiya (1982) demonstrated the asymptotic properties of the two-stage 1 -norm estimator under the particular normal distribution of the error terms and Powell (1983) provided a general asymptotic theory for it. Chen & Portnoy (1996) gave a general asymptotic theory for the estimator βˆKB . For 0 < α < 0.5, let η(α) ˆ and η(1 ˆ − α) represent the αth and (1 − α)th empirical quantiles of the regression residuals, respectively. A Winsorized observation defined by Welsh (1987) is       yi∗ = yi I η(α) ˆ −α ˆ − α) + η(α) ˆ I ei < η(α) ˆ ≤ ei ≤ η(1     + η(1 ˆ − α) I ei > η(1 ˆ − α) − α (i = 1, . . . , n). (3) Let y ∗ = (y1∗ , . . . , yn∗ )T , and denote the initial-estimator-based trimming matrix by A = diag(a1 , . . . , an ), where ai = I (η(α) ˆ ≤ ei ≤ η(1 ˆ − α)). The Welsh’s trimmed mean (Welsh, 1987) from the linear regression model to the simultaneous equations model is defined as βˆW = (DnT ADn )−1 DnT y ∗ .. 3. Asymptotic normality of two-stage Welsh’s trimmed mean Let fj , Fj and Fj−1 represent the probability density function (pdf), cumulative distribution function (cdf) and inverse cdf, respectively, of vj , for j = 1, . . . , p0 . The following are some assumptions concerning the design matrix Z, the error variables v1 , . . . , vp0 and the first-stage estimator.  Ip1  (A1) n−1 Z T Z = Q + o(1), where Q is positive definite; and the matrix J = 2 , 0p2 ×p1 is full rank.  4 = O(1) for all j. (A2) n−1 ni=1 zij (A3) n−1/4 maxij |zij | = O(1).. (A4) For j = 1, . . . , p0 , fj is symmetric at zero; and fj and fj are both bounded away from 0 in a neighbourhood of Fj−1 (λ) for λ ∈ (0, 1). ˆ = [πˆ , . . . , πˆ ]. For j = 2, . . . , p , (A5) Denote the partition  2 2 p0 0 n1/2 (πˆj − πj ) = Q−1 n−1/2. n . zi ψj (vj i ) + op (1) ,. i=1. where function ψj satisfies E(ψj (vj )) = 0. (A6) n1/2 (βˆ0 − β) = Op (1). Assumptions (A1)–(A4) are standard as given in Ruppert & Carroll (1980), Koenker & Portnoy (1987) and Welsh (1987). We let  = J T QJ . Assumption (A1) shows that  is also ˆ include the non-robust least squares estimator producing positive definite. Examples of  2 ψj (vj ) = vj ; the robust 1 -norm estimator producing ψj (vj ) = fj−1 (0)(0.5 − I (vj < 0)); and the trimmed mean studied by Ruppert & Carroll (1980) producing ψj (vj ) = φ(vj ) with c Australian Statistical Publishing Association Inc. 2001 .

(4) 484. LIN-AN CHEN, KUO-YUAN LIANG AND CHWEN-CHI LIU. Theorem 3.1..  −1 F (α)    j φ(vj ) = vj    −1 Fj (1 − α). n1/2 (βˆW − β) =. if vj < Fj−1 (α), if Fj−1 (α) ≤ vj ≤ Fj−1 (1 − α), if vj > Fj−1 (1 − α)..  1 ˆ −  )β + o (1), d˜i φ(v1i ) −  −1 H ∗ n1/2 ( n−1/2  −1 2 2 1 p 1 − 2α n. i=1. where d˜iT is the ith row of matrix [Z2 Z1 ] and H ∗ = J T Q. Corollary 3.2. The Welsh’s trimmed mean asymptotically has a normal distribution with 2 −1 2 mean 0 and covariance matrix given by σW  , where σW = γ T D(φ ∗ )γ  −1 and φ ∗ = −1 ˆ 2 is the  -norm ((1 − 2α) φ(v1 ), ψ2 (v2 ), . . . , ψp0 (vp0 )). For the case in which  1 −1 ∗ −1 estimator, D(φ ) = D(φ1 ), where  = diag((1 − 2α) , f2 (0), . . . , fp−1 (0)) and 1 1 1 φ1 = (φ(v11 ), 2 − I (v21 < 0), . . . , 2 − I (vp1 1 < 0)). And D(φ1 ) = [cij ], where  2  2  −1  c11 = 2α F1−1 (α) + E v11 I F1 (α) < v11 < F1−1 (1 − α) ,.    c1j = cj 1 = − F1−1 (α)E I v11 < F1−1 (α), vj i < 0    + F1−1 (1 − α)E I v11 > F1−1 (1 − α), vj 1 < 0   

(5) + E v11 I F1−1 (α) < v11 < F1−1 (1 − α), vj 1 < 0 ,   cjj = 41 , cj k = E I (vj 1 < 0, vk1 < 0) − 41 (j, k = 2, . . . , p1 ). ˆ denote the matrix for the α-trimming estimator of  , based on either the Let  2j 2 Welsh’s trimmed mean or the regression quantiles. Then we have matrix D(φ ∗ ) = (1 − 2α)−2 D(φ), where φ = (φ(v11 ), . . . , φ(vp1 )). When we further assume that fj k (x, y) = fj k (−x, −y) for (x, y) ∈ R2 , j, k = 1, . . . , p1 , then D(φ ∗ ) = (1 − 2α)−2 H , where H = [hij ], with  2    hjj = 2α Fj−1 (α) + E vj21 I Fj−1 (α) < vj 1 < Fj−1 (1 − α) , and for j = k with j, k = 1, . . . , p1 ,    hj k = E vj 1 vk1 I Fj−1 (α) < vj 1 < Fj−1 (1 − α), Fk−1 (α) < vk1 < Fk−1 (1 − α)    + 2Fj−1 (α)Fk−1 (α)E I vj 1 < Fj−1 (α), vk1 < Fk−1 (α)    + 2Fj−1 (α)Fk−1 (1 − α)E I vj 1 < Fj−1 (α), vk1 > Fk−1 (1 − α) . 4. Two-stage Welsh’s trimmed mean as a robust Aitken estimator Let (1) be premultiplied by matrix Z T to yield Z T y = Z T [Y1 Z1 ]β + Z T τ . c Australian Statistical Publishing Association Inc. 2001 . (4).

(6) TWO-STAGE WELSH’S TRIMMED MEAN FOR SIMULTANEOUS EQUATIONS MODEL. 485. Denote the variance of variables τ by στ2 . As interpreted by Fomby et al. (1984), the new explanatory variables in (4) consist essentially of sample cross moments between the endogenous variables and the exogenous variables — the former as they appear in (1), the latter as they appear in the entire system. The new explanatory variables divided by sample size n converge in probability to a non-stochastic limit and thus are uncorrelated with the error term appearing in (4). On the other hand, the covariance matrix of Z T τ is στ2 Z T Z which makes the generalized least squares estimation appropriate. Define the transformation Z T y ∗ where y ∗ is given by (3). We see from (4) that   ˆ −  )β + Z T η(α)δ ˆ ˆ − α)δ1−α , (5) Z T y ∗ = Z T ADn β + Z T AV1 − Z T AZ( 2 2 1 α − η(1 ˆ − λ, . . . , I (en < η(λ)) ˆ − λ). where vector δλ = (I (e1 < η(λ)) The induced model for Z T y ∗ is obtained through the large sample representations of ˆ −  )β and (Z T AV , Z T (η(α)δ ˆ ˆ − α)δ1−α )), where the former produces Z T AZ( 1 α − η(1 2 2 1 errors in terms of v2i , . . . , vp0 i and the latter produces errors in terms of v1i . ˆ satisfying assumption (A5), the following is an induced Aitken simulTheorem 4.1. For  2 taneous equations model Z T y ∗ = Z T ADn β + (1 − 2α). n . zi γ T ψ ∗ + op (n1/2 ) .. i=1. The least squares estimator for the above induced model is βˆWLS = (DnT AZZ T ADn )−1 DnT AZZ T Ay. Then n1/2 (βˆWLS − β) has normal asymptotic distribution with zero mean and covariance matrix 2 (J T Q2 J )−1 J T Q3 J (J T Q2 J )−1 , W = σW which implies that the following estimator is a robust Aitken estimator, called Welsh’s generalized trimmed mean, ˆ Z T AD )−1 D T AZ Q ˆ ZTy ∗ , βˆGW = (DnT AZ Q λ n n λ ˆ = λQ + o (1) for some positive constant λ. ˆ satisfies n−1 Q where Q p λ λ Corollary 4.2. The least squares estimator for the induced model is βˆKBLS = (DnT AZZ T ADn )−1 DnT AZZ T Ay , and then n1/2 (βˆKBLS − β) has normal asymptotic distribution with zero mean and covariance matrix W larger than that of βˆGW . Accordingly: (1) The Welsh’s generalized trimmed mean provides another robust Aitken estimator for the limited information simultaneous equations model. c Australian Statistical Publishing Association Inc. 2001 .

(7) 486. LIN-AN CHEN, KUO-YUAN LIANG AND CHWEN-CHI LIU. (2) βˆGW has asymptotic distribution exactly the same as that of the Koenker–Bassett (KB) trimmed least squares estimator (see Chen & Portnoy, 1996), which is then independent of the choice of initial estimator βˆ0 . (3) The least squares estimator for the induced model for Z T y ∗ is (DnT AZZ T ADn )−1 DnT AZZ T y ∗ which has asymptotic distribution exactly the same as that of βˆKBLS . This then implies that, in this induced transformed model, the least squares estimation is less efficient than the generalized least squares estimation. −1 ˆ ˆ = Z T AZ for βˆ The choice Q λ GW satisfying n Qλ = (1 − 2α)Q + op (1) generates the two-stage Welsh’s trimmed mean which implies Theorem 4.3. Theorem 4.3. The two-stage Welsh’s trimmed mean βˆW is a robust Aitken estimator. ˆ = Z T Z makes the following estimator The choice Q λ . DnT AZ(Z T Z)−1 Z T ADn. −1. DnT AZ(Z T Z)−1 Z T y ∗. also a robust Aitken estimator. 5. Example and simulation In this section we contrast the results for different estimates when applied to two datasets. First, consider the model for estimating supply of the commercial banks’ loans to business firms in the United States for 1979–1984 (monthly data). The supply model is y1 = β0 + β1 y2 + β2 z1 + β3 z2 + τ . This is a simultaneous equations model with endogenous variables y1 and y2 and exogenous variables z1 and z2 , where y1 = total commercial loans (billions of US dollars), y2 = average prime rate charged by banks, z1 = 3-months treasury bill rate, z2 = total bank deposits. Economic theory expects β2 to be negative, and β1 and β3 positive. In this simultaneous equations model, there are also two available instrumental variables, z3 and z4 . These variables represent an AAA corporate bond rate and an industrial production index, respectively. The reduced form model then takes a bivariate regression of (y1 , y2 ) on exogenous variables z1 , . . . , z4 associated with the intercept term. For details about the specifications of this model and the data, see Maddala (1988 p . 314). Maddala analysed this dataset through the least squares estimate (LSE) and two-stage least squares estimate (2SLSE). He concluded that quantity supplied is more responsive to changes in interest rates (see these two estimates in Table 1) than is suggested by the LSE. In Table 1, we also display the two-stage 1 estimate (21 ), two-stage trimmed least squares estimator based on regression quantile (βˆKB (α)), for α = 0.05(0.05)0.25, and the two-stage Welsh’s trimmed mean for number of the observations trimmed, nt = 1, 2, . . . , 6. Comparing these estimates in Table 1, we conclude the following: (a) For β1 , β2 and β3 , all robust estimates carry the expected sign. This is consistent with the LSE estimates made by Maddala. However, magnitudes of all robust estimates, in absolute terms, are smaller than those obtained using the 2SLSE method. c Australian Statistical Publishing Association Inc. 2001 .

(8) TWO-STAGE WELSH’S TRIMMED MEAN FOR SIMULTANEOUS EQUATIONS MODEL. 487. Table 1 Estimates for commercial loan data Estimates LSE 2SLSE. 21 βˆKB (α) α = 0.05 = 0.10 = 0.15 = 0.20 = 0.25 βˆW nt = 1 =2 =3 =4 =5 =6. β0. β1. β2. β3. −77.414 −87.988 −88.071. 2.415 6.905 5.102. −1.888 −7.081 −4.706. 0.331 0.334 0.335. −90.723 −84.083 −92.925 −86.117 −91.375. 6.504 5.766 5.888 5.727 6.165. −6.407 −5.730 −5.533 −5.660 −5.773. 0.336 0.333 0.338 0.335 0.334. −87.529 −86.496 −86.806 −84.808 −86.277 −84.518. 6.056 6.329 6.204 6.378 5.839 6.095. −5.928 −6.299 −5.923 −6.180 −5.508 −5.865. 0.334 0.334 0.332 0.331 0.332 0.331. (b) From the residuals computed from the two-stage 1 -norm estimates, we see a few suspect outliers. The 2SLSEs of β1 and β2 are slightly larger than the robust estimates, while the two-stage 1 -norm estimates are slightly smaller. From the theory of estimation, we can expect the 2SLSE to produce the worst estimates and the two-stage 1 -norm to be inefficient. (c) An important advantage of the two-stage Welsh’s trimmed mean is that we can have the actual percentage of trimming close to any specified α. After performing a sequential trimming of βˆKB (α) and βˆW , the two-stage Welsh’s trimmed means remain quite stable in the first six trimming estimates. We then expect that the two-stage Welsh’s trimmed mean with a small number of trimmed observations is appropriate for estimating the parameters. Next, we consider macroeconomic data for 1970–1984 in the United States (see Gujarati, 1988 p. 568) where the income model is y1 = β0 + β1 y2 + β2 z1 + β3 z2 + τ, where y1 = income, y2 = stock of money, z1 = investment expenditure, z2 = government expenditure on goods and services. This model states that income is determined by the endogenous variable y2 and two exogenous variables z1 and z2 . Table 2 displays the various estimates for parameters of this model. Based on Table 2, we can conclude the following: (a) Estimates of 2SLSE with negative signs are larger and those with positive signs are smaller than most of the corresponding robust estimates. This reveals the non-robustness of this least-squares-type estimation method. (b) The two-stage 1 -norm is quite satisfactory for these data. (c) Again, the stability of the two-stage Welsh’s trimmed mean reveals that smaller number trimming is appropriate for these data; however, this property is not shown by trimmed least squares. c Australian Statistical Publishing Association Inc. 2001 .

(9) 488. LIN-AN CHEN, KUO-YUAN LIANG AND CHWEN-CHI LIU. Table 2 Estimates for money income data Estimate 2SLSE. 2 1 βˆKB (α) α = 0.05 = 0.10 = 0.15 = 0.20 = 0.25 = 0.30 βˆW nt = 1 =2 =3 =4 =5 =6. β0. β1. β2. β3. −0.034 −0.052. −0.227 −1.125. 1.379 1.636. 4.090 4.654. −0.053 −0.033 −0.046 −0.044 −0.054 −0.054. −1.168 −0.125 −0.761 −0.638 −1.208 −1.237. 1.704 1.370 1.395 1.323 1.603 1.673. 4.655 4.009 4.538 4.479 4.755 4.714. −0.045 −0.050 −0.051 −0.051 −0.052 −0.052. −0.691 −1.032 −1.080 −1.093 −1.108 −1.145. 1.245 1.599 1.626 1.635 1.613 1.646. 4.626 4.639 4.629 4.627 4.656 4.659. To study two-stage estimators for the simultaneous equations model with asymmetric error distributions, we performed a Monte Carlo simulation for the simple simultaneous equations model y = β0 +β1 y1 +β2 z1 +β3 z2 +τ with reduced form [y y1 ] = [1 z1 z2 ][1 2 ]+ [v1 v2 ]. We let (u1 , u2 ) denote a vector of independent exponential random variables with mean 1. We assume that the error vector in the reduced form follows the following mixture model     1 − ρ2 ρ u1 − 1    with probability 1 − δ,  0 1 u2 − 1 v1.  = v2    s u1 − 1 with probability δ. u2 − 1 This ensures that (v1 , v2 ) has an asymmetric distribution with mean 0 and probability (1 − δ) 1 ρ

(10) from a distribution with covariance matrix , and probability δ from a distribution ρ 1 with covariance matrix s 2 I2 , where large values of s may produce outliers. We take (1 , 2 ) such that βj = 0.5, j = 0, 1, 2, 3, and we use sample size n = 40 and samples (z1 , z2 ) randomly generated from a bivariate normal distribution. With 1000 replications, we generate observations (y, y1 , z1 , z2 ), obeying the assumptions above, and, for estimating parameters βj , j = 0, 1, 2, 3, we compute the two-stage 1 -norm estimates, the two-stage trimmed LSE and the two-stage Welsh’s trimmed mean. Table 3 displays the results in terms of average mean squared errors (MSE). The outliers produced by asymmetric distributions are, in general, unbalanced with respect to the population mean. Therefore none of the estimators is very efficient for estimating the population mean, though the two-stage 1 -norm estimator is relatively less efficient because it, in fact, estimates the population median. On the other hand, both the two-stage Welsh’s trimmed mean and the two-stage trimmed least squares estimator are quite promising and are very competitive in estimating the population mean for this asymmetric distribution. c Australian Statistical Publishing Association Inc. 2001 .

(11) TWO-STAGE WELSH’S TRIMMED MEAN FOR SIMULTANEOUS EQUATIONS MODEL. 489. Table 3 MSE for two-stage estimators Estimator. δ=0. δ = 0.1 s=3. δ = 0.1 s = 10. δ = 0.2 s=3. δ = 0.2 s = 10. 21 βˆKB (α) α = 0.1 = 0.2 = 0.3 ˆ βW nt = 2 =4 =6 =8. 20.70. 7.047. 23.98. 10.370. 35.81. 0.405 0.551 0.395. 0.372 1.94 2.506. 0.436 3.371 2.143. 4.374 0.526 0.911. 1.985 0.814 0.685. 0.155 0.286 0.673 0.993. 0.187 0.396 0.745 1.080. 1.563 2.870 1.632 0.687. 0.173 0.431 0.904 1.480. 2.180 1.436 0.952 0.683. 6. Large sample inference Here we sketch some large-sample methods for confidence ellipsoids and hypothesis testing based on the two-stage Welsh’s trimmed mean. First assume that we have a statistic V which is a consistent estimator of the asymptotic covariance matrix γ T D(φ ∗ )γ  −1 . For 0 < λ < 1, let r dλ (r, s) = c (F ) , 1 − 2α 1−λ r,s where cq (Fr,s ) denotes the q-quantile of the Fr,s distribution. Suppose for some integer , K is an  × p matrix of rank . Let m be the number of residuals ei lying outside the interval (η(α), ˆ η(1 ˆ − α)). Then   Pr (βˆW − β)T K T (KV −1 K T )−1 K(βˆW − β) ≤ du (, n − m − p) ≈ 1 − u. If K = Ip , the confidence ellipsoid for β is given by (βˆW − β)T V −1 (βˆW − β) ≤ du (, n − m − p). Moreover, if we test H0 : Kβ = v by rejecting H0 whenever (K βˆW − v)T (KV −1 K T )−1 (K βˆW − v) ≥ du (, n − m − p), then this test has an asymptotic size of u. We still need to have an estimator of asymptotic covariance matrix γ T D(φ ∗ )γ  −1 . Let βˆ and Jˆ denote the two-stage Welsh’s trimmed mean of β1 and the matrix J in (A1), replacing ˆ , respectively. The estimator of  is given by  ˆ = n−1 Jˆ T Z T Z Jˆ . It remains to 2 by  2 ˆ is the trimmed least squares estimator for estimate the matrix D(φ ∗ ). Suppose that here  2 2 . Denote the residual matrix as: . e11  e12   .  .. e1n. e21 e22 .. . e2n.  · · · ep1 1 · · · ep1 2   ˆ ]. = [y − Dˆ n βˆW Y1 − Z  ..  2 ..  . . · · · ep1 n. c Australian Statistical Publishing Association Inc. 2001 .

(12) 490. LIN-AN CHEN, KUO-YUAN LIANG AND CHWEN-CHI LIU. Also, let ηˆj (λ) be the λth empirical quantile based on residuals ej i , i = 1, . . . , n and define  ηˆ (α)   j φˆj (a) = a   ηˆ (1 − α) j. if a < ηˆj (α), if ηˆj (α) ≤ a ≤ ηˆj (1 − α), if a > ηˆj (1 − α).. ˆ ∗) = ˆ is the trimmed least squares estimator is given by D(φ An estimator of C where  2  (1 − 2α)−2 n−1 ni=1 φˆ φˆ T , where φˆ = (φˆ 1 (e1i ), . . . , φˆp1 (ep1 i )). On the other hand, if we −1 n ˆ ˆT ˆ ˆ ˆ is the  -norm estimator of  , let D(φ consider that  1) = n i=1 φ1 φ1 , where φ1 = 2 1 2 1 1 ˆ ˆ (φ1 (e1i ), 2 − I (e2i < 0), . . . , 2 − I (ep1 i < 0)). We also let fj (0) be the estimator of fj (0) for j = 2, . . . , p1 ; it can be the estimator by Koenker & Portnoy (1987), Welsh (1991) or ˆ ∗ ) , the estimator of D(φ ∗ ) when we use  -norm estimator for  , Chen (1997). Then D(φ 1 2 equals     −1 ˆ−1 ˆ−1 ˆ diag (1 − 2α)−1 , fˆ2−1 (0), . . . , fˆp−1 (0) D(φ 1 ) diag (1 − 2α) , f2 (0), . . . , fp1 (0) . 1. 7. Appendix Proof of Theorem 4.1. Let H (t) = n−1/2. n  .  γ − I (v1i < F1−1 (γ ) + n−1/2 ziT t) .. i=1. From Jureˇckov´a (1984), we have n    H (T ) − H (0) − f1 F1−1 (γ ) n−1 ziT T = op (1) i=1. for any random vector T with T = Op (1). From Ruppert & Carroll (1980), we have n−1/2. n  .  ˆ ) = op (1). γ − I (ei < η(γ. (6). i=1. By rearrangement, the following equation holds ei − η(γ ˆ ) = v1i − F1−1 (γ ) − n−1/2 ziT Tn (γ ) with.  

(13). ˆ −  )β + η(γ ˆ ) − F1−1 (γ ) a , Tn (γ ) = n1/2 Jˆ (βˆ0 − β) − ( 2 2 1. (7). where a is a vector of zeros except for the first element which is 1. The method of Jureˇckov´a (1977 proof of Lemma 5.2) and (7) also show that for δ > 0, there exist positive values s, k and N0 such that  Pr. inf n. |t|≥k.     n    −1 T  γ − I v1i < F1 (γ ) + zi t  < s < δ . −1/2 . i=1. c Australian Statistical Publishing Association Inc. 2001 . (8).

(14) TWO-STAGE WELSH’S TRIMMED MEAN FOR SIMULTANEOUS EQUATIONS MODEL. 491. for n ≥ N0 . From (6) and (8), it is seen that Tn (γ ) = Op (1) which implies that η(γ ˆ ) is consistent for F −1 (γ ), as η(γ ˆ ) = F1−1 (γ ) + op (1).. Let. M(t, γ ) = n−1/2. n .   zi v1i I v1i ≤ F1−1 (γ ) + n−1/2 ziT t .. i=1.     n−1/2 Z T AV1 = M Tn (1 − α), 1 − α − M Tn (α), α .. Now. (9). From Ruppert & Carroll (1980) and Jureˇckov´a (1984), we have n    M(T , γ ) − M(0, γ ) = F1−1 (γ )f1 F1−1 (γ ) n−1 ziT T + op (1). (10). i=1. for any sequence T with T = Op (1). Then (9) and (10) induce the following, n    zi Tn (1 − α) n−1/2 Z T AV1 = F1−1 (1 − α)f1 F1−1 (1 − α) n−1 i=1. . . − F1−1 (α)f1 F1−1 (α) n−1. n . zi Tn (α). i=1. + n−1/2. n . .  zi v1i I F1−1 (α) ≤ v1i ≤ F1−1 (1 − α) + op (1). (11). i=1. Similarly, T n−1/2 η(α)Z ˆ δα + n−1/2 η(1 ˆ − α)Z T δ1−α. n      = F1−1 (α)f1 F1−1 (α) n−1 zi Tn (α) − F1−1 (1 − α)f1 F1−1 (1 − α) i=1. n−1. n . zi Tn (1 − α) + F1−1 (α)n−1/2. i=1. n    zi I (v1i < F1−1 (α) − α) i=1. n      + F1−1 (1 − α)n−1/2 zi I v1i > F1−1 (1 − α) − α + op (1).. (12). i=1. Combining (11) and (12), we have T T n−1/2 Z T AV1 +n−1/2 η(α)Z ˆ δα +n−1/2 η(1−α)Z ˆ δ1−α = n−1/2. n . ziT φ(v1i )+op (1). (13). i=1. Then the induced form of Theorem 4.1 follows from (5), (13) and because n−1 Z T AZ = ˆ =  + o (1). (1 − 2α)Q + op (1) and  2 2 p c Australian Statistical Publishing Association Inc. 2001 .

(15) 492. LIN-AN CHEN, KUO-YUAN LIANG AND CHWEN-CHI LIU. References Amemiya, T. (1982). Two stage least squares deviations estimators. Econometrica 50, 689–711. Amemiya, T. (1985). Advanced Econometrics. Massachusetts: Harvard University Press. Chen, L-A. (1997). An efficient class of weighted trimmed means for linear regression models. Statistica Sinica 7, 669–686.. Chen, L-A. & Portnoy, S. (1996). Regression quantiles and trimmed least squares estimators for structural equations models. Comm. Statist. Simulation Comput. 25, 1005–1032.. Fomby, T.B., Hill, R.C. & Johnson, S.R. (1984). Advanced Econometric Methods. New York: SpringerVerlag.. Gujarati, D.N. (1988). Basic Econometrics. New York: McGraw-Hill. Jureˇckova, ´ J. (1977). Asymptotic relation of M-estimates and R-estimates in a linear regression model. Ann. Statist. 5, 464–472.. Jureˇckova, ´ J. (1984). Regression quantiles and trimmed least squares estimators under a general design. Kybernetika 20, 345–357.. Koenker, R.W. & Bassett, G.W. (1978). Regression quantiles. Econometrica 46, 33–50. Koenker, R. & Portnoy, S. (1987). L-estimators for linear models. J. Amer. Statist. Assoc. 82, 851–857. Krasker, W.S. (1985). Two stage bounded-influence estimators for simultaneous equations models. J. Bus. Econom. Statist. 4, 432–444.. Maddala, G.S. (1988). Introduction to Econometrics. New York: Macmillan Publishing Company. Powell, J.L. (1983). The asymptotic normality of two-stage least absolute deviations estimators. Econometrica 51, 1569–1576.. Ruppert, D. & Carroll, R.J. (1980). Trimmed least squares estimation in the linear model. J. Amer. Statist. Assoc. 75, 828–838.. Welsh, A.H. (1987). The trimmed mean in the linear model. Ann. Statist. 15, 20–36. Welsh, A.H. (1991). Asymptotically efficient adaptive L-estimators in the linear model. Statist. Sinica 1, 203–228.. c Australian Statistical Publishing Association Inc. 2001 .

(16)

參考文獻

相關文件

identify different types of tourist attractions and examine the factors affecting the development of tourism in these places;.4. recognize factors affecting tourist flows and the

For finite-dimensional second-order cone optimization and complementarity problems, there have proposed various methods, including the interior point methods [1, 15, 18], the

Interestingly, the periodicity in the intercept and alpha parameter of our two-stage or five-stage PGARCH(1,1) DGPs does not seem to have any special impacts on the model

– Funding liquidity - ability to borrow money to buy securities or make loans.... The

在雲中街文創聚落中營運中的「凹凸 咖啡館」是利用當時遺留下的建築群

This two-phase decision-making model includes two major concepts: (1) Analyzing the customer’ s perception of quality practices using the IPGA model, and identifying the service

Measuring managerial efficiency in non-life insurance companies: An application of two-stage data envelopment analysis technique. (2008).Efficiency decomposition in two-stage data

This paper formulates the above situation to the Multi-temperature Refrigerated Container Vehicle Routing Problem (MRCVRP), and proposes a two-stage heuristic which consists of