Large Sample Theory
Homework 4: Methods of Estimation, Asymptotic Distribution, Probability and Conditioning Due Date: December 1st
1. The Weibull distribution (after the Swedish physicist Waloddi Weibull, who proposed the distribution in 1939 for the breaking strength of materials), has density function
f (x) = λxλ−1exp−xλ for x, λ > 0.
[As an aside, note that the Weibull arises by assuming y = xλMfollows an exponential distribution].
a. What is the resulting likelihood function `(λ|x1, . . . , xn), for λ?
b. What is the resulting log-likelihood function?
c. What is the score function?
d. What is the second derivative of the log-likelihood function?
e. Suppose 5 values, 0.10, 0.25, 0.5, 1, and 2 are observed. Plot the resulting log- likelhood function
f. What is the approximate sample variance?
g. What is an approximate 95% confidence interval for λ?
2. Let X be N (0, θ), 0 < θ < ∞.
a. Find the Fisher information I(θ).
b. If X1, X2, . . . , Xnis a random sample from this distribution, show that the MLE of θ is an efficient estimator of θ.
3. For Type II censoring, the data consist of the rth smallest lifetimes X(1) ≤ X(2) ≤
· · · ≤ X(r) out of a random sample of n lifetimes X1, . . . , Xn from the assumed life distribution. Assuming X1, . . . , Xn are i.i.d. and have a continuous distribution with p.d.f. f (x) and survival function S(x).
a. Show that the joint p.d.f. of X(1), X(2), · · · X(r)is LII,1= n!
(n − r)!
" r Y
i=1
f (x(i))
#
[S(x(r)]n−r.
b. Suppose that Xi is an exponentially distributed random variable with mean θ. De- rive the MLE of θ, ˆθ, and state the condition on r to guarantee consistency of ˆθ.
c. Use EM algorithm to derive the MLE of θ.
4. The normally distributed random variables X1, . . . , Xnare said to be serially correlated or to follow an autoregressive model if we can write
Xi = θXi−1+ i, i = 1, . . . , n,
where X0 = 0 and 1, . . . , nare independent N (0, σ2) random variables.
a. Show that the density of (X1, . . . , Xn) is 1
(2πσ2)n/2 exp
(
−
Pn
i=1(xi− θxi−1)2 2σ2
)
for −∞ < xi < ∞, i = 1, . . . , n, x0 = 0.
1
b. Derive MLE of θ and σ2. Give a condition on θ so that they are consistent estimates.
5. Let Yi denote the response of a subject at time i, i = 1, . . . , n. Suppose that Yi satisfies the following model
Yi = θ + i, i = 1, . . . , n
where i can be written as i = cei−1+ ei for a given constant c satisfying 0 ≤ c ≤ 1, and the ei are independent and identically distributed with mean zero and variance σ2, i = 1, . . . , n; 0 = 0. Let
Y =¯ 1 n
n
X
i=1
Yi, ˆθ =
n
X
j=1
ajYj
where
aj =
n−j
X
i=0
(−c)j 1 − (−c)j+1 1 + c
!
/
n
X
i=1
1 − (−c)i 1 + c
!2
. a. Show that if ei ∼ N (0, σ2), then ˆθ is the MLE of θ.
b. Show that ¯Y and ˆθ are unbiased.
c. Show that V ar( ¯Y ) ≥ V ar(ˆθ).
d. Show that ¯Y and ˆθ are consistent estimates of θ.
6. Suppose that X1, . . . , Xn are independent and identically distributed according to a lo- cation family with cdf F (x − θ), with F known and with 0 < F (x) < 1 for all x, but that it is only observed whether each Xifalls below a, between a and b, or above b where a < b are two given constants.
a. Describe the joint distribution of the observed three outcomes.
b. Let V denote the number of observations less than a. Describe the asymptotic distribution of√
n(V /n − p1) where p1 = F (a − θ).
c. Show that ˜Vn = a − F−1(V /n) is a consistent estimate of θ. Derive the asymptotic distribution of√
n( ˜Vn− θ)
7. Let X1, . . . , Xnbe iid with distribution Pθdepending on a real-valued parameter θ, and suppose that Eθ(X) = g(θ) and V arθ(X) = τ (θ) < ∞, where g is continuously differentiable function with derivative g0(θ) > 0 for all θ. Denote the estimator obtained by the method of moments by ˆθ. ( i.e., ˆθ is the solution of the equation g(θ) = ¯X.)
a. Show that ˆθ is consistent.
b. Derive its asymptotic distribution.
8. Suppose that vi and ui, 1 ≤ i ≤ n, are associated with a linear relationship vi = a + bui. Due to data collection error, we can only observe (xi, yi) where yi = vi+ δi and xi = ui+ i. It is known that E(δi) = E(i) = 0 and δi and i are to be independent. Note that yi = a + bxi+ (δi− bi) and E(δi− bi) = 0.
a. When V ar(i) = V ar(δi) = σ2, show that the least squares estimate of b (based on (xi, yi)) is not consistent when n−1Pni=1(ui− ¯u)2goes to a nonzero constant c.
b. Propose a consistent estimate of b when V ar(δi) = 2V ar(i).
2
9. Let X1, . . . , Xn be iid according to the normal distribution N (θ, 1). Consider the se- quence of estimators
δn =
( X¯ if | ¯X| ≥ n−1/4 a ¯X if | ¯X| < n−1/4 Find the asymptotic distribution of√
n(δn− θ).
Hint: You may need to derive your answer for θ = 0 and θ 6= 0 separately.
10. Show the following properties of the multivariate normal distribution Nk(µ, Σ) where µ ∈ Rkand Σ is a positive definite k × k matrix. Note that, if X ∼ Nk(µ, Σ), its pdf is
f (x) = (2π)−k/2[Det(Σ)]−1/2exp(−(x − µ)TΣ−1(x − µ)).
(a) The mgf of Nk(µ, Σ) is exp(µTt + tTΣt/2).
Fact: The mgf of X is defined as E exp(XTt).
(b) Let X be a random k-vector having the Nk(µ, Σ) distribution and Y = AX + c, where A is a k × ` matrix of rank ` ≤ k and c ∈ R`. Then Y has the N`(Aµ + c, ATΣA) Distribution.
Fact: If X and Y are random k-vectors and their mgf are identical for all t ∈ N = {t ∈ Rk : ktk ≤ }, then the distribution of X is identical to that of Y.
(c) A random k-vector X has a k-dimensional normal distribution if and only if for any c ∈ Rk, XTc has a univariate normal distribution.
(d) Let X be a random k-vector having the Nk(µ, Σ) distribution. Let A be a k ×` matrix and B be a k × m matric. Then XA and XB are independent if and only if they are uncorrelated.
(e) Let (XT1, XT2)T be a random k-vector having the Nk(µ, Σ) distribution with Σ = Σ11 Σ12
Σ21 Σ22
!
,
where X1 is a random `-vector and Σ11is an ` × ` matrix. Then the conditional pdf of X2 given X1 is
Nk−`(µ2+ (x1− µ1)Σ−111Σ12, Σ22− Σ21Σ−111Σ12), where µi = E(Xi), i = 1, 2.
Hint: Consider X2− µ2− (X1− µ1)Σ−111Σ12and X1− µ1.)
11. Suppose X1, X2, and X3 are multivariate normally distributed with means 1 µ1 = 1, µ2 = 0, µ3 = −2 and covariance structure
σ2(X1) = 3, σ2(X2) = 4, σ2(X3) = 6, σ(X1, X2) = 1, σ(X1, X3) = −1, σ(X2, X3) = 2.
a. What is the distribution of (X1, X2) given X3? b. What is the regression of X1on X2 and X3?
c. What is the conditional variance of X1given X2and X3?
3