3.6 Inequalities and Identities


Theorem 3.6.1 (Chebychev’s Inequality)

Let $X$ be a random variable and let $g(x)$ be a nonnegative function. Then, for any $r > 0$,

\[
P(g(X) \ge r) \le \frac{E\,g(X)}{r}.
\]

Proof:

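\[
\begin{aligned}
E\,g(X) &= \int_{-\infty}^{\infty} g(x) f_X(x)\,dx \\
&\ge \int_{\{x:\, g(x) \ge r\}} g(x) f_X(x)\,dx && \text{($g$ is nonnegative)} \\
&\ge r \int_{\{x:\, g(x) \ge r\}} f_X(x)\,dx \\
&= r\,P(g(X) \ge r).
\end{aligned}
\]

Rearranging now produces the desired inequality. □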

Example 3.6.2 (Illustrating Chebychev)

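Let $g(x) = (x - \mu)^2/\sigma^2$, where $\mu = EX$ and $\sigma^2 = \operatorname{Var} X$. For convenience write $r = t^2$. Then

\[
P\left(\frac{(X - \mu)^2}{\sigma^2} \ge t^2\right) \le \frac{1}{t^2}\, E\,\frac{(X - \mu)^2}{\sigma^2} = \frac{1}{t^2}.
\]

Thus,

\[
P(|X - \mu| \ge t\sigma) \le \frac{1}{t^2}.
\]

For example, taking $t = 2$, we get

\[
P(|X - \mu| \ge 2\sigma) \le \frac{1}{2^2} = 0.25.
\]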
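The bound in Example 3.6.2 holds for every distribution with finite variance, so it is usually far from tight. As a rough illustration, the sketch below compares it with the exact tail probability of one particular distribution chosen here as an assumption, an Exponential(1) random variable (for which $\mu = \sigma = 1$). The function name `exponential_tail` is hypothetical and not part of the notes.

```python
import math


def exponential_tail(t: float) -> float:
    """Exact P(|X - mu| >= t*sigma) for X ~ Exponential(rate 1), where mu = sigma = 1."""
    # Upper piece: P(X >= 1 + t) = exp(-(1 + t)).
    upper = math.exp(-(1.0 + t))
    # Lower piece: P(X <= 1 - t) = 1 - exp(-(1 - t)); it vanishes once t >= 1.
    lower = 1.0 - math.exp(-(1.0 - t)) if t < 1.0 else 0.0
    return upper + lower


for t in (1.5, 2.0, 3.0):
    exact = exponential_tail(t)
    bound = 1.0 / t**2  # Chebychev bound from Example 3.6.2
    print(f"t = {t}: exact tail = {exact:.4f}, Chebychev bound = {bound:.4f}")
```

For each listed value of $t$ the exact tail sits well below $1/t^2$, which is the price paid for a bound that needs no distributional assumptions.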

Example 3.6.3 (A normal probability inequality) If Z is standard normal, then

\[
P(|Z| \ge t) \le \sqrt{\frac{2}{\pi}}\,\frac{e^{-t^2/2}}{t}, \quad \text{for all } t > 0.
\]


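Write

\[
\begin{aligned}
P(Z \ge t) &= \frac{1}{\sqrt{2\pi}} \int_t^{\infty} e^{-x^2/2}\,dx \\
&\le \frac{1}{\sqrt{2\pi}} \int_t^{\infty} \frac{x}{t}\, e^{-x^2/2}\,dx && \text{(since $x/t > 1$ for $x > t$)} \\
&= \frac{1}{\sqrt{2\pi}}\,\frac{e^{-t^2/2}}{t},
\end{aligned}
\]

and use the fact that $P(|Z| \ge t) = 2P(Z \ge t)$.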
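To see how much sharper the normal-specific bound is than the general Chebychev bound $1/t^2$, the following minimal sketch evaluates both next to the exact two-sided tail. It relies on the standard identity $P(|Z| \ge t) = \operatorname{erfc}(t/\sqrt{2})$; the function names are illustrative choices, not notation from the notes.

```python
import math


def two_sided_normal_tail(t: float) -> float:
    """Exact P(|Z| >= t) for standard normal Z, using P(|Z| >= t) = erfc(t / sqrt(2))."""
    return math.erfc(t / math.sqrt(2.0))


def normal_tail_bound(t: float) -> float:
    """Bound from Example 3.6.3: sqrt(2/pi) * exp(-t^2 / 2) / t, valid for t > 0."""
    return math.sqrt(2.0 / math.pi) * math.exp(-t * t / 2.0) / t


for t in (1.0, 2.0, 3.0):
    print(
        f"t = {t}: exact = {two_sided_normal_tail(t):.5f}, "
        f"normal bound = {normal_tail_bound(t):.5f}, "
        f"Chebychev bound = {1.0 / t**2:.5f}"
    )
```

At $t = 2$ the normal bound is already several times smaller than the Chebychev value 0.25, and the gap widens as $t$ grows because the bound decays like $e^{-t^2/2}$ rather than $1/t^2$.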

Theorem 3.6.4

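Let $X_{\alpha,\beta}$ denote a gamma$(\alpha, \beta)$ random variable with pdf $f(x \mid \alpha, \beta)$, where $\alpha > 1$. Then for any constants $a$ and $b$,

\[
P(a < X_{\alpha,\beta} < b) = \beta\bigl(f(a \mid \alpha, \beta) - f(b \mid \alpha, \beta)\bigr) + P(a < X_{\alpha-1,\beta} < b).
\]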
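The identity in Theorem 3.6.4 can be checked numerically. The sketch below assumes the scale parameterization $f(x \mid \alpha, \beta) = x^{\alpha-1} e^{-x/\beta} / (\Gamma(\alpha)\beta^{\alpha})$ and approximates the interval probabilities with composite Simpson's rule; the helper names and the particular values of $\alpha$, $\beta$, $a$, $b$ are arbitrary choices made for illustration.

```python
import math


def gamma_pdf(x: float, alpha: float, beta: float) -> float:
    """Gamma(alpha, beta) density with beta as a scale parameter (assumed convention)."""
    return x ** (alpha - 1.0) * math.exp(-x / beta) / (math.gamma(alpha) * beta**alpha)


def simpson(func, lo: float, hi: float, n: int = 2000) -> float:
    """Composite Simpson's rule on [lo, hi] with an even number n of subintervals."""
    h = (hi - lo) / n
    total = func(lo) + func(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(lo + i * h)
    return total * h / 3.0


def gamma_interval_prob(a: float, b: float, alpha: float, beta: float) -> float:
    """Numerical approximation of P(a < X < b) for X ~ gamma(alpha, beta)."""
    return simpson(lambda x: gamma_pdf(x, alpha, beta), a, b)


alpha, beta, a, b = 3.0, 2.0, 1.0, 5.0  # illustrative values with alpha > 1
lhs = gamma_interval_prob(a, b, alpha, beta)
rhs = beta * (gamma_pdf(a, alpha, beta) - gamma_pdf(b, alpha, beta)) + gamma_interval_prob(
    a, b, alpha - 1.0, beta
)
print(f"left-hand side = {lhs:.10f}, right-hand side = {rhs:.10f}")
```

The two sides should agree up to quadrature error, which reflects the single integration by parts that underlies the identity.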

Lemma 3.6.5 (Stein’s Lemma)

Let $X \sim N(\theta, \sigma^2)$, and let $g$ be a differentiable function satisfying $E|g'(X)| < \infty$. Then

\[
E[g(X)(X - \theta)] = \sigma^2 E\,g'(X).
\]

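Proof: The left-hand side is

\[
E[g(X)(X - \theta)] = \frac{1}{\sqrt{2\pi}\,\sigma} \int_{-\infty}^{\infty} g(x)(x - \theta)\, e^{-(x-\theta)^2/(2\sigma^2)}\,dx.
\]

Use integration by parts with $u = g(x)$ and $dv = (x - \theta)\, e^{-(x-\theta)^2/(2\sigma^2)}\,dx$ to get

\[
E[g(X)(X - \theta)] = \frac{1}{\sqrt{2\pi}\,\sigma} \left[ -\sigma^2 g(x)\, e^{-(x-\theta)^2/(2\sigma^2)} \Big|_{-\infty}^{\infty} + \sigma^2 \int_{-\infty}^{\infty} g'(x)\, e^{-(x-\theta)^2/(2\sigma^2)}\,dx \right].
\]

The condition on $g'$ is enough to ensure that the first term is 0, and what remains on the right-hand side is $\sigma^2 E\,g'(X)$. □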

Example 3.6.6 (Higher-order normal moments)

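Stein’s lemma makes calculation of higher-order moments quite easy. For example, if $X \sim N(\theta, \sigma^2)$, then

\[
\begin{aligned}
EX^3 &= EX^2(X - \theta + \theta) = EX^2(X - \theta) + \theta EX^2 \\
&= 2\sigma^2 EX + \theta EX^2 = 2\sigma^2\theta + \theta(\sigma^2 + \theta^2) \\
&= 3\theta\sigma^2 + \theta^3,
\end{aligned}
\]

where the second line applies Stein’s lemma with $g(x) = x^2$, so that $g'(x) = 2x$.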
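Both Stein's identity for $g(x) = x^2$ and the resulting third-moment formula can be sanity-checked by numerical integration. The sketch below is one way to do this under stated assumptions: the helper `normal_expectation`, the integration window of twelve standard deviations, and the values of $\theta$ and $\sigma$ are illustrative choices, not part of the notes.

```python
import math


def normal_expectation(func, theta: float, sigma: float,
                       half_width: float = 12.0, n: int = 4000) -> float:
    """Approximate E[func(X)] for X ~ N(theta, sigma^2) by composite Simpson's rule.

    The integral is truncated to theta +/- half_width * sigma; n must be even.
    """
    lo = theta - half_width * sigma
    hi = theta + half_width * sigma
    h = (hi - lo) / n

    def integrand(x: float) -> float:
        z = (x - theta) / sigma
        return func(x) * math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

    total = integrand(lo) + integrand(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(lo + i * h)
    return total * h / 3.0


theta, sigma = 1.5, 2.0  # illustrative parameter values

# Stein's lemma with g(x) = x^2, g'(x) = 2x.
stein_lhs = normal_expectation(lambda x: x**2 * (x - theta), theta, sigma)
stein_rhs = sigma**2 * normal_expectation(lambda x: 2.0 * x, theta, sigma)

# Third moment compared with the closed form 3*theta*sigma^2 + theta^3.
third_moment = normal_expectation(lambda x: x**3, theta, sigma)
closed_form = 3.0 * theta * sigma**2 + theta**3

print(f"Stein: {stein_lhs:.6f} vs {stein_rhs:.6f}")
print(f"EX^3:  {third_moment:.6f} vs {closed_form:.6f}")
```

Swapping in another smooth $g$ with its derivative, for instance $g(x) = \sin x$ and $g'(x) = \cos x$, gives a check of the lemma that does not reduce to polynomial moments.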

Theorem 3.6.7

Let $\chi^2_p$ denote a chi-squared random variable with $p$ degrees of freedom. For any function $h(x)$,

\[
E\,h(\chi^2_p) = p\,E\left(\frac{h(\chi^2_{p+2})}{\chi^2_{p+2}}\right),
\]

provided the expectations exist.

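Some moment calculations are very easy with Theorem 3.6.7. For example, the mean of a $\chi^2_p$ is

\[
E\chi^2_p = p\,E\left(\frac{\chi^2_{p+2}}{\chi^2_{p+2}}\right) = p\,E(1) = p,
\]

and the second moment is

\[
E(\chi^2_p)^2 = p\,E\left(\frac{(\chi^2_{p+2})^2}{\chi^2_{p+2}}\right) = p\,E(\chi^2_{p+2}) = p(p + 2).
\]

So

\[
\operatorname{Var}(\chi^2_p) = p(p + 2) - p^2 = 2p.
\]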
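Taking $h(x) = x^k$ in Theorem 3.6.7 gives the recursion $E(\chi^2_p)^k = p\,E(\chi^2_{p+2})^{k-1}$, which extends the two calculations above to any integer moment. A minimal sketch of that recursion follows; the function name `chi2_moment` is hypothetical and the code assumes integer $k \ge 0$.

```python
def chi2_moment(p: int, k: int) -> int:
    """k-th raw moment of a chi-squared variable with p degrees of freedom.

    Uses Theorem 3.6.7 with h(x) = x^k: E (chi2_p)^k = p * E (chi2_{p+2})^(k-1).
    """
    if k == 0:
        return 1  # E[1] = 1 for any p
    return p * chi2_moment(p + 2, k - 1)


p = 5  # illustrative degrees of freedom
mean = chi2_moment(p, 1)
second = chi2_moment(p, 2)
variance = second - mean**2
print(f"p = {p}: mean = {mean}, second moment = {second}, variance = {variance}")
```

Unrolling the recursion shows that the $k$-th moment is the product $p(p+2)\cdots(p+2k-2)$.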

Theorem 3.6.8 (Hwang)

Let $g(x)$ be a function with $-\infty < E\,g(X) < \infty$ and $-\infty < g(-1) < \infty$. Then:

a. If $X \sim \text{Poisson}(\lambda)$,

\[
E(\lambda g(X)) = E(X g(X - 1)).
\]

b. If $X \sim \text{negative binomial}(r, p)$,

\[
E((1 - p)\,g(X)) = E\left(\frac{X}{r + X - 1}\, g(X - 1)\right).
\]

Example 3.6.9 (Higher-order Poisson moments)

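For $X \sim \text{Poisson}(\lambda)$, take $g(x) = x^2$ and use Theorem 3.6.8:

\[
E(\lambda X^2) = E(X(X - 1)^2) = E(X^3 - 2X^2 + X).
\]

Therefore, the third moment of a Poisson$(\lambda)$ is

\[
\begin{aligned}
EX^3 &= \lambda EX^2 + 2EX^2 - EX \\
&= \lambda(\lambda + \lambda^2) + 2(\lambda + \lambda^2) - \lambda = \lambda^3 + 3\lambda^2 + \lambda.
\end{aligned}
\]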

For the negative binomial, the mean can be calculated by taking $g(x) = r + x$:

\[
E((1 - p)(r + X)) = E\left(\frac{X}{r + X - 1}\,(r + X - 1)\right) = EX,
\]

so, rearranging, we get

\[
EX = \frac{r(1 - p)}{p}.
\]
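Both parts of Theorem 3.6.8 and the moments derived in this example can be checked by summing directly over the probability mass functions. The sketch below truncates each infinite sum at a fixed number of terms. It assumes the negative binomial pmf $P(X = x) = \binom{r+x-1}{x} p^r (1-p)^x$ for $x = 0, 1, \dots$ with integer $r$, and all helper names and parameter values are illustrative.

```python
import math


def poisson_pmf(lam: float, n_terms: int = 200) -> list:
    """First n_terms Poisson(lam) probabilities, built from p_x = p_{x-1} * lam / x."""
    probs = [math.exp(-lam)]
    for x in range(1, n_terms):
        probs.append(probs[-1] * lam / x)
    return probs


def negbin_pmf(r: int, p: float, n_terms: int = 500) -> list:
    """First n_terms negative binomial(r, p) probabilities (failures before the r-th success)."""
    return [math.comb(r + x - 1, x) * p**r * (1.0 - p) ** x for x in range(n_terms)]


def expect(probs: list, func) -> float:
    """Truncated expectation: sum of func(x) * P(X = x) over the stored support."""
    return sum(func(x) * px for x, px in enumerate(probs))


# Part (a) with g(x) = x^2, plus the third-moment formula.
lam = 2.5
pois = poisson_pmf(lam)
pois_lhs = expect(pois, lambda x: lam * x**2)
pois_rhs = expect(pois, lambda x: x * (x - 1) ** 2)
pois_third = expect(pois, lambda x: x**3)

# Part (b) with g(x) = r + x; the x = 0 term contributes nothing and is skipped
# so that the denominator r + x - 1 is never evaluated at zero.
r, p = 3, 0.4
nb = negbin_pmf(r, p)
nb_lhs = expect(nb, lambda x: (1.0 - p) * (r + x))
nb_rhs = expect(nb, lambda x: 0.0 if x == 0 else x / (r + x - 1) * (r + x - 1))
nb_mean = expect(nb, lambda x: x)

print(f"Poisson identity: {pois_lhs:.6f} vs {pois_rhs:.6f}")
print(f"Poisson EX^3: {pois_third:.6f} vs {lam**3 + 3 * lam**2 + lam:.6f}")
print(f"Negative binomial identity: {nb_lhs:.6f} vs {nb_rhs:.6f}")
print(f"Negative binomial mean: {nb_mean:.6f} vs {r * (1 - p) / p:.6f}")
```

The truncation points are generous for these parameter values; heavier tails (small $p$ or large $\lambda$) would need more terms.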
