3.3.3 Normal Distribution

(1)

3.3.3 Normal Distribution

The normal distribution has several advantages over the other distributions.

a. The normal distribution and distributions associated with it are very tractable and analytically.

b. The normal distribution has the familiar bell shape, whose symmetry makes it an appealing choice for many popular models.

c. There is the Central Limit Theorem, which shows that, under mild conditions, the normal distribution can be used to approximate a large variety of distributions in large samples.

The normal distribution has two parameters, usually denoted by µ and σ², which are its mean and variance. The pdf is

f (x|µ, σ²) = 1

√2πσe^−(x−µ)²^/(2σ²⁾, −∞ < x < ∞.

If X ∼ N(µ, σ²), then the random variable Z = (X − µ)/σ has a N(0, 1) distribution, also known as the standard normal.

If Z ∼ N(0, 1),

EZ = 1

√2π Z _∞

−∞

ze^−z²^/2dz = 0, and so, if X ∼ N(µ, σ²),

EX = E(µ + σZ) = µ + σEZ = µ.

Similarly, we have that VarZ = 1 and VarX = σ².

To show

√1 2π

Z _∞

−∞

e^−z²^/2dz = 1.

We only need to show Z _∞

0

e^−z²^/2dz = rπ

2. 1

(2)

Since

¡Z _∞

0

e^−z²^/2dz¢₂

=¡Z _∞

0

e^−t²^/2dt¢¡Z _∞

0

e^−u²^/2du¢

= Z _∞

0

Z _∞

0

e^−(t²^+u²^)/2dtdu.

Now we convert to polar coordinates. Define

t = r cos θ, u = r sin θ.

Then t² + u² = r² and dtdu = rdθdr and the limits of integration become 0 < r < ∞, 0 < θ < π/2. We now have

Z _∞

0

Z _∞

0

e^−(t²^+u²^)/2dtdu = Z _∞

0

Z _∞

0

re^−r²^/2dθdr

= π 2

Z _∞

0

re^−r²^/2dr = π 2.

The probability content within 1, 2 or 3 standard deviations of the mean is

P (|X − µ| ≤ σ) = P (|Z| ≤ 1) = .6826, P (|X − µ| ≤ 2σ) = P (|Z| ≤ 2) = .9544, P (|X − µ| ≤ 3σ) = P (|Z| ≤ 3) = .9974, where X ∼ N(µ, σ²) and Z ∼ N(0, 1).

Among the many uses of the normal distribution, an important one is its use as an approximation to other distributions. For example, if X ∼ binomial(n, p), then EX = np and VarX = np(1 − p), and under suitable conditions, the distribution of X can be approximated by that of a normal random variable with mean µ = np and variance σ² = np(1 − p). The suitable conditions are that n should be large and p should not be extreme (near 0 or 1). We want n large so that there are enough values of X to make an approximation by a continuous distribution reasonable, and p should be “in the middle” so the binomial is nearly symmetric, as is the normal. A conservative rule to follow is that the approximation will be good if min(np, n(1 − p)) ≥ 5.

2