1. hw 9
(1) Let L(Rn, Rm) be the space of all linear maps from Rn to Rm. For each T ∈ L(Rn, Rm), we define
kT kop= sup
kxkRn=1
kT (x)kRn. (a) Prove that k · kop defines a norm on L(Rn, Rm).
Proof. The number kT (x)k ≥ 0 for any x ∈ Rn, kT kop = supkxk=1kT (x)k ≥ 0.
If kT kop = 0, then kT (y)k = 0 for any y with kyk = 1. If x = 0, by linearity of T, T (x) = 0. If x 6= 0, take y = x/kxk. Then T (y) = 0 implies that T (x) = 0 by linearity of T. This shows that T (x) = 0 for all x ∈ Rn, i.e. T is the zero map.
For a ∈ R, kaT kop= sup
kxk=1
kaT (x)k = sup
kxk=1
|a|kT (x)k = |a| sup
kxk=1
kT (x)k = |a|kT kop. For S, T ∈ L(Rn, Rm) and any x ∈ Rn with kxk = 1,
k(S + T )(x)k = kS(x) + T (x)k ≤ kS(x)k + kT (x)k ≤ kSkop+ kT kop
by triangle inequality. Hence kS + T kop≤ kSkop+ kT kop. (b) Prove that kT (x)kRm ≤ kT kopkxkRn for all x ∈ Rn.
Proof. When x = 0, the inequality is obvious. When x 6= 0, take y = x/kxk.
Then kT (y)k ≤ kT k implies that kT k ≥
T
x kxk
= kT (x)k kxk .
Multiplying the inequality on the both side by kxk, we obtain the result.
(c) Let S ∈ L(Rm, Rp). Prove that kS ◦ T kop≤ kSkopkT kop.
Proof. For x ∈ Rn with kxk = 1, by exercise (1b), kS ◦ T (x)k ≤ kSkkT (x)k ≤ kSkkT k.
Thus kS ◦ T k ≤ kSkkT k.
(2) For each A ∈ Mmn(R), we define the matrix norm of A to be
kAk = kLAkop
where LA: Rn→ Rm is the linear map LA(x) = Ax for any x ∈ Rn. In this exercise, we assume m = n.
(a) Use mathematical induction to prove that kAkk ≤ kAkk for any k ≥ 1.
Proof. When k = 1, the statement is obvious. Assume that kAkk ≤ kAkk for k ∈ N. Exercise (1c) implies that
kAk+1k = kA ◦ Akk ≤ kAkkAkk ≤ kAk · kAkk= kAkk+1.
Thus the statement is true for k + 1. By induction, the statement is true for any
natural number k.
1
Remark. This implies that the sequence of numbers (kAkk1/k) is bounded above by kAk. We define the spectral radius of a square matrix A to be
ρ(A) = lim sup
k→∞
kAkk1/k. This implies that ρ(A) ≤ kAk.
(b) Let Eij be the n × n matrix whose ij-th entry is 1 and zero otherwise. Find kEijk for all 1 ≤ i, j ≤ n.
Proof. Let {ei} be the standard basis for Rn. For any x =Pn
i=1xiei, we have kEijxk = |xj| ≤
q
x21+ · · · + x2n= kxk.
This shows that kEijk ≤ 1. On the other hand, kEijejk = 1 ≤ kEijk. We find kEijk = 1.
(c) Let λ1, · · · , λn be n real numbers. Suppose that D = Pn
i=1λiEii, i.e. D is a diagonal matrix. Prove that
kDk = max{|λ1|, · · · , |λn|}.
We denote D by diag(λ1, · · · , λn). Prove that ρ(D) = kDk.
Proof. Denote λ = max{|λ1|, · · · , |λn|}. Let x =Pn
i=1xieiThen Dx =Pn i=1λixi. We see that
kDxk =p
(λ1x1)2+ · · · + (λnxn)2 ≤ q
λ2(x21+ · · · + x2n) ≤ λkxk.
Thus kDk ≤ λ. On the other hand,
|λj| = kDejk ≤ kDk for 1 ≤ j ≤ n.
This implies that λ ≤ kDk. We conclude that kDk = λ.
(3) Let R > 0. Suppose that the power series f (x) =P∞
k=0akxk is convergent for any x ∈ (−R, R). Here an∈ R for any n ≥ 0. Let A be an n × n real matrix such that kAk ∈ (−R + δ, R − δ) where 0 < δ < R.
(a) Prove that P∞
k=0akAk is convergent in (Mn(R), k · k). In this case, we define f (A) =P∞
k=0akAk.
Proof. Let ρ = R − δ. Since ρ ∈ (−R, R), P∞
k=0akρk is convergent in R. By n-th term test, lim
k→∞akρk = 0. Choose = 1. Then we can find N ∈ N so that
|akρk| < 1 whenever k ≥ N. By exercise (2a), kakAkk = |ak|kAkk ≤ |ak|kAkk≤ kAk
ρ
k
for k ≥ N .
Since kAk ∈ (−ρ, ρ), 0 < (kAk/ρ) < 1 and hence the geometric series
∞
X
k=N
(kAk/ρ)k
is convergent in R. By comparison test,
∞
X
k=N
kakAkk is convergent in R. Since
(Mn(R), k · k) is a Banach space,
∞
X
k=N
akAk is convergent in (Mn(R), k · k). Thus
∞
X
k=0
akAk=
N −1
X
k=0
akAk+
∞
X
k=N
akAk
is convergent in (Mn(R), k · k).
(b) If D = diag(λ1, · · · , λn), where λ1, · · · , λn are real numbers in (−R + δ, R − δ).
Prove that f (D) = diag(f (λ1), · · · , f (λn)), i.e. f (D) =Pn
i=1f (λi)Eii.
Proof. Let B = diag(f (λ1), · · · , f (λn)). For each m ≥ 1, we set fm(x) =
m
X
k=0
akxk for x ∈ (−R, R). To prove that f (D) = B, it we have to show that B = lim
m→∞fm(D). By induction, we have Dm =Pn
i=1λmi Eii. Hence fm(D) =
n
X
i=1
fm(λi)Eii. By exercise (2c),
kfm(D) − Bk = max{|fm(λi) − f (λi)| : 1 ≤ i ≤ n}.
Since limm→∞fm(x) = f (x) for x ∈ (−R, R), for any > 0, there exists N,1, · · · , N,n∈ N so that
|fm(λi) − f (λi)| <
if m ≥ N,i. Take N = max{N,i : 1 ≤ i ≤ n}. If m ≥ N, |fm(λi) − f (λi)| < for all 1 ≤ i ≤ n. Thus kfm(D) − Bk < if m ≥ N. We prove that f (D) = B.
(c) Let A be diagonalizable1 with A = SDS−1 where D = diag(λ1, · · · , λn). Sup-
pose λi ∈ (−R + δ, R − δ) for 1 ≤ i ≤ n. Show that f (A) = Sf (D)S−1.
Proof. Let fm : (−R, R) → R be the polynomial function as above. By induc- tion, we see that
Ak= SDkS−1 for any k ≥ 1.
Hence fm(A) = Sfm(D)S−1 for any m ≥ 1. Let C = Sf (D)S−1. By exercise (1c),
kfm(A) − Ck = kSfm(D)S−1− Sf (D)S−1k
= kS(fm(D) − f (D))S−1k
≤ kSkkS−1kkfm(D) − f (D)k.
Since lim
m→∞fm(D) = f (D) by the previous exercise, we find
m→∞lim fm(A) = C = Sf (D)S−1.
1A matrix A ∈ Mn(R) is said to be diagonalizable if there exists an invertible matrix S and a diagonal matrix D in Mn(R) such that S−1AS = D.
Remark. If A is diagonalizable with A = SDS−1, then exp(A) = S−1exp(D)S, and cos A = S−1cos(D)S, and sin A = S−1sin(D)S.
(d) Let λ ∈ (−R, R) and denote J3(λ) =
λ 1 0 0 λ 1 0 0 λ
.
Compute f (J3(λ)) in terms of f and λ and compute exp(tJ3(λ)) for all t, λ ∈ R.
In general, compute f (Jn(λ)) and compute exp(tJn(λ)), where2
Jn(λ) =
λ 1 0 · · · 0 0 λ 1 · · · 0 ... ... ... . .. ...
0 0 0 λ 1
0 0 0 0 λ
n×n
Solution:
f (J3(λ)) =
f (λ) f0(λ) 1!
f00(λ) 2!
0 λ f0(λ)
0 0 f (λ)1!
and
f (Jn(λ)) =
f (λ) f0(λ) 1!
f00(λ)
2! · · · f(n−1)(λ) (n − 1)!
0 f (λ) f0(λ)
1! · · · 0
... ... ... . .. ...
0 0 0 f (λ) f0(λ)
0 0 0 0 f (λ)1!
n×n
(4) Let A, B : (a, b) → Mn(R) be matrix valued differentiable functions3. . (a) Prove that
d
dtTr(A(t)) = Tr(A0(t)) for any a < t < b.
Proof. Let us prove that Tr : Mn(R) → R is continuous. For any A = [aij], we have
Tr A =
n
X
i=1
aii.
2Jnis called a Jordan matrix.
3Let (V, k · k) be a normed space and f : (a, b) → V be a function. (Such a function is called a V -valued function).
Let t0∈ (a, b), we say that f is differentiable at t0 if
t→tlim0
1 t − t0
(f (t) − f (t0)) exists.
In this case, we denote the limit by f0(t0). If f is differentiable at every point of (a, b), we say that f is differentiable.
We can inductively define f(k)(t) for any k ≥ 1. (We can also define the right derivative and the left derivative of f.)
Using kAk∞≤ kAkop,
| Tr(A)| ≤
n
X
i=1
|aii| ≤
n
X
i=1
kAk∞= nkAk∞≤ nkAkop.
By linearity of T, | Tr A − Tr B| = | Tr(A − B)| ≤ nkA − Bkop. This shows that Tr is a Lipschitz continuous function on Mn(R). Therefore it is continuous.
Choose δ > 0 such that (t − δ, t + δ) ⊂ (a, b). Let h be a real number such that 0 < |h| < δ. By linearity of Tr,
Tr(A(t + h)) − Tr(A(t))
h = TrA(t + h) − A(t)
h .
By continuity of Tr and the differentiability of A(t), d
dtTr(A(t)) = lim
h→∞
Tr(A(t + h)) − Tr(A(t)) h
= Tr
h→∞lim
A(t + h) − A(t) h
= Tr A0(t).
(b) Prove that
(A(t)B(t))0 = A0(t)B(t) + A(t)B0(t) for any a < t < b.
Proof. Let t, h, δ be as above. Then A(t + h)B(t + h) − A(t)B(t)
h = A(t + h) − A(t)
h B(t + h)
+ A(t)B(t + h) − B(t)
h .
Now we need the following lemmas.
Lemma 1.1. Let A : (a − δ, a + δ) → Mn(R). Suppose lim
t→aA(t) = A(a). Then A(t) is bounded in a neighborhood of a.
Proof. Let = 1. We can find δ0 such that kA(t) − A(a)k < 1 for |t − a| < δ0. By triangle inequality, kA(t)k ≤ 1 + kA(a)k for |t − a| < δ0. Take M = 1 + kA(a)k.
We find kA(t)k ≤ M for any t with |t − a| < δ0. Lemma 1.2. Let A : (a − δ, a + δ) → Mn(R) and B : (a − δ, a + δ) → Mn(R) be functions. Suppose that
limt→aA(t) = A(a), lim
t→aB(t) = B(a).
Then lim
t→aA(t)B(t) = A(a)B(a).
Proof. By norm inequality and the triangle inequality, we have
kA(t)B(t) − A(a)B(a)k = kA(t)B(t) − A(a)B(t)k + kA(a)B(t) − A(a)B(a)k
≤ kA(t) − A(a)kkB(t)k + kA(a)kkB(t) − B(a)k.
By Lemma 1.1, we can choose M > 0 such that kB(t)k ≤ M and kA(t)k ≤ M for any |t − a| < δ0 for some δ0 > 0. This implies that
kA(t)B(t) − A(a)B(a)k ≤ M (kA(t) − A(a)k + kB(t) − B(a)k).
Using the standard − δ argument, we find lim
t→aA(t)B(t) = A(a)B(a).
(c) Show that A0(t) = 0 for t ∈ (a, b) if and only if there exists A ∈ Mn(R) such
that A(t) = A for any a < t < b.
Proof. Let us write A(t) = [aij(t)] where aij : (a, b) → R are functions.
Lemma 1.3. Let A = [aij]. Then lim
t→aA(t) = A if and only if lim
t→aaij(t) = aij. Proof. This can be proved by the inequality:
kAk∞≤ kAk ≤ nkAk∞. This gives
max{|aij(t) − aij| : 1 ≤ i, j ≤ n} ≤ kA(t) − Ak ≤ n · max{|aij(t) − aij| : 1 ≤ i, j ≤ n}.
By the standard − δ argument, we prove our result.
Lemma 1.4. A : (a, b) → Mn(R) is differentiable if and only if aij are all differentiable. In this case,
A0(t) = [a0ij(t)].
Proof. By definition,
A(t + h) − A(t)
h = aij(t + h) − aij(t) h
. By lemma 1.3,
A0(t) = lim
h→0
A(t + h) − A(t)
h =
h→0lim
aij(t + h) − aij(t) h
= [a0ij(t)].
Let us go back to our main problem. By Lemma 1.4, A0(t) = 0 if and only if a0ij(t) = 0. Assume that A0(t) = 0. By mean value theorem, aij(t) = aij for some aij for all t ∈ (a, b). Let A = [aij]. Then A(t) = [aij(t)] = [aij] = A. When A(t) = A, A0(t) = 0 is obvious.
(d) Suppose that A(t) ∈ GLn(R) for all a < t < b. Prove that
(1.1) d
dt(A(t))−1 = −A(t)−1A0(t)A(t)−1 for any a < t < b.
Proof. By definition, A(t)A(t)−1= I. Using the previous results, A0(t)A(t)−1+ A(t)d
dtA(t)−1= 0.
This implies that
A(t)d
dtA(t)−1 = −A0(t)A(t)−1.
Multiplying the both side of the equation by A(t)−1 to their left, we obtain that d
dtA(t)−1 = −A(t)−1A0(t)A(t)−1.
(e) Let t0 ∈ (a, b). Let C : [a, b] → Mn(R) be a continuous function. Define F :
[a, b] → Mn(R) by
F (t) = Z t
a
C(s)ds, t ∈ [a, b].
Prove that F is differentiable with F0 = C.
Lemma 1.5. Let A : [a, b] → Mn(R) be a continuous function. Write A(t) = [aij(t)]. Then
Z b a
A(t)dt =
Z b a
aij(t)dt
.
Proof. Let P = {tk : 0 ≤ k ≤ m} be a partition of [a, b] and C = {ξk : ξk ∈ [tk−1, tk]} be a mark of P. Then the Riemann sum of A with respect to (P, C) is given by
R(A, P, C) =
m
X
k=1
(tk− tk−1)A(ξi)
=
" m X
k=1
(tk− tk−1)aij(ξi)
#
= [R(aij, C, P )] .
Let L = [Lij] be a n × n real matrix. By inequality, kAk∞≤ kAk ≤ nkAk∞, we have
max{|R(aij, C, P ) − Lij|} ≤ kR(A, P, C) − Lk ≤ n · max{|R(aij, C, P ) − Lij|}.
This shows that
kP k→0lim R(A, C, P ) = L if and only if lim
kP k→0R(aij, C, P ) = Lij
for any 1 ≤ i, j ≤ n. This proves the result.
If F (t) =Rt
aC(s)ds, then F (t) =h Rt
acij(s)dsi
. Since cij are continuous, by the fundamental theorem of calculus, t 7→Rt
acij(s)ds is differentiable with d
dt Z t
a
cij(s)ds = cij(t).
By Lemma 1.4, F is differentiable and F0(t) = [cij(t)] = C(t).
(f) Let A(t) = etcos t etsin t
−etsin t etcos t
, for t ∈ R. Find A0(t), Z t
0
A(s)ds and verify the equation (1.1) using A(t).
(5) Let A =
2 1 1 2
∈ M2(R).
(a) Let χA(λ) = det(λI2− A). Find the roots of χA(λ).
(b) Let λ1 and λ2 be the roots of χA(λ). Find unit vectors u1 and u2 such that Aui = λiui for i = 1, 2 and a matrix S whose i-th column vector is ui with det S > 0. More precisely, if ui= (xi, yi), then
S =
x1 x2
y1 y2
.
(c) Prove that AS = SD where D = diag(λ1, λ2) and that A is diagonalizable.
(d) Use the result obtained in exercise (1.3) to compute exp A, cos A and sin A.
(e) Solve for the matrix differential equation
X0(t) = AX(t), X(0) = I2. (f) Let Y (t) = cos(tA) and Z(t) = sin(tA)
A for t ≥ 0. Verify that Y (t) and Z(t) are both solutions to the matrix differential equation
Φ00(t) + A2Φ(t) = 0.
Here we use
sin θ
θ =
∞
X
k=0
(−1)k θ2k
(2k + 1)! for any θ.