Definition Let X, Y be metric spaces, E ⊂ X, p ∈ E and f : E → Y. f is said to becontinuous at p if
x∈E, x→plim f (x) = f (p),
⇐⇒ ∀ > 0, ∃ δ > 0 such that if x ∈ Nδ(p) ∩ E, then f (x) ∈ N(f (p)) ∩ f (E),
⇐⇒ ∀ > 0, ∃ δ > 0 such that f (Nδ(p) ∩ E) ⊂ N(f (p)) ∩ f (E).
(a) Theorem Let X, Y be metric spaces and f : X → Y. Then f is continuous if and only if f−1(V ) = {x ∈ X | f (x) ∈ V } is open in X for every open set V ⊂ Y.
(b) Theorem Let X, Y be metric spaces and f : X → Y. Then f is continuous if and only if f−1(C) = {x ∈ X | f (x) ∈ C} is closed in X for every closed set C ⊂ Y.
(c) Theorem Let X be a compact metric space, Y be metric spaces. Suppose f : X → Y is a continuous mapping. Then f (X) is compact.
(d) Theorem Let X be a compact metric space, Y be metric spaces. Suppose f : X → Y is a continuous mapping. Then f is uniformly continuous on X, i.e. ∀ > 0, ∃ δ > 0 such that if p, q ∈ X satisfying dX(p, q) < δ then dY(f (p), f (q)) < .
(e) Theorem Let X be a compact metric space, Y be metric spaces. Suppose f : X → Y is a continuous 1 − 1 onto mapping. Then the inverse mapping f−1 : Y → X defined by
For ech y ∈ Y, f−1(y) = x ⇐⇒ ∃! x ∈ X such that f (x) = y is continuous.
Definition Let X be a metric space. Two sets A, B ⊂ X are said to be separated if A ∩ ¯B = ∅ and ¯A ∩ B = ∅.
A set E ⊂ X is said to beconnected if E is not a union of two nonempty separated sets, i.e.
6 ∃ A 6= ∅, B 6= ∅ such that E = A ∪ B, A ∩ ¯B = ∅ and ¯A ∩ B = ∅.
(a) Let E be a connected subset of a metric space X. If E ⊂ F ⊂ ¯E, then F is also connected.
Proof Suppose F is disconnected and F = C ∪ D is a separation of F, i.e.
C 6= ∅, D 6= ∅ and C ∩ ¯D = ∅, ¯C ∩ D = ∅.
Since E is connected, E ⊂ F = C ∪ D and E = E ∩ C ∪ E ∩ D, either E ∩ C = ∅ or E ∩ D = ∅, i.e. the set E must lie entirely in either C or D. Suppose E ⊂ C. Then ¯E ⊂ ¯C and
D = F ∩ D ⊂ ¯E ∩ D ⊂ ¯C ∩ D = ∅ =⇒ D = ∅.
This contradicts to the fact that D 6= ∅. Hence, F is connected.
(b) Theorem 2.47 A subset E ⊂ R is connected if and only if it is an interval in one of the following forms: (a, b), [a, b), [a, b) or [a, b].
Theorem 4.22 Let X, Y be metric spaces and let f : X → Y be a continuous mapping. If E is a coonected subset of X, then f (E) is connected.
Theorem 4.23 Let f : [a, b] → R be continuous. If f (a) < f (b) and if c is a number such that f (a) < c < f (b), then there exists x ∈ (a, b) such that f (x) = c.
Definition Let f have domain E in Rn and range in Rm, and let p be an interior point of E.
We say that f is differentiable at p if there exists a linear function L : Rn → Rm such that for every > 0 there exists δ() > 0 such that if x ∈ Rn is any vector satisfying kx − pk ≤ δ(), then x ∈ E and
kf (x) − f (p) − L(x − p)k ≤ kx − pk.
Equivalently, we say that f is differentiable at p if there exists an m × n matrix L : Rn → Rm such that
x→plim
kf (x) − f (p) − L(x − p)k
kx − pk = lim
kx−pk→0
kf (x) − f (p) − L(x − p)k
kx − pk = 0,
i.e., the function f (x) − f (p) − L(x − p) = o(kx − pk), or we say that f (x) − f (p) − L(x − p) is of a little o of kx − pk.
Remarks
(a) If L1, L2 are linear functions such that f (x) − f (p) − L1(x − p) = o(kx − pk) and f (x) − f (p) − L2(x − p) = o(kx − pk), then L1 = L2.
Proof Observe that
x → p ⇐⇒ t → 0
and if L : Rn → Rm is an m × n matrix and let ei denote the unit vector in the positive i−th coordinate of Rn then
L(ei) = Lm×n·
0
... 1 ... 0
n×1
=Li1 Li2 · · · Lin
1×n= the i-th row of L.
Since
0 ≤ lim
x→p
k L1− L2(x − p)k kx − pk
= lim
x→p
kf (x) − f (p) − L2(x − p) − f (x) − f (p) − L1(x − p)k kx − pk
≤ lim
x→p
kf (x) − f (p) − L2(x − p)k
kx − pk + lim
x→p
kf (x) − f (p) − L1(x − p)k kx − pk
= 0,
we get L1 = L2 by taking x − p = tei and the fact that 0 = lim
t→0
k L1− L2(tei)k
kteik = k L1− L2(ei)k which implies that the i-th row of L1 is equal to the i-th row of L2.
(b) If L exists at p ∈ E, then it is called the derivative of f at p, denoted by Df (p) or f0(p).
Note that the linear function Df (p) : Rn → Rm is defined by
Df (p)(x − p) =
∂f1
∂x1
(p) ∂f1
∂x2
(p) · · · ∂f1
∂xn
(p)
∂f2
∂x1(p) ∂f2
∂x2(p) · · · ∂f2
∂xn(p)
· · · ·
∂fm
∂x1(p) ∂fm
∂x2(p) · · · ∂fm
∂xn(p)
m×n
x1− p1 x2− p2
... xn− pn
n×1
(c) When m = n = 1, f is differentiable at p ∈ (a, b) ⊂ R if there exists L ∈ R such that
x→plim
|f (x) − f (p) − L(x − p)|
|x − p| = lim
x→p
f (x) − f (p) − L(x − p) x − p
= 0
⇐⇒ lim
x→p
f (x) − f (p) − L(x − p)
x − p = 0
⇐⇒ lim
x→p
f (x) − f (p) x − p = L, and the deirivative of f at p is defined to be f0(p) = L.
(d) Definition A function f : [a, b] → R is differentiable at x ∈ [a, b] if this limit exists:
f0(x) ≡ lim
t→x
f (t) − f (x) t − x .
Questions
(a) If f is continuous on [a, b], is f (x) differentiable? No, consider the absolute value function.
(b) If f is differentiable on [a, b], is f continuous? Yes. Notice:
f (t) − f (x)− = f (t) − f (x)
t − x (t − x) = f0(x) · 0 = 0, as t → x.
(c) If f is differentiable on [a, b], is f0 continuous? No, consider f (x) = x 4 3 sin(1
x) if x 6= 0 and 0 if x = 0.
Remarks
(a) f0 always satisfies the intermediate value property and has no simple discontinuities, if it exists.
(b) Since f0 is a limit, f0 satisfies sum, product and quotient rules.
Theorems
(a) (f g)0 = f0g + f g0.
Proof Let h = f g. Consider:
h(t) − h(x) = f (t)[g(t) − g(x)] + g(x)[f (t) − f (x)].
Now take t → x after dividing by t − x.
(b) (Mean Value Theorem) If f is continuous on [a, b] and differentiable (a, b) then there exists a point c ∈ (a, b) such that f (b) − f (a) = (b − a)f0(c).
(c) (General Mean Value Theorem) If f, g are continuous on [a, b] and differentiable on (a, b), then there exists c ∈ (a, b) such that:
[f (b) − f (a)]g0(c) = [g(b) − g(a)]f0(c).
Note if g(x) = x, this gives the Mean Value Theorem.
Proof (Idea) Consider h(x) = [f (b) − f (a)]g(x) − [g(b) − g(a)]f (x). Note that h(a) = 0 = h(b). There must be a local min or max between a or b where h0(c) = 0. Hence the Generalized Mean Value Theorem follows from Rolle’s Theorem.
Example If f0(x) > 0 on (a, b) then f is strictly increasing on (a, b).
Proof Let a < s < t < b. Then f (s) < f (t). In other words, f (t) − f (s) > 0. But f (t) − f (s) = (t − s)f0(c) for some c ∈ (a, b). We know f0(c) > 0. So f (t) − f (s) > 0.
Remark Book’s proof uses Rolle’s Theorem which says if h : [a, b] → R has a local max at c ∈ (a, b) and h0(c) exists and then h0(c) = 0.
L’Hospital’s Rule Suppose f, g : (a, b) → R are differentiable and g0(x) 6= 0 for every x ∈ (a, b), where −∞ ≤ a < b ≤ ∞. Suppose
x→alim f0(x) g0(x) = A.
(a) If lim
x→af (x) = 0 and lim
x→ag(x) = 0, (b) or if lim
x→ag(x) = ∞, then
x→alim f (x) g(x) = A.
Proof
Case 1: −∞ < A < ∞ Since
x→alim f0(x)
g0(x) = A ⇐⇒ ∀ > 0, ∃ δ > 0 such that if x ∈ (a, a + δ) then A − < f0(x)
g0(x) < A +
⇐⇒ ∃ c = a + δ ∈ (a, b) such that if x ∈ (a, c) then A − < f0(x)
g0(x) < A + .
For any x, y satisfying a < x < y < c, the generalized Mean Value Theorem shows that
∃ t ∈ (x, y) such that A − < f (x) − f (y)
g(x) − g(y) = f0(t)
g0(t) < A + . (1) (a) If lim
x→af (x) = 0 and lim
x→ag(x) = 0, by letting x → a in (1), we get
A − ≤ f (y)
g(y) = lim
x→a
f (x) − f (y)
g(x) − g(y) = lim
x→a
f0(t)
g0(t) ≤ A + for every y ∈ (a, c)
=⇒ A − ≤ f (y)
g(y) ≤ A + for every y ∈ (a, c)
=⇒ lim
y→a
f (y) g(y) = A.
(b) Suppose lim
x→ag(x) = ∞, by keeping y fixed,
∃ c1 ∈ (a, y) such that if a < x < c1 then
g(x) > 0, g(x) > g(y)
g(y) g(x)
< ,
f (y) g(x)
<
Thus g(x) − g(y)
g(x) > 0 for every a < x < c1, and
(A − ) ·g(x) − g(y)
g(x) < f (x) − f (y)
g(x) − g(y) · g(x) − g(y)
g(x) < (A + ) ·g(x) − g(y) g(x)
=⇒ A − − (A − )g(y)
g(x) < f (x)
g(x) −f (y)
g(x) < A + − (A + )g(y) g(x)
=⇒ A − − (A − )g(y)
g(x) + f (y)
g(x) < f (x)
g(x) < A + − (A + )g(y)
g(x) +f (y) g(x)
=⇒ A − − (A − ) − < f (x)
g(x) < A + + (A + ) + for every x ∈ (a, c1)
=⇒ lim
x→a
f (x) g(x) = A.
Case 2: A = −∞
Since
x→alim f0(x)
g0(x) = −∞ ⇐⇒ ∀ q > r ∈ R, ∃ δ > 0 such that if x ∈ (a, a + δ) then f0(x)
g0(x) < r < q
⇐⇒ ∃ c = a + δ ∈ (a, b) such that if x ∈ (a, c) then f0(x)
g0(x) < r < q.
For any x, y satisfying a < x < y < c, the generalized Mean Value Theorem shows that
∃ t ∈ (x, y) such that f (x) − f (y)
g(x) − g(y) = f0(t)
g0(t) < r < q. (2)
(a) If lim
x→af (x) = 0 and lim
x→ag(x) = 0, by letting x → a in (2), we get f (y)
g(y) = lim
x→a
f (x) − f (y) g(x) − g(y) = lim
x→a
f0(t)
g0(t) ≤ r < q for every y ∈ (a, c)
=⇒ f (y)
g(y) ≤ r < q for every y ∈ (a, c)
=⇒ lim
y→a
f (y)
g(y) ≤ r < q for arbitrary q ∈ R
=⇒ lim
y→a
f (y)
g(y) = −∞.
(b) Suppose lim
x→ag(x) = ∞, by keeping y fixed, choose > 0 such that r + |r| + < q ⇐⇒
0 < < q − r 1 + |r|,
∃ c1 ∈ (a, y) such that if a < x < c1 then
g(x) > 0, g(x) > g(y)
g(y) g(x)
< ,
f (y) g(x)
<
Thus g(x) − g(y)
g(x) > 0 for every a < x < c1, and
f (x) − f (y)
g(x) − g(y) · g(x) − g(y)
g(x) < r · g(x) − g(y) g(x)
=⇒ f (x)
g(x) −f (y)
g(x) < r −rg(y) g(x)
=⇒ f (x)
g(x) < r −rg(y)
g(x) +f (y) g(x)
=⇒ f (x)
g(x) < r + |r| + < q for every x ∈ (a, c1)
=⇒ lim
x→a
f (x)
g(x) ≤ q for arbitrary q ∈ R
=⇒ lim
x→a
f (x)
g(x) = −∞.
Case 3: A = ∞ Since
x→alim f0(x)
g0(x) = ∞ ⇐⇒ ∀ p < r ∈ R, ∃ δ > 0 such that if x ∈ (a, a + δ) then f0(x)
g0(x) > r > p
⇐⇒ ∃ c = a + δ ∈ (a, b) such that if x ∈ (a, c) then f0(x)
g0(x) > r > p.
For any x, y satisfying a < x < y < c, the generalized Mean Value Theorem shows that
∃ t ∈ (x, y) such that f (x) − f (y)
g(x) − g(y) = f0(t)
g0(t) > r > p. (3)
(a) If lim
x→af (x) = 0 and lim
x→ag(x) = 0, by letting x → a in (3), we get f (y)
g(y) = lim
x→a
f (x) − f (y) g(x) − g(y) = lim
x→a
f0(t)
g0(t) ≥ r > p for every y ∈ (a, c)
=⇒ f (y)
g(y) ≥ r > p for every y ∈ (a, c)
=⇒ lim
y→a
f (y)
g(y) ≥ r > p for arbitrary p ∈ R
=⇒ lim
y→a
f (y) g(y) = ∞.
(b) Suppose lim
x→ag(x) = ∞, by keeping y fixed, choose > 0 such that r − |r| − > p ⇐⇒
0 < < r − p 1 + |r|,
∃ c1 ∈ (a, y) such that if a < x < c1 then
g(x) > 0, g(x) > g(y)
g(y) g(x)
< ,
f (y) g(x)
<
Thus g(x) − g(y)
g(x) > 0 for every a < x < c1, and
f (x) − f (y)
g(x) − g(y) ·g(x) − g(y)
g(x) > r ·g(x) − g(y) g(x)
=⇒ f (x)
g(x) − f (y)
g(x) > r − rg(y) g(x)
=⇒ f (x)
g(x) > r − rg(y)
g(x) + f (y) g(x)
=⇒ f (x)
g(x) > r − |r| − > p for every x ∈ (a, c1)
=⇒ lim
x→a
f (x)
g(x) ≥ p for arbitrary p ∈ R
=⇒ lim
x→a
f (x) g(x) = ∞.
Question Let f : R → R be differentiable to any order (i.e., f is class C∞ or smooth). How can we approximate f near the point x = a?
Let P0(x) = f (a). Can we find a polynomial P1(x) such that P1(a) = f (a) and moreover P10(a) = f0(a). In other words, they agree to first order, et cetera. Then let:
P1(x) = f (a) + f0(a)(x − a),
P1(x) = f (a) + f0(a)(x − a) + f00(a)(x − a)2
2 .
Definition The n-th order Taylor polynomial of f (x) centered at x = a is given by:
Pn(a, x) = Pn(x) =
n
X
k=0
f(k)(a)(x − a)k
k! .
Taylor’s Theorem For C∞, f : R → R, then:
f (x) = Pn(x) + f(n+1)(c)(x − c)(n+1) (n + 1)!
for some c between x and a for a particular x.
Remarks
(a) The error is En(x) = f (x) − Pn(x).
(b) When n = 0,
f (x) = f (a) + f0(c)(x − a)
for some c between a and x at a particular x. Notice that this is equivalent to the Mean-Value Theorem. Thus Taylor’s Theorem is a generalization of Mean-Value Theorem.
Proof Let k ∈ R be defined by
f (b) − P(n−1)(a, b) = k(b − a)n
n! . (4)
Our goal is to show k = f(n)(c) for some c ∈ (a, b). Let:
g(x) = f (b) − P(n−1)(x, b) − k(b − x)n n! .
Then g(a) = 0 using (1). But it also holds that g(b) = 0 because P (b, b) = f (b). Then there exists c ∈ (a, b) such that g0(c) = 0. Notice that:
g0(x = −P(n−1)0 (x, b) + k(b − x)n−1 (n − 1)!
Notice that:
P(n − 1)(x, b) =
n−1
X
k=0
f(k)(x)(b − x)k k!
Then it follows that:
P(n−1)0 (x, b) = f0(x) − f0(x) + f00(x)(b − x) − f00(x)(x − b) + · · · + f(n)(x)(b − x)(n−1) (n − 1)!
At x = c, then
0 = g0(c) = [−fn(c) + k](b − c)n−1 n − 1! . Since the term on the right is not zero, our conclusion follows.
Question How about derivatives of complex functions?
Example Let f : R → C given by f (x) = eix = cos x + i sin x. Consider f on [0, 2π]. Notice f (0) = 1 = f (2π). So f (2π) − f (0) = 0. The mean-value theorem would imply there is a point in (0, 2π) with derivative zero. But f0(x) = ieix and moreover |f0(x)| = 1. Thus the mean-value theorem does not hold for complex differentiation. Similarly, L’hˆopital’s rule does not hold.
Theorem Let f : [a, b] → Rk be continuous and differentiable on (a, b). Then exists c ∈ (a, b) such that |f (b) − f (a)| ≤ (b − a)|f0(c)|.
Inverse Function Theorem Suppose that
U ⊆ Rn is open, f : U → Rn is C1, x0 ∈ U,
dfx0 : Rn → Rn is invertible.
Then there exists
a neighborhood V of x0 in U a neighborhood W of f (x0) in Rn x0 ∈ U,
dfx0 : Rn → Rn is invertible.
such that f has a C1 inverse g = f−1 : W → V such that
f (g(y)) = y ∀ y ∈ W and g(f (x)) = x ∀ x ∈ V.
Moreover
∂gi
∂yj(y)
1≤i,j≤n
= dgy = (dfg(y))−1 = ∂fi
∂xj(g(y))
−1 1≤i,j≤n
∀ y ∈ W and g is smooth whenever f is smooth.
Remarks
(a) The theorem says that a continuously differentiable function f between regions in Rn is locally invertible near points where its differential is invertible,
i.e.
W = {y = (y1(x), . . . , yn(x)) = (f1(x), . . . , fn(x)) = f (x) | x ∈ V }, V = {x = (x1(y), . . . , xn(y)) = (g1(y), . . . , gn(y)) = g(y) | y ∈ W }, and each coordinate function is continuously differentiable.
(b) Let 0 < a < 1, define f : R → R by
f (x) =
ax + x2sin1
x if x 6= 0
0 if x = 0
.
Compute f0(x) for all x ∈ R. Show that f0(0) > 0, yet f is not one-to-one in any neighbor- hood of 0.
Hint Note that
(i) there exists a sequence of points {xn → 0} at which f0(xn) = 0, and f00(xn) 6= 0, (ii) if f0(p) = 0, f00(p) 6= 0, then f cannot be one-to-one near p.
Remark This example shows that in the Inverse Function Theorem, the hypothesis that f is C1 cannot be weaken to the hypothesis that f is differentiable.
(c) Define f : R2 → R2 by
f (x, y) = (excos y, exsin y).
Show that f is C1 and df(x,y) is invertible for all (x, y) ∈ R2 and yet f is not one-to-one function globally. Why doesn’t this contradict the Inverse Function Theorem?
Example Use Inverse Function Theorem to determine whether the system u(x, y, z) = x + xyz
v(x, y, z) = y + xy w(x, y, z) = z + 2x + 3z2 can be solved for x, y, z in terms of u, v, w near p = (0, 0, 0).
Solution Set F (x, y, z) = (u, v, w). Then
DF (p) =
ux uy uz vx vy vz
wx wy wz
(p) =
1 + yz xz xy
y 1 + x 0
2 0 1 + 6z
(p) =
1 0 0 0 1 0 2 0 1
and
1 0 0 0 1 0 2 0 1
= 1 6= 0.
By the Inverse Function Theorem, the inverse F−1(u, v, w) exists near p = (0, 0, 0), i.e. we can solve x, y, z in terms of u, v, w near p = (0, 0, 0).
Implicit Function Theorem Let
U ⊆ Rm+n≡ Rm× Rn be an open set, f = (f1, . . . , fn) : U → Rn be a C1 function, (a, b) ∈ U be a point at which f (a, b) = 0, the n × n matrix dyf |(a,b)= ∂fi
∂yj(a, b)
1≤i,j≤n
is invertible.
Then there exists
a neighborhood V of (a, b) in U, a neighborhood W of a in Rm, and a C1 function g : W → Rn
such that
{(x, y) ∈ V ⊂ Rm× Rn| f (x, y) = 0}
= {(x, g(x)) | x ∈ W }
= the graph of g over W.
i.e. Letting
S = {(x, y) ∈ Rm× Rn | f (x, y) = 0}
denote the solution set, then S ∩ V is of the dimension m, and it is equal to graph(g), the graph of a C1 function g, over W . Moreover
dgx = − (dyf )−1|(x,g(x)) · dxf |(x,g(x)) and g is smooth whenever f is smooth.
Example Let F : R4 → R2 be given by
F (x, y, z, w) = (G(x, y, z, w), H(x, y, z, w)) = (y2+ w2− 2xz, y3+ w3+ x3− z3), and let p = (1, −1, 1, 1).
(a) Show that we can solve F (x, y, z, w) = (0, 0) for (x, z) in terms of (y, w) near (−1, 1).
Solution Since
DF (p) =Gx Gy Gz Gw Hx Hy Hz Hw
(p) = −2 −2 −2 2
3 3 −3 3
and
Gx Gz Hx Hz
(p) =
−2 −2 3 −3
= 12 6= 0,
we can write (x, z) in terms of (y, w) near (−1, 1) by Implicit Function Theorem.
(b) If (x, z) = Φ(y, w) is the solution in part (a), show that DΦ(−1, 1) is given by the matrix
−−2 −2 3 −3
−1
−2 2
3 3
=−1 0
0 1
Solution The Implicit Function Theorem implies that, near p,the solution set {(x, y, z, w) | F (x, y, z, w) = (0, 0)}
can be represented as the graph of (x, z) = Φ(y, w) near (−1, 1).
Hence, we have
∂F
∂y = (0, 0) and ∂F
∂w = (0, 0) near (−1, 1).
Therefore
0 = Gx∂x
∂y + Gy+ Gz∂z
∂y and 0 = Gx∂x
∂w + Gz∂z
∂w + Gw
which implies that
−[Gy, Gw] = [Gx, Gz]
∂x
∂y
∂x
∂w
∂z
∂y
∂z
∂w
. Similarly, we have
−[Hy, Hw] = [Hx, Hz]
∂x
∂y
∂x
∂w
∂z
∂y
∂z
∂w
. Thus, we have
−Gy Gw
Hy Hw
=Gx Gz
Hx Hz
∂x
∂y
∂x
∂w
∂z
∂y
∂z
∂w
, or
DΦ =
∂x
∂y
∂x
∂z ∂w
∂y
∂z
∂w
= −Gx Gz
Hx Hz
−1
Gy Gw
Hy Hw
.
Hence, DΦ(−1, 1) is given by the matrix
−−2 −2 3 −3
−1
−2 2
3 3
=−1 0
0 1
.
Remarks
(a) The theorem says that if
f : U ⊂ Rm+n → Rn
is a map of the class C1(U ), (a, b) is a point in U, and dyf |(a,b) = h∂fi
∂yj(a, b) i
1≤i,j≤n is invertible, then, locally, the solution set
S = {(x, y) ∈ U | f (x, y) = f (a, b)} is a C1 “manifold” of dimension m.
(b) Let
f : U ⊂ Rm+n → Rn
be a map of the class C1 on an open subset U of R(m+n), and assume that the differential dfp = ∂fi
∂xj
1≤i≤n;1≤j≤(m+n)
is of the constant rank k at each p ∈ U.
By relabeling, if necessary, we may assume that
f = (fˆ 1, . . . , fk) : U ⊂ Rm+n→ Rk is a map with its differential
d ˆfp = ∂fi
∂xj
1≤i≤k;1≤j≤(m+n)
of rank k.
The Implicit Function Theorem says that locally the solution set
S = {x ∈ U | ˆf (x) = 0} is a C1 “manifold” of dimension (m + n) − k.
Identify S with an open neighborhood W ⊂ R(m+n)−k of 0 ∈ R(m+n)−k and identify U with V × W, such that we write each x ∈ V × W ⊂ R(m+n)as x = (¯x, ˆx), where ¯x = (¯x1, . . . , ¯xk) ∈ V, and ˆx = (ˆxk+1, . . . , ˆxm+n) ∈ W. This implies
f (0, ˆˆ x) = 0 and
"
∂ ˆfi
∂ ¯xj
#
1≤i,j≤k
is invertible on V × W.
Using the Inverse Function Theorem, we may show that the range
f (V × W ) = f (U ) locally is a k-dimensional “manifold”.ˆ
(c) Two mappings are said to be locally equivalent if under suitable choices of local coordinate systems in the domain and range spaces (with origin at 0) they can be written by the same formulas. The Implicit Function Theorem says that if
f, g : U ⊂ Rm+n → Rn
are maps of the class C1(U ) , p is a point in U, and rk[df (p)] = rk[dg(p)] = n, then f and g are equivalent, i.e. there exists diffeomorphisms
h : Rm+n → Rm+n and k : Rn → Rn, such that
f ◦ h = k ◦ g.
Definitions
(a) A map f : U ⊂ Rm → Rn is called a Lipschitz map on U if there exists a constant C ≥ 0 such that
kf (x) − f (y)k ≤ Ckx − yk for all x, y ∈ U.
If one can choose a (Lipschitz) constant C < 1 such that the above Lipschitz condition hold on U, then f is called a contraction map.
Example Let U be a convex subset of Rm, and f : U ⊂ Rm→ Rn be a map with bounded kdf k = sup kdfp(x)kRn
kxkRm | p ∈ U, x 6= 0
.
For each i = 1, . . . , m, by setting x = ei, the ith unit vector of Rm, we get k∇fik ≤ kdf k for each i = 1, . . . , m
which implies that f is Lipscitz on U.
(b) Let U be a subset of Rm, and f be a map that maps U into U, i.e. f : U → U. A point p ∈ U is said to be a fixed point of f if f (p) = p.
Fixed Point Theorem for Contractions Let f : Rm → Rm be a contraction map. Then f has a fixed point.
Note that Rm is complete which implies that any Cauchy sequence (is bounded and has a limiting point by Bolzano-Weierstrass theorem, and Cauchy condition implies that it) converges.