Chapter 1 Mathematical Preliminaries and Error Analysis

(1)

Hung-Yuan Fan (范洪源)

Department of Mathematics, National Taiwan Normal University, Taiwan

Spring 2016

(2)

Section 1.1

Review of Calculus

(3)

Limits

Def 1.1

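A function $f : X \to \mathbb{R}$ has the limit $L$ at $x_0$, denoted by

$$\lim_{x \to x_0} f(x) = L,$$

if $\forall\, \varepsilon > 0$, $\exists\, \delta > 0$ s.t. $x \in X$, $0 < |x - x_0| < \delta \Rightarrow |f(x) - L| < \varepsilon$.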

(4)

Continuity

Def 1.2

1 A function $f : X \to \mathbb{R}$ is continuous (abbreviated: conti.) at $x_0 \in X$ if

$$\lim_{x \to x_0} f(x) = f(x_0).$$

2 $f$ is conti. on $X$ if it is conti. at each point of $X$.

3 $C(X) = \{ f \mid f \text{ is conti. on } X \}$ denotes the set of all conti. functions defined on $X$.

Note: if $X = [a, b]$, $(a, b)$, $[a, b)$ or $(a, b]$ with $a < b$, write $C[a, b]$, $C(a, b)$, $C[a, b)$ or $C(a, b]$, respectively.

(5)

Limits of Sequences

Def 1.3

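A sequence (abbreviated: seq.) of real numbers $\{x_n\}_{n=1}^{\infty}$ converges (abbreviated: conv.) to the limit $x$, written

$$\lim_{n \to \infty} x_n = x, \quad \text{or} \quad x_n \to x \text{ as } n \to \infty,$$

if $\forall\, \varepsilon > 0$, $\exists\, N(\varepsilon) \in \mathbb{N}$ s.t. $n > N(\varepsilon) \Rightarrow |x_n - x| < \varepsilon$.

Thm 1.4 (Relation between sequences and continuity)

Let $f$ be a real-valued function defined on $\emptyset \neq X \subseteq \mathbb{R}$ and $x_0 \in X$. The following are equivalent:

a. $f$ is conti. at $x_0$.

b. $\forall$ seq. $\{x_n\}_{n=1}^{\infty} \subseteq X$ with $\lim_{n \to \infty} x_n = x_0$, $\lim_{n \to \infty} f(x_n) = f(x_0)$.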

(6)

Differentiability

Def 1.5

1 A function $f : X \to \mathbb{R}$ is differentiable (abbreviated: diff.) at $x_0 \in X$ if the limit

$$f'(x_0) = \lim_{x \to x_0} \frac{f(x) - f(x_0)}{x - x_0} = \lim_{h \to 0} \frac{f(x_0 + h) - f(x_0)}{h}$$

exists.

2 $f$ is diff. on $X$ if it is diff. at each point of $X$.

3 $C^n(X)$ denotes the set of all functions having $n$ conti. derivatives on $X$.

4 $C^\infty(X)$ denotes the set of functions having derivatives of all orders on $X$.

(7)

Continuity vs. Differentiability

Thm 1.6

Let $f$ be a real-valued function defined on $X$ and $x_0 \in X$. Then $f$ is diff. at $x_0$ $\Longrightarrow$ $f$ is conti. at $x_0$.

(8)

Rolle’s Theorem

Thm 1.7 (Rolle’s Thm)

Suppose $f \in C[a, b]$ and $f$ is diff. on $(a, b)$. If $f(a) = f(b)$, then $\exists\, c \in (a, b)$ s.t. $f'(c) = 0$.

(9)

Generalized Rolle’s Theorem

Thm 1.10 (Generalized Rolle’s Thm)

Suppose $f \in C[a, b]$ is $n$ times diff. on $(a, b)$. If $f(x_i) = 0$ at $n + 1$ distinct numbers $a \le x_0 < x_1 < \cdots < x_n \le b$, then $\exists\, c \in (x_0, x_n) \subseteq [a, b]$ s.t. $f^{(n)}(c) = 0$.

(10)

Mean Value Theorem (abbreviated: MVT)

Thm 1.8 (MVT)

Suppose $f \in C[a, b]$ and $f$ is diff. on $(a, b)$. Then $\exists\, c \in (a, b)$ s.t.

$$f'(c) = \frac{f(b) - f(a)}{b - a}, \quad \text{or} \quad f(b) - f(a) = f'(c)(b - a).$$

(11)

Extreme Value Theorem (abbreviated: EVT)

Thm 1.9 (EVT)

If $f \in C[a, b]$, then $\exists\, c_1, c_2 \in [a, b]$ s.t.

$$f(c_1) \le f(x) \le f(c_2) \quad \forall\, x \in [a, b].$$

(12)

Intermediate Value Theorem (abbreviated: IVT)

Thm 1.11 (IVT)

If $f \in C[a, b]$ and $K$ is any number between $f(a)$ and $f(b)$, then $\exists\, c \in (a, b)$ s.t. $f(c) = K$.

(13)

Taylor Polynomials and Series

Thm 1.14 (Taylor’s Thm)

Suppose $f \in C^n[a, b]$, $f^{(n+1)}$ exists on $[a, b]$ and $x_0 \in [a, b]$. Then $\forall\, x \in [a, b]$, $\exists\, \xi(x)$ between $x_0$ and $x$ s.t. $f(x) = P_n(x) + R_n(x)$, where

$$P_n(x) = \sum_{k=0}^{n} \frac{f^{(k)}(x_0)}{k!} (x - x_0)^k \quad \text{(the $n$th Taylor poly. for $f$)},$$

$$R_n(x) = \frac{f^{(n+1)}(\xi(x))}{(n + 1)!} (x - x_0)^{n+1} \quad \text{(remainder or truncation error associated with $P_n(x)$)}.$$

(14)

Taylor Series

Remarks

1 If $\lim_{n \to \infty} R_n(x) = 0$ $\forall\, x \in I$ ($I$: interval with $x_0 \in I$), then

$$f(x) = \lim_{n \to \infty} P_n(x) = \sum_{k=0}^{\infty} \frac{f^{(k)}(x_0)}{k!} (x - x_0)^k \quad \forall\, x \in I.$$

We say that the Taylor series for $f$ about $x_0$ conv. to $f$ on $I$.

2 If $x_0 = 0$, the Taylor series is often called the Maclaurin series.

(15)

Example 3, p. 11

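The second (or third) Taylor poly. for $f(x) = \cos x$ about $x_0 = 0$ is $P_2(x) = P_3(x) = 1 - \frac{1}{2}x^2$, but their truncation errors satisfy

$$|R_2(x)| \le \frac{|\sin \xi(x)|\,|x|^3}{6} \le \frac{|x|^4}{6} = 0.1\overline{6} \cdot |x|^4 \quad (\because\ |\sin \xi(x)| \le |\xi(x)| \le |x| \ \ \forall\, x \in \mathbb{R}),$$

$$|R_3(x)| \le \frac{|\cos \tilde{\xi}(x)|\,|x|^4}{24} \le \frac{|x|^4}{24} = 0.041\overline{6} \cdot |x|^4. \quad \text{(Sharper bound for $|x| \approx 0$!)}$$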
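The two bounds can be compared numerically. The following is a minimal Python sketch, not part of the textbook example; the helper names `p2_cos` and `truncation_report` and the sample points are assumptions chosen for illustration.

```python
import math

def p2_cos(x):
    """Second (and third) Taylor polynomial of cos x about x0 = 0."""
    return 1.0 - 0.5 * x * x

def truncation_report(x):
    """Return (actual error, bound from R_2, bound from R_3) at x."""
    actual = abs(math.cos(x) - p2_cos(x))
    bound_r2 = abs(x) ** 4 / 6.0    # |x|^4 / 6
    bound_r3 = abs(x) ** 4 / 24.0   # |x|^4 / 24
    return actual, bound_r2, bound_r3

for x in (0.1, 0.5, 1.0):
    actual, b2, b3 = truncation_report(x)
    print(f"x = {x}: error = {actual:.3e}, R2 bound = {b2:.3e}, R3 bound = {b3:.3e}")
```

At each sample point the actual error stays below the $R_3$ bound, which in turn is four times smaller than the $R_2$ bound.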

(16)

What are the goals of numerical analysis?

Remark

Two objectives of numerical analysis:

1 Find an approximation to the solution of a given problem.

2 Determine a bound for the accuracy of the approximation.

Is this error bound tight and sharp?

(17)

Integration (1/2)

Def 1.12 (Definition of the definite integral)

1 The (Riemann) definite integral of $f$ on $[a, b]$ is defined by

$$\int_a^b f(x)\,dx = \lim_{\max_{1 \le i \le n} \Delta x_i \to 0} \ \sum_{i=1}^{n} f(z_i)\,\Delta x_i,$$

where $P = \{a = x_0 < x_1 < \cdots < x_n = b\}$ is any partition of $[a, b]$, $z_i \in [x_{i-1}, x_i]$ and $\Delta x_i = x_i - x_{i-1}$ for $i = 1, 2, \ldots, n$.

2 $f$ is called (Riemann) integrable over $[a, b]$ if the limit exists.

Note: $f$ is conti. on $[a, b]$ $\Rightarrow$ $f$ is integrable over $[a, b]$.

(18)

Integration (2/2)

Remark

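$f$ is integrable over $[a, b]$ $\Longrightarrow$

$$\int_a^b f(x)\,dx = \lim_{n \to \infty} \sum_{i=1}^{n} f(z_i)\,\Delta x \quad \Big( \Delta x = \frac{b - a}{n} \Big)$$

$$\approx \sum_{i=0}^{n} w_i \cdot f(x_i)\,\Delta x \quad (w_i\text{: weighting coeff.}),$$

with equally spaced nodes $x_i = a + i\,\Delta x$ and $z_i = x_i$ or $x_{i-1}$ for $i = 1, 2, \ldots, n$.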
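As an illustration of the two choices $z_i = x_i$ and $z_i = x_{i-1}$, here is a minimal Python sketch of equally spaced Riemann sums. The function name `riemann_sum`, the `rule` parameter and the test integrand $x^2$ on $[0, 1]$ are assumptions made for this example.

```python
def riemann_sum(f, a, b, n, rule="right"):
    """Equally spaced Riemann sum of f on [a, b] with n subintervals.

    rule="right" uses z_i = x_i, rule="left" uses z_i = x_{i-1}.
    """
    dx = (b - a) / n
    offset = 1 if rule == "right" else 0
    return sum(f(a + (i + offset) * dx) for i in range(n)) * dx

def square(x):
    return x * x

for n in (10, 100, 1000):
    left = riemann_sum(square, 0.0, 1.0, n, "left")
    right = riemann_sum(square, 0.0, 1.0, n, "right")
    print(n, left, right)  # both approach the exact value 1/3
```

Because $x^2$ is increasing on $[0, 1]$, the left sums approach $1/3$ from below and the right sums from above.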

(19)

Riemann Sums with $z_i = x_i$ for all $i$

(20)

Weighted MVT for Definite Integrals

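Thm 1.13 (Weighted Mean Value Theorem for definite integrals)

Suppose $f \in C[a, b]$ and $g$ is an integrable function that does not change sign on $[a, b]$. Then $\exists\, c \in (a, b)$ s.t.

$$\int_a^b f(x) g(x)\,dx = f(c) \int_a^b g(x)\,dx.$$

Note: When $g(x) \equiv 1$, we have

$$f(c) = \frac{1}{b - a} \int_a^b f(x)\,dx \equiv f_{\mathrm{avg}},$$

where $f_{\mathrm{avg}}$ is the average value of $f$ on $[a, b]$.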

(21)

The Average Value of a Function

(22)

Section 1.2

Round-off Errors and Computer Arithmetic

(23)

Binary Machine Numbers

IEEE 754-1985 Standard (updated version: IEEE 754-2008)

1 Single Precision Format (32 bits)

2 Double Precision Format (64 bits)

3 Extended Precision Format (80 bits): sign: 1 bit, exponent: 15 bits, significand: 64 bits (1 explicit integer bit and 63 fraction bits)

(24)

64-bit Floating-Point Representation

A 64-bit representation is used for a real number.

Each binary floating-point number has about 16 decimal digits of precision (53 significant bits $\approx$ 15.95 decimal digits).

A 1-bit sign $s$ is followed by an 11-bit exponent $c$ (the characteristic, $0 \le c \le 2^{11} - 1 = 2047$) and a 52-bit binary fraction $f$ (the mantissa).

(25)

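The Normalized Forms

The normalized binary floating-point form of $x \in \mathbb{R}$ is

$$fl(x) = (-1)^s\, 2^{c-1023}\, (1 + f)_2 = (-1)^s \Big( 1 + \sum_{i=1}^{k} b_i 2^{-i} \Big)_{10} 2^{c-1023},$$

where $f = (0.b_1 b_2 \cdots b_k)_2$.

$F = \{ fl(y) \mid y \in \mathbb{R} \}$ is a finite (and proper) subset of $\mathbb{R}$.

The difference between two adjacent 64-bit floating-point numbers in $[1, 2)$ is $\varepsilon_M = 2^{-52} \approx 2.22 \times 10^{-16}$; for a general exponent $c$ the spacing is $2^{c-1023}\,\varepsilon_M$.

Note: the machine precision (or epsilon) is $\varepsilon_M = 2^{-23} \approx 1.19 \times 10^{-7}$ for the single precision format.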

(26)

Some Examples

1 Since

$$27.56640625_{10} = 11011.10010001_2 = 1.101110010001_2 \times 2^4 \quad \text{(Normalized Form)},$$

we have $s = 0$, $c = 4 + 1023 = 1027_{10} = 10000000011_2$ and mantissa $f = 0.101110010001_2$. Using the IEEE 754 format $\Rightarrow$

0 | 10000000011 | 101110010001 00···0 (padded with 40 zeros!)

2 Note that

$$0.1_{10} = 0.0\overline{0011}_2 = 1.1\overline{0011}_2 \times 2^{-4}.$$

How to store $0.1_{10}$ by using the IEEE 754 format?
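One way to inspect the stored fields is to reinterpret the 8 bytes of a double as an integer. This is a minimal Python sketch using the standard `struct` module; the helper name `double_fields` is an assumption, and it presumes that Python floats are IEEE 754 doubles, which holds on common platforms.

```python
import struct

def double_fields(x):
    """Split a 64-bit double into (sign s, exponent c, 52-bit fraction f)."""
    (bits,) = struct.unpack(">Q", struct.pack(">d", x))
    s = bits >> 63                  # 1-bit sign
    c = (bits >> 52) & 0x7FF        # 11-bit characteristic
    f = bits & ((1 << 52) - 1)      # 52-bit fraction as an integer
    return s, c, f

for value in (27.56640625, 0.1):
    s, c, f = double_fields(value)
    print(value, s, format(c, "011b"), format(f, "052b"))
```

For $0.1$ the repeating pattern $0011$ cannot be stored exactly, so the 52-bit fraction is rounded in its last place and the stored number is only an approximation to $0.1$.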

(27)

Remarks on IEEE 754 Format

1 The smallest positive normalized floating-point number (with $s = 0$, $c = 1$ and $f = 0$) is

$$fl_{\min} = 2^{-1022}(1 + 0) \approx 2.2 \times 10^{-308}.$$

2 The largest one (with $s = 0$, $c = 2046$ and $f = 1 - 2^{-52}$) is

$$fl_{\max} = 2^{1023}(2 - 2^{-52}) \approx 1.8 \times 10^{308}.$$

3 $|fl(x)| > fl_{\max}$ $\Rightarrow$ overflow, and $|fl(x)| < fl_{\min}$ $\Rightarrow$ underflow and reset $x = 0$.

4 Two zeros $+0$ (with $s = 0$, $c = 0$, $f = 0$) and $-0$ (with $s = 1$, $c = 0$, $f = 0$) exist!
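These constants can be checked against the values Python reports in `sys.float_info`. The sketch below assumes Python floats are IEEE 754 doubles; the variable names are chosen for this illustration. One caveat: actual IEEE 754 hardware uses gradual underflow, so halving $fl_{\min}$ gives a nonzero subnormal number rather than the reset to zero assumed in the simplified model above.

```python
import math
import sys

fl_min = 2.0 ** -1022                       # smallest positive normalized double
fl_max = (2.0 - 2.0 ** -52) * 2.0 ** 1023   # largest finite double
eps = 2.0 ** -52                            # gap between 1 and the next double

print(fl_min == sys.float_info.min, fl_max == sys.float_info.max)
print(eps == sys.float_info.epsilon)
print(1.0 + eps > 1.0, 1.0 + eps / 2 == 1.0)   # eps/2 is lost when added to 1
print(fl_max * 2.0)                            # overflow produces inf
print(fl_min / 2.0 > 0.0)                      # subnormal, not flushed to zero
print(0.0 == -0.0, math.copysign(1.0, -0.0))   # two zeros with different signs
```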

(28)

Decimal Machine Numbers

The normalized decimal floating-point form of $y \in \mathbb{R}$ is

$$fl(y) = \pm 0.d_1 d_2 \cdots d_k \times 10^n,$$

where $1 \le d_1 \le 9$, $0 \le d_i \le 9$ ($i = 2, \ldots, k$) and $n \in \mathbb{Z}$. In this case, $fl(y)$ is called a $k$-digit decimal machine number.

The $k$-digit $fl(y)$ of a normalized real number

$$y = \pm 0.d_1 d_2 \cdots d_k d_{k+1} \cdots \times 10^n$$

can be obtained by terminating the mantissa of $y$ at $k$ decimal digits.

(29)

Two Methods of Termination

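1 Chopping:

$$fl(y) = \pm 0.d_1 d_2 \cdots d_k \times 10^n,$$

i.e. simply chop off the digits $d_{k+1} d_{k+2} \cdots$.

2 Rounding:

$$fl(y) = \begin{cases} \pm (0.d_1 d_2 \cdots d_k + 10^{-k}) \times 10^n, & d_{k+1} \ge 5 \quad \text{(Round Up)} \\ \pm 0.d_1 d_2 \cdots d_k \times 10^n, & d_{k+1} < 5 \quad \text{(Round Down)} \end{cases}$$

$$\equiv \pm 0.\delta_1 \delta_2 \cdots \delta_k \times 10^n \quad \text{after chopping.}$$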

(30)

Determine the 5-digit (a) chopping and (b) rounding values of π = 0.31415926· · · × 10¹.

Sol:

(a) fl(π) = 0.31415× 10¹ by chopping.

(b) fl(π) = (0.31415 + 10⁻⁵)× 10¹ = 0.31416× 10¹ by rounding.
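Both termination rules can be simulated with Python's standard `decimal` module. In this minimal sketch the helper name `fl` and its `mode` argument are assumptions for illustration: `ROUND_DOWN` plays the role of chopping and `ROUND_HALF_UP` the role of rounding.

```python
from decimal import Context, Decimal, ROUND_DOWN, ROUND_HALF_UP

def fl(value, k, mode="chop"):
    """Return the k-digit decimal machine number of value."""
    rounding = ROUND_DOWN if mode == "chop" else ROUND_HALF_UP
    return Context(prec=k, rounding=rounding).create_decimal(value)

PI = Decimal("3.14159265358979323846")
print(fl(PI, 5, "chop"))   # 3.1415
print(fl(PI, 5, "round"))  # 3.1416
```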

(31)

Absolute and Relative Errors

Def 1.15

If $p^*$ is an approximation to $p$, then

1 the absolute error is $AE(p^*) = |p^* - p|$.

2 the relative error is

$$RE(p^*) = \frac{|p^* - p|}{|p|}, \quad \text{provided that } p \neq 0.$$

Note: the relative error is independent of the magnitude of $p$, but the absolute error might vary widely!

(32)

Examples of Abs. and Rel. Errors

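Find the abs. and rel. errors when approximating $p$ by $p^*$.

(a) $p = 0.3000 \times 10^{1}$ and $p^* = 0.3100 \times 10^{1}$.

(b) $p = 0.3000 \times 10^{-3}$ and $p^* = 0.3100 \times 10^{-3}$.

(c) $p = 0.3000 \times 10^{4}$ and $p^* = 0.3100 \times 10^{4}$.

Sol:

(a) $AE(p^*) = 0.1$ and $RE(p^*) = 0.333\overline{3} \times 10^{-1}$.

(b) $AE(p^*) = 0.1 \times 10^{-4}$ and $RE(p^*) = 0.333\overline{3} \times 10^{-1}$.

(c) $AE(p^*) = 0.1 \times 10^{3}$ and $RE(p^*) = 0.333\overline{3} \times 10^{-1}$.

(The relative errors are all the same, but the absolute errors vary widely!)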
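The three cases can be reproduced exactly with rational arithmetic. This is a minimal Python sketch; the function names `abs_err` and `rel_err` and the `cases` dictionary are assumptions for illustration.

```python
from fractions import Fraction

def abs_err(p, p_star):
    """Absolute error |p* - p|."""
    return abs(p_star - p)

def rel_err(p, p_star):
    """Relative error |p* - p| / |p|, defined only for p != 0."""
    if p == 0:
        raise ValueError("relative error is undefined for p = 0")
    return abs(p_star - p) / abs(p)

cases = {
    "a": (Fraction("0.3000e1"), Fraction("0.3100e1")),
    "b": (Fraction("0.3000e-3"), Fraction("0.3100e-3")),
    "c": (Fraction("0.3000e4"), Fraction("0.3100e4")),
}
for label, (p, p_star) in cases.items():
    print(label, float(abs_err(p, p_star)), float(rel_err(p, p_star)))
```

Every case gives the same relative error $1/30$, while the absolute errors differ by factors of up to $10^7$.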

(33)

Significant Digits

Def 1.16

$p^*$ approximates $p \neq 0$ to $t$ significant digits (or figures) if $t$ is the largest number in $\mathbb{N} \cup \{0\}$ satisfying

$$RE(p^*) = \frac{|p^* - p|}{|p|} \le 5 \times 10^{-t}.$$

Note: for any normalized $y = 0.d_1 d_2 \cdots \times 10^n \in \mathbb{R}$, its $k$-digit decimal representation satisfies

$$RE(fl(y)) \le 10^{-k+1} = 10^{-(k-1)}$$

by using chopping (see the textbook), and

$$RE(fl(y)) \le 0.5 \times 10^{-k+1} = 5 \times 10^{-k}$$

by using rounding.

(34)

Finite-Digit Arithmetic

Elementary Floating-Point Arithmetic

For floating-point representations $fl(x)$ and $fl(y)$ of real numbers $x$ and $y$, assume that

$$x \oplus y = fl(fl(x) + fl(y)), \quad x \otimes y = fl(fl(x) \times fl(y)),$$

$$x \ominus y = fl(fl(x) - fl(y)), \quad x \oslash y = fl(fl(x) \div fl(y)).$$

Note: in practical computation, we usually have

$$fl(x \ \mathrm{op}\ y) = (x \ \mathrm{op}\ y)(1 + \delta) \quad \text{with } |\delta| \le \varepsilon_M,$$

where $\mathrm{op} = +, -, \times, \div$, and $\varepsilon_M$ is the machine precision.

(35)

Subtraction of Nearly Equal Numbers

Cancellation of Significant Digits

If $x, y \in \mathbb{R}$ ($x > y$) have the $k$-digit decimal representations

$$fl(x) = 0.d_1 d_2 \cdots d_p \alpha_{p+1} \alpha_{p+2} \cdots \alpha_k \times 10^n,$$

$$fl(y) = 0.d_1 d_2 \cdots d_p \beta_{p+1} \beta_{p+2} \cdots \beta_k \times 10^n,$$

then

$$fl(x) - fl(y) = (0.\alpha_{p+1} \alpha_{p+2} \cdots \alpha_k - 0.\beta_{p+1} \beta_{p+2} \cdots \beta_k) \times 10^{n-p} \equiv 0.\sigma_{p+1} \sigma_{p+2} \cdots \sigma_k \times 10^{n-p},$$

i.e. $x \ominus y = fl(fl(x) - fl(y))$ has at most $k - p$ significant digits, with the last $p$ digits being either 0 or randomly assigned.

(36)

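Magnification of Absolute Errors

Remark

Suppose that $fl(z) = z + \delta$ with $|\delta|$ being the absolute error. If $\varepsilon = 10^{-n}$ with $n \in \mathbb{N}$ is a number of small magnitude, then

$$\frac{fl(z)}{fl(\varepsilon)} \approx (z + \delta) \times 10^n = \frac{z}{\varepsilon} + 10^n \delta.$$

So, the absolute error in computing $z / \varepsilon$ is

$$\left| \frac{fl(z)}{fl(\varepsilon)} - \frac{z}{\varepsilon} \right| \approx 10^n \cdot |\delta| = \frac{|\delta|}{\varepsilon}.$$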

(37)

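Example 4, pp. 23–24 (1/2)

Given four real numbers

$$x = \tfrac{5}{7} = 0.\overline{714285}, \quad u = 0.714251, \quad v = 98765.9, \quad w = 0.111111 \times 10^{-4}.$$

Find 5-digit chopping values of $x \ominus u$, $(x \ominus u) \oslash w$, $(x \ominus u) \otimes v$ and $u \oplus v$.

Sol: The absolute error for $x \ominus u$ is

$$|(x - u) - (x \ominus u)| = |(x - u) - fl(fl(x) - fl(u))|$$

$$= \left| \left( \tfrac{5}{7} - 0.714251 \right) - fl(0.71428 \times 10^0 - 0.71425 \times 10^0) \right|$$

$$= |0.347143 \times 10^{-4} - 0.30000 \times 10^{-4}| = 0.47143 \times 10^{-5}.$$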

(38)

Example 4, pp. 23–24 (2/2)

The relative error for $x \ominus u$ is given by

$$RE(x \ominus u) = \frac{0.47143 \times 10^{-5}}{0.347143 \times 10^{-4}} \approx 0.1358 \le 0.136.$$
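The four requested values can be reproduced with a 5-digit chopping context from Python's `decimal` module. This is a minimal sketch under that assumption; the names `CHOP5` and `fl` and the variable names are chosen here for illustration.

```python
from decimal import Context, Decimal, ROUND_DOWN
from fractions import Fraction

CHOP5 = Context(prec=5, rounding=ROUND_DOWN)   # 5-digit chopping arithmetic

def fl(value):
    """Chop value to a 5-digit decimal machine number."""
    return CHOP5.create_decimal(value)

x = fl(Decimal(5) / Decimal(7))   # 0.71428
u = fl("0.714251")                # 0.71425
v = fl("98765.9")                 # 98765
w = fl("0.111111E-4")             # 0.000011111

x_minus_u = CHOP5.subtract(x, u)
quotient = CHOP5.divide(x_minus_u, w)
product = CHOP5.multiply(x_minus_u, v)
u_plus_v = CHOP5.add(u, v)
print(x_minus_u, quotient, product, u_plus_v)

# Relative error of x (-) u against the exact difference 5/7 - 0.714251.
exact = Fraction(5, 7) - Fraction("0.714251")
rel = abs(exact - Fraction(x_minus_u)) / exact
print(float(rel))
```

Each machine operation is computed exactly and then chopped once, which matches the model $fl(fl(x) \ \mathrm{op}\ fl(y))$ used above.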

(39)

How to avoid the loss of accuracy?

Some Tricks

1 Reformulate the calculations to avoid the subtraction of two nearly equal numbers.

2 Rearrange the calculations by nested arithmetic to reduce the number of arithmetic operations.

The lesson:

Think before you compute!

(40)

Illustration of Trick 1

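Distinct real roots of $ax^2 + bx + c = 0$ with $a \neq 0$ and $b^2 - 4ac > 0$ are

$$x_1 = \frac{-b + \sqrt{b^2 - 4ac}}{2a}, \quad x_2 = \frac{-b - \sqrt{b^2 - 4ac}}{2a}.$$

If $b > 0$ and $4ac \ll b^2$, then $-b + \sqrt{b^2 - 4ac} \approx 0$ $\Rightarrow$ loss of accuracy for computing $x_1$!

Rewrite the formula for $x_1$ by rationalization:

$$x_1 = \frac{-2c}{b + \sqrt{b^2 - 4ac}}.$$

(The denominator is no longer a difference of nearly equal numbers!)

Use $x_1 x_2 = \frac{c}{a}$ $\Rightarrow$ $x_2 = \frac{c}{a x_1} = \frac{-b - \sqrt{b^2 - 4ac}}{2a}$.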

(41)

An Example for Trick 1 (1/2)

Example, pp. 25–26

Use 4-digit rounding arithmetic to determine the first root $x_1$ of $f(x) = x^2 + 62.10x + 1 = 0$.

Sol: Two real roots of $f(x) = 0$ are approximately

$$x_1 = -0.01610723, \quad x_2 = -62.08390.$$

Use 4-digit rounding $\Rightarrow$

$$fl(\sqrt{b^2 - 4ac}) = fl\left(\sqrt{(62.10)^2 - (4.000)(1.000)(1.000)}\right) = 62.06,$$

$$fl(x_1) = \frac{-62.10 + 62.06}{2.000} = -0.02000,$$

with the relative error being

$$\frac{|-0.01611 + 0.02000|}{|-0.01611|} \approx 2.4 \times 10^{-1}.$$

(42)

An Example for Trick 1 (2/2)

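In addition, if we use the reformulation for $x_1$, then

$$fl(x_1) = fl\left( \frac{fl(-2c)}{fl(b + \sqrt{b^2 - 4ac})} \right) = fl\left( \frac{-2.000}{62.10 + 62.06} \right) = -0.01610,$$

which has the small relative error $6.2 \times 10^{-4}$ (measured against $x_1 \approx -0.01611$).

Note: the accuracy of the near-zero root $x_1$ improves to 3 significant digits!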
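The 4-digit computation can be replayed with a `decimal` context of precision 4. The following minimal Python sketch is an illustration only; the context name `R4` and the variable names are assumptions, and `ROUND_HALF_UP` stands in for the rounding rule of the slides.

```python
from decimal import Context, Decimal, ROUND_HALF_UP

R4 = Context(prec=4, rounding=ROUND_HALF_UP)   # 4-digit rounding arithmetic
a, b, c = Decimal("1.000"), Decimal("62.10"), Decimal("1.000")

# fl(sqrt(b^2 - 4ac)), rounding after every single operation
four_ac = R4.multiply(R4.multiply(Decimal(4), a), c)
disc = R4.subtract(R4.multiply(b, b), four_ac)
root = R4.sqrt(disc)

# Standard formula: subtracts two nearly equal numbers in the numerator
x1_naive = R4.divide(R4.add(-b, root), R4.multiply(Decimal(2), a))

# Rationalized formula: the denominator is a sum, so no cancellation occurs
x1_stable = R4.divide(R4.multiply(Decimal(-2), c), R4.add(b, root))

reference = Decimal("-0.01611")   # x1 rounded to 4 digits
for name, value in (("naive", x1_naive), ("stable", x1_stable)):
    rel = abs((value - reference) / reference)
    print(name, value, f"{float(rel):.1e}")
```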

(43)

An Example of Polynomial Evaluation (1/2)

Example 6, pp. 26–27

Evaluate the 3-digit chopping and rounding values of the poly. $f(x) = x^3 - 6.1x^2 + 3.2x + 1.5$ at $x = 4.71$.

Sol: The actual value is $y = f(4.71) = -14.263899$. Using 3-digit chopping/rounding arithmetic, the 3-digit approx. values of $y$ are

$$fl(y) = fl\big( ((104. - 134.) + 15.0) + 1.5 \big) = -13.5, \quad \text{(Chopping)}$$

$$fl(y) = fl\big( ((105. - 135.) + 15.1) + 1.5 \big) = -13.4. \quad \text{(Rounding)}$$

(44)

An Example of Polynomial Evaluation (2/2)

Hence, the relative errors in computing $fl(y)$ are

$$RE(fl(y)) = \left| \frac{-14.263899 + 13.5}{-14.263899} \right| \approx 5.36 \times 10^{-2}, \quad \text{(Chopping)}$$

$$RE(fl(y)) = \left| \frac{-14.263899 + 13.4}{-14.263899} \right| \approx 6.06 \times 10^{-2}. \quad \text{(Rounding)}$$

$\Longrightarrow$ Only one significant digit for both chopping and rounding values of $y = f(4.71)$!

(45)

Nested Arithmetic

Rearrangement of Poly. Evaluation

Direct Computation: (4 multiplications and 3 additions)

$$f(x) = x \cdot (x \cdot x) - 6.1 \cdot (x \cdot x) + 3.2 \cdot x + 1.5$$

Nested Computation: (2 multiplications and 3 additions)

$$f(x) = \big( (x - 6.1) \cdot x + 3.2 \big) \cdot x + 1.5$$

Again, using 3-digit arithmetic with the nested form $\Longrightarrow$

$$RE(fl(y)) = \left| \frac{-14.263899 + 14.2}{-14.263899} \right| \approx 4.5 \times 10^{-3}, \quad \text{(Chopping)}$$

$$RE(fl(y)) = \left| \frac{-14.263899 + 14.3}{-14.263899} \right| \approx 2.5 \times 10^{-3}. \quad \text{(Rounding)}$$
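Both evaluation orders can be simulated in 3-digit arithmetic with Python's `decimal` module. This is a minimal sketch; the function names `direct` and `nested` and the context names are assumptions for illustration, with `ROUND_DOWN` modelling chopping and `ROUND_HALF_UP` modelling rounding.

```python
from decimal import Context, Decimal, ROUND_DOWN, ROUND_HALF_UP

CHOP3 = Context(prec=3, rounding=ROUND_DOWN)
ROUND3 = Context(prec=3, rounding=ROUND_HALF_UP)
X = Decimal("4.71")

def direct(ctx, x):
    """x*(x*x) - 6.1*(x*x) + 3.2*x + 1.5, rounding after each operation."""
    x2 = ctx.multiply(x, x)
    x3 = ctx.multiply(x2, x)
    t2 = ctx.multiply(Decimal("6.1"), x2)
    t1 = ctx.multiply(Decimal("3.2"), x)
    s = ctx.add(ctx.subtract(x3, t2), t1)
    return ctx.add(s, Decimal("1.5"))

def nested(ctx, x):
    """((x - 6.1)*x + 3.2)*x + 1.5, rounding after each operation."""
    s = ctx.subtract(x, Decimal("6.1"))
    s = ctx.add(ctx.multiply(s, x), Decimal("3.2"))
    return ctx.add(ctx.multiply(s, x), Decimal("1.5"))

for name, ctx in (("chopping", CHOP3), ("rounding", ROUND3)):
    print(name, direct(ctx, X), nested(ctx, X))
```

The nested form performs fewer operations, so fewer intermediate results are chopped or rounded before the final answer is formed.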

(46)

Useful Suggestion

The accuracy of an approximation can be improved if we reduce the number of arithmetic operations.

HW of Sec 1.2:

√

24,

(47)

Section 1.3

Algorithms and Convergence

(48)

Algorithms and Pseudocode

An algorithm is a procedure that describes a finite sequence of steps to be performed in a specified order.

The objective of an algorithm is to implement a procedure for solving a problem or approximating a solution to the problem.

Pseudocode is an informal, environment-independent description of the key principles of an algorithm.

It uses structural conventions of a programming language, but is intended for human reading rather than machine reading.

(49)

An Example of Pseudocode

To solve the root-finding problem $f(x) = ax^2 + bx + c = 0$ with $a \neq 0$.

INPUT coefficients a, b, c.

OUTPUT approximate root x.

Step 1 Compute the discriminant D = b²− 4ac.

Step 2 Compute approximate root x to f(x) = 0 using D.

Step 3 OUTPUT(x); STOP.
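The pseudocode leaves Step 2 open. One possible realization in Python is sketched below; it is an assumption of this illustration, not the method prescribed by the slide, that both real roots are returned and that the cancellation-free formulas from Section 1.2 are used. The function name `quadratic_roots` is hypothetical.

```python
import math

def quadratic_roots(a, b, c):
    """Return the real roots of a*x^2 + b*x + c = 0 in increasing order."""
    if a == 0:
        raise ValueError("a must be nonzero")
    # Step 1: discriminant
    D = b * b - 4.0 * a * c
    if D < 0:
        raise ValueError("no real roots")
    # Step 2: add quantities of equal sign to avoid cancellation
    q = -0.5 * (b + math.copysign(math.sqrt(D), b))
    if q == 0.0:          # happens only when b = 0 and c = 0
        return (0.0, 0.0)
    roots = (q / a, c / q)
    # Step 3: output
    return tuple(sorted(roots))

print(quadratic_roots(1.0, 62.10, 1.0))
```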

(50)

An Illustration of Algorithm

(51)

The $N$th Taylor poly. of $f(x) = \ln x$ about $x_0 = 1$ is

$$P_N(x) = \sum_{i=1}^{N} \frac{(-1)^{i+1}}{i} (x - 1)^i.$$

Construct an algorithm to determine the minimal value of $N$ s.t.

$$|\ln(1.5) - P_N(1.5)| < 10^{-5}.$$

Note: From the Alternating Series Thm $\Longrightarrow$

$$|\ln x - P_N(x)| \le \left| \frac{(-1)^{N+2}}{N + 1} (x - 1)^{N+1} \right|.$$

So, the stopping criterion should be

$$|a_{N+1}| = \left| \frac{(-1)^{N+2}}{N + 1} (x - 1)^{N+1} \right| < TOL,$$

where $TOL$ is the prescribed tolerance.

(52)

Algorithm for Example 1
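A minimal Python sketch of such an algorithm follows. It applies the stopping criterion $|a_{N+1}| < TOL$ stated above; the function name `min_terms_ln` and the safeguard parameter `max_terms` are assumptions made for this illustration, and the details may differ from the textbook's Algorithm.

```python
import math

def min_terms_ln(x, tol, max_terms=1000):
    """Return (N, P_N(x)) for the smallest N with |a_{N+1}| < tol.

    P_N is the N-th Taylor polynomial of ln x about x0 = 1.
    """
    y = x - 1.0
    total = 0.0
    power = 1.0      # holds (x - 1)^n
    sign = 1.0       # holds (-1)^(n+1)
    for n in range(1, max_terms + 1):
        power *= y
        total += sign * power / n
        sign = -sign
        next_term = abs(power * y) / (n + 1)   # |a_{n+1}|
        if next_term < tol:
            return n, total
    raise RuntimeError("tolerance not reached within max_terms")

N, approx = min_terms_ln(1.5, 1e-5)
print(N, approx, abs(math.log(1.5) - approx))
```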

(53)

Stability of Algorithms

Definition

An algorithm is called stable if it satisfies the property that small changes in the initial data produce correspondingly small changes in the final results.

Otherwise, the algorithm is called unstable, i.e. small changes in the initial data produce large changes in the final results.

(54)

Growth of Errors

Def 1.17 (Linear and exponential growth of errors)

$E_0 > 0$: the magnitude of error at some stage in the calculations; $E_n$: the magnitude of error after $n$ subsequent operations.

1 The growth of error is called linear if $E_n \approx C n E_0$, where the constant $C > 0$ is independent of $n$.

2 The growth of error is called exponential if $E_n \approx C^n E_0$ for some $C > 1$.

(55)

Example of an Unstable Algorithm (1/2)

An Unstable Procedure

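The sequence $\{p_n\}_{n=0}^{\infty}$ defined by

$$p_n = c_1 \left( \tfrac{1}{3} \right)^n + c_2\, 3^n$$

is the general solution to the recursive equation

$$p_n = \tfrac{10}{3}\, p_{n-1} - p_{n-2}, \quad n = 2, 3, \ldots.$$

$p_0 = 1$, $p_1 = \frac{1}{3}$ $\Rightarrow$ $c_1 = 1$, $c_2 = 0$. The solution is $p_n = \left( \frac{1}{3} \right)^n$.

Use 5-digit rounding $\Rightarrow$ $\hat{p}_0 = 1.0000$, $\hat{p}_1 = 0.33333$ and hence $\hat{c}_1 = 1.0000$, $\hat{c}_2 = -0.12500 \times 10^{-5}$. The solution is

$$\hat{p}_n = 1.0000 \left( \tfrac{1}{3} \right)^n - 0.12500 \times 10^{-5}\, (3^n).$$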

(56)

Example of an Unstable Algorithm (2/2)

The absolute error in computing $\hat{p}_n$ is

$$AE(\hat{p}_n) = p_n - \hat{p}_n = 0.12500 \times 10^{-5}\, (3^n).$$

$\Longrightarrow$ An unstable procedure with exponential growth of errors!
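The growth can be observed by running the recursion from the exact and from the rounded starting values. In this minimal Python sketch the recursion itself is carried out in exact rational arithmetic, an assumption made so that the only error is the rounding of $p_1$ to $0.33333$; the function name `recurrence` is chosen for illustration.

```python
from fractions import Fraction

def recurrence(p0, p1, n_max):
    """Terms p_0..p_{n_max} of p_n = (10/3) p_{n-1} - p_{n-2}, computed exactly."""
    seq = [Fraction(p0), Fraction(p1)]
    for _ in range(2, n_max + 1):
        seq.append(Fraction(10, 3) * seq[-1] - seq[-2])
    return seq

exact = recurrence(1, Fraction(1, 3), 12)
perturbed = recurrence(1, Fraction(33333, 100000), 12)   # p1 rounded to 5 digits

for n in (2, 6, 12):
    err = exact[n] - perturbed[n]
    print(n, float(exact[n]), float(err), 1.25e-6 * 3 ** n)
```

The printed error tracks $0.125 \times 10^{-5} \cdot 3^n$ closely, so it overtakes the true value $(1/3)^n$ after only a few steps.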

(57)

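Rates of Convergence

Def 1.18

Suppose that $\{\alpha_n\}_{n=1}^{\infty}$ and $\{\beta_n\}_{n=1}^{\infty}$ are two sequences with $\lim_{n \to \infty} \alpha_n = \alpha$ and $\lim_{n \to \infty} \beta_n = 0$. If $\exists\, K > 0$ and $n_0 \in \mathbb{N}$ s.t.

$$|\alpha_n - \alpha| \le K |\beta_n| \quad \forall\, n \ge n_0,$$

then we say that $\{\alpha_n\}_{n=1}^{\infty}$ conv. to $\alpha$ with rate (or order) of convergence $O(\beta_n)$, and write

$$\alpha_n = \alpha + O(\beta_n) \quad (\text{as } n \to \infty).$$

Note: the seq. $\{\alpha_n\}_{n=1}^{\infty}$ is often generated by some iterative method, and it is often compared with $\beta_n = \frac{1}{n^p}$ for $p > 0$.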

(58)

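For $n \ge 1$, consider two sequences of real numbers

$$\alpha_n = \frac{n + 1}{n^2} \quad \text{and} \quad \hat{\alpha}_n = \frac{n + 3}{n^3}.$$

Determine their rates of convergence.

Sol: Since

$$|\alpha_n - 0| = \frac{n + 1}{n^2} \le \frac{n + n}{n^2} = 2 \cdot \frac{1}{n} \equiv 2 \beta_n,$$

$$|\hat{\alpha}_n - 0| = \frac{n + 3}{n^3} \le \frac{n + 3n}{n^3} = 4 \cdot \frac{1}{n^2} \equiv 4 \hat{\beta}_n$$

for all $n \ge 1$, it follows that

$$\alpha_n = 0 + O\left( \frac{1}{n} \right), \quad \hat{\alpha}_n = 0 + O\left( \frac{1}{n^2} \right).$$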
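The two inequalities can be spot-checked in exact rational arithmetic. This is a minimal Python sketch, not a proof; the function names and the tested range $1 \le n \le 1000$ are assumptions for illustration.

```python
from fractions import Fraction

def alpha(n):
    return Fraction(n + 1, n * n)

def alpha_hat(n):
    return Fraction(n + 3, n ** 3)

# Check |alpha_n| <= 2 * (1/n) and |alpha_hat_n| <= 4 * (1/n^2)
bounds_hold = all(
    alpha(n) <= 2 * Fraction(1, n) and alpha_hat(n) <= 4 * Fraction(1, n * n)
    for n in range(1, 1001)
)
print(bounds_hold)

for n in (10, 100, 1000):
    print(n, float(alpha(n)), float(alpha_hat(n)))
```

The second sequence shrinks roughly like $1/n^2$, so it approaches $0$ much faster than the first.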

(59)

Big-Oh Notation

Def 1.19

Suppose that $\lim_{h \to 0} F(h) = L$ and $\lim_{h \to 0} G(h) = 0$. If $\exists\, K > 0$ and $\delta > 0$ s.t.

$$|F(h) - L| \le K |G(h)| \quad \text{for } 0 < |h| < \delta,$$

then we write

$$F(h) = L + O(G(h)) \quad (\text{as } h \to 0).$$

Note: In practice, we often choose $G(h) = h^p$ for $p > 0$, and the largest value of $p$ is expected.

(60)

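Show that $\cos h + \frac{1}{2} h^2 = 1 + O(h^4)$.

pf: From Taylor’s Thm, $\exists\, \xi(h)$ between $0$ and $h$ s.t.

$$\cos h = 1 - \frac{1}{2} h^2 + \frac{\cos \xi(h)}{24} h^4 \quad \text{for } h \neq 0.$$

Hence, we see that

$$\left| \left( \cos h + \frac{1}{2} h^2 \right) - 1 \right| = \frac{|\cos \xi(h)|}{24} |h^4| \le \frac{1}{24} |h^4| \quad \text{for } h \neq 0,$$

which gives the desired result by Def. 1.19.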
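The constant $K = \frac{1}{24}$ can be observed numerically. In this minimal Python sketch the helper name `ratio` and the sample values of $h$ are assumptions; very small $h$ is avoided on purpose, because the subtraction in the numerator then suffers from the cancellation discussed in Section 1.2.

```python
import math

def ratio(h):
    """|cos h + h^2/2 - 1| / h^4, which should not exceed 1/24."""
    return abs(math.cos(h) + 0.5 * h * h - 1.0) / h ** 4

for h in (1.0, 0.5, 0.1):
    print(h, ratio(h), 1.0 / 24.0)
```

As $h$ decreases the ratio moves up toward $1/24$, which indicates that $p = 4$ is the largest usable exponent here.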

(61)