Project of Numerical Analysis

February 18, 2014

Consider the Dirichlet boundary-value problem:

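\[
-\Delta u \equiv -u_{xx} - u_{yy} = 2\pi^2 \sin(\pi x)\sin(\pi y) \quad \text{for } (x, y) \in \Omega, \qquad u(x, y) = 0 \quad \text{for } (x, y) \in \partial\Omega, \tag{1}
\]

for $\Omega := \{(x, y) \mid 0 < x, y < 1\} \subseteq \mathbb{R}^2$ with boundary $\partial\Omega$. The problem has the exact solution

\[ u(x, y) = \sin(\pi x)\sin(\pi y), \]

which is shown in Figure 1.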

Figure 1: Exact solution.
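As a quick check added for the reader (it is not part of the original statement), substituting the stated solution $u(x, y) = \sin(\pi x)\sin(\pi y)$ into (1) confirms that it satisfies the differential equation:

```latex
\begin{align*}
u_{xx} &= -\pi^2 \sin(\pi x)\sin(\pi y), \qquad
u_{yy} = -\pi^2 \sin(\pi x)\sin(\pi y), \\
-u_{xx} - u_{yy} &= 2\pi^2 \sin(\pi x)\sin(\pi y).
\end{align*}
```

The boundary condition holds as well, since $\sin(0) = \sin(\pi) = 0$ makes $u$ vanish whenever $x$ or $y$ equals 0 or 1.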

1 Central difference discretization

To solve (1) by a finite difference method, one replaces the differential operator by a difference operator. Let

\[ \Omega_h := \{(x_i, y_j) \mid i, j = 1, \ldots, n\}, \]

\[ \partial\Omega_h := \{(x_i, 0), (x_i, 1), (0, y_j), (1, y_j) \mid i, j = 0, 1, \ldots, n+1\}, \]

where $x_i = ih$, $y_j = jh$ for $i, j = 0, 1, \ldots, n+1$, $h := \frac{1}{n+1}$, and $n \ge 1$ is an integer.

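By Taylor's theorem, we have

\[ u(x_i + h) = u(x_i) + u'(x_i)h + \frac{h^2}{2}u''(x_i) + \frac{h^3}{6}u'''(x_i) + \frac{h^4}{24}u^{(4)}(\xi_1), \]

\[ u(x_i - h) = u(x_i) - u'(x_i)h + \frac{h^2}{2}u''(x_i) - \frac{h^3}{6}u'''(x_i) + \frac{h^4}{24}u^{(4)}(\xi_2), \]

where $\xi_1$ is between $x_i$ and $x_i + h$, and $\xi_2$ is between $x_i$ and $x_i - h$. Adding the two expansions (and assuming $u^{(4)}$ is continuous, so that the two remainder terms can be combined) gives

\[
u''(x_i) = \frac{u(x_i + h) - 2u(x_i) + u(x_i - h)}{h^2} - \frac{h^2}{12}u^{(4)}(\xi)
= \frac{u(x_{i+1}) - 2u(x_i) + u(x_{i-1})}{h^2} - \frac{h^2}{12}u^{(4)}(\xi),
\]

where $\xi$ is between $x_i - h$ and $x_i + h$. Similarly,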

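\[
\frac{\partial^2 u}{\partial x^2}(x_i, y_j) = \frac{u(x_{i+1}, y_j) - 2u(x_i, y_j) + u(x_{i-1}, y_j)}{h^2} - \frac{h^2}{12}\frac{\partial^4 u}{\partial x^4}(\xi_i, y_j),
\]

\[
\frac{\partial^2 u}{\partial y^2}(x_i, y_j) = \frac{u(x_i, y_{j+1}) - 2u(x_i, y_j) + u(x_i, y_{j-1})}{h^2} - \frac{h^2}{12}\frac{\partial^4 u}{\partial y^4}(x_i, \eta_j),
\]

where $\xi_i \in (x_{i-1}, x_{i+1})$ and $\eta_j \in (y_{j-1}, y_{j+1})$. It follows that

\[
\frac{\partial^2 u}{\partial x^2}(x_i, y_j) + \frac{\partial^2 u}{\partial y^2}(x_i, y_j)
= \frac{u(x_i, y_{j-1}) + u(x_{i-1}, y_j) - 4u(x_i, y_j) + u(x_{i+1}, y_j) + u(x_i, y_{j+1})}{h^2}
- \frac{h^2}{12}\left[ \frac{\partial^4 u}{\partial x^4}(\xi_i, y_j) + \frac{\partial^4 u}{\partial y^4}(x_i, \eta_j) \right].
\]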

Let $u_{i,j}$ denote an approximation to the value of $u$ at the grid point $(x_i, y_j)$ for $i, j = 0, 1, \ldots, n+1$. Then

\[
-u_{xx}(x_i, y_j) - u_{yy}(x_i, y_j) \approx \frac{-u_{i,j-1} - u_{i-1,j} + 4u_{i,j} - u_{i+1,j} - u_{i,j+1}}{h^2}
\]

with an error of $O(h^2)$, and the equation

\[
-u_{xx}(x_i, y_j) - u_{yy}(x_i, y_j) = 2\pi^2 \sin(\pi x_i)\sin(\pi y_j) \equiv f_{i,j}
\]

can be replaced by the equation

\[
\frac{-u_{i,j-1} - u_{i-1,j} + 4u_{i,j} - u_{i+1,j} - u_{i,j+1}}{h^2} = f_{i,j} \tag{2}
\]

for $i, j = 1, \ldots, n$.
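The $O(h^2)$ error of the five-point formula can be observed numerically. The following is a minimal sketch in Python (not MATLAB), using only the standard library; the function names and the sample point (0.3, 0.6) are illustrative choices, not part of the project statement. It applies the stencil to the exact solution and compares the result with the right-hand side for three step sizes.

```python
import math

def u_exact(x, y):
    """Exact solution u(x, y) = sin(pi x) sin(pi y)."""
    return math.sin(math.pi * x) * math.sin(math.pi * y)

def f(x, y):
    """Right-hand side 2 pi^2 sin(pi x) sin(pi y)."""
    return 2.0 * math.pi**2 * math.sin(math.pi * x) * math.sin(math.pi * y)

def five_point(u, x, y, h):
    """Five-point approximation of -(u_xx + u_yy) at (x, y)."""
    return (4.0 * u(x, y) - u(x - h, y) - u(x + h, y)
            - u(x, y - h) - u(x, y + h)) / h**2

def stencil_error(h, x=0.3, y=0.6):
    """Absolute truncation error of the stencil at a sample point."""
    return abs(five_point(u_exact, x, y, h) - f(x, y))

steps = (0.1, 0.05, 0.025)
errors = [stencil_error(h) for h in steps]
# Halving h should divide the error by about 4 for a second-order formula.
ratios = [errors[m] / errors[m + 1] for m in range(len(errors) - 1)]
print(errors)
print(ratios)
```

Each printed ratio should be close to 4, which is what a second-order formula predicts when $h$ is halved.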

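For $j = 1$, we have

\begin{align}
-u_{1,0} - u_{0,1} + 4u_{1,1} - u_{2,1} - u_{1,2} &= h^2 f_{1,1}, \tag{3a} \\
-u_{2,0} - u_{1,1} + 4u_{2,1} - u_{3,1} - u_{2,2} &= h^2 f_{2,1}, \tag{3b} \\
&\;\;\vdots \notag \\
-u_{n-1,0} - u_{n-2,1} + 4u_{n-1,1} - u_{n,1} - u_{n-1,2} &= h^2 f_{n-1,1}, \tag{3c} \\
-u_{n,0} - u_{n-1,1} + 4u_{n,1} - u_{n+1,1} - u_{n,2} &= h^2 f_{n,1}. \tag{3d}
\end{align}

By the boundary condition, it holds that

\begin{align}
u_{1,0} = u_{2,0} = \cdots = u_{n,0} &= 0, \tag{4a} \\
u_{0,1} = u_{n+1,1} &= 0. \tag{4b}
\end{align}

Substituting (4) into (3), we get

\begin{align}
4u_{1,1} - u_{2,1} - u_{1,2} &= h^2 f_{1,1}, \tag{5a} \\
-u_{1,1} + 4u_{2,1} - u_{3,1} - u_{2,2} &= h^2 f_{2,1}, \tag{5b} \\
&\;\;\vdots \notag \\
-u_{n-2,1} + 4u_{n-1,1} - u_{n,1} - u_{n-1,2} &= h^2 f_{n-1,1}, \tag{5c} \\
-u_{n-1,1} + 4u_{n,1} - u_{n,2} &= h^2 f_{n,1}. \tag{5d}
\end{align}

Let, for $j = 1, \ldots, n$,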

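\[
u_{:,j} = \begin{bmatrix} u_{1,j} \\ u_{2,j} \\ \vdots \\ u_{n,j} \end{bmatrix}, \quad
f_{:,j} = \begin{bmatrix} f_{1,j} \\ f_{2,j} \\ \vdots \\ f_{n,j} \end{bmatrix}, \quad
A_1 = \begin{bmatrix} 4 & -1 & & \\ -1 & \ddots & \ddots & \\ & \ddots & \ddots & -1 \\ & & -1 & 4 \end{bmatrix} \in \mathbb{R}^{n \times n}.
\]

Then (5) can be rewritten in the matrix form

\[
\begin{bmatrix} A_1 & -I_n \end{bmatrix} \begin{bmatrix} u_{:,1} \\ u_{:,2} \end{bmatrix} = h^2 f_{:,1}.
\]

For $j = 2, \ldots, n-1$, using $u_{0,j} = u_{n+1,j} = 0$, we have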

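\begin{align*}
-u_{1,j-1} + 4u_{1,j} - u_{2,j} - u_{1,j+1} &= h^2 f_{1,j}, \\
-u_{2,j-1} - u_{1,j} + 4u_{2,j} - u_{3,j} - u_{2,j+1} &= h^2 f_{2,j}, \\
&\;\;\vdots \\
-u_{n-1,j-1} - u_{n-2,j} + 4u_{n-1,j} - u_{n,j} - u_{n-1,j+1} &= h^2 f_{n-1,j}, \\
-u_{n,j-1} - u_{n-1,j} + 4u_{n,j} - u_{n,j+1} &= h^2 f_{n,j}.
\end{align*}

These equations can be written in the matrix form

\[
\begin{bmatrix} -I_n & A_1 & -I_n \end{bmatrix} \begin{bmatrix} u_{:,j-1} \\ u_{:,j} \\ u_{:,j+1} \end{bmatrix} = h^2 f_{:,j}.
\]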

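For $j = n$, using $u_{1,n+1} = u_{2,n+1} = \cdots = u_{n,n+1} = 0$, we have

\begin{align*}
-u_{1,n-1} + 4u_{1,n} - u_{2,n} &= h^2 f_{1,n}, \\
-u_{2,n-1} - u_{1,n} + 4u_{2,n} - u_{3,n} &= h^2 f_{2,n}, \\
&\;\;\vdots \\
-u_{n-1,n-1} - u_{n-2,n} + 4u_{n-1,n} - u_{n,n} &= h^2 f_{n-1,n}, \\
-u_{n,n-1} - u_{n-1,n} + 4u_{n,n} &= h^2 f_{n,n}.
\end{align*}

These equations can be written in the matrix form

\[
\begin{bmatrix} -I_n & A_1 \end{bmatrix} \begin{bmatrix} u_{:,n-1} \\ u_{:,n} \end{bmatrix} = h^2 f_{:,n}.
\]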

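Therefore, (2) with the boundary conditions is equivalent to the linear system

\[ Au = h^2 f \tag{6} \]

with

\[
A = \begin{bmatrix} A_1 & -I_n & & \\ -I_n & A_1 & \ddots & \\ & \ddots & \ddots & -I_n \\ & & -I_n & A_1 \end{bmatrix} \in \mathbb{R}^{n^2 \times n^2}, \tag{7}
\]

and

\[
A_1 = \begin{bmatrix} 4 & -1 & & \\ -1 & \ddots & \ddots & \\ & \ddots & \ddots & -1 \\ & & -1 & 4 \end{bmatrix} \in \mathbb{R}^{n \times n}, \quad
u = \begin{bmatrix} u_{:,1} \\ u_{:,2} \\ \vdots \\ u_{:,n} \end{bmatrix}, \quad
f = \begin{bmatrix} f_{:,1} \\ f_{:,2} \\ \vdots \\ f_{:,n} \end{bmatrix}.
\]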

2 Project for direct method

(a) Use Algorithms 1, 2 and 3 (Gaussian elimination) to reduce A in (7) to an upper triangular matrix and modify the entries of the right-hand side b = h²f accordingly. Compare and plot the CPU times for reducing A to upper triangular form for various n using these three algorithms. (Use the “tic” and “toc” functions in MATLAB to estimate the CPU times.)

Require: Nonsingular matrix A and right-hand side vector b.

Ensure: This algorithm implements the Gaussian elimination procedure to reduce A to upper triangular form and modify the entries of b accordingly.

for k = 1, . . . , n − 1 do
    Let p be the smallest integer with k ≤ p ≤ n and a_pk ≠ 0.
    If no such p exists, then stop.
    If p ≠ k, then perform (E_p) ↔ (E_k).
    for i = k + 1, . . . , n do
        Compute t = A(i, k)/A(k, k);
        Set A(i, k) = 0;
        Update b(i) = b(i) − t × b(k);
        for j = k + 1, . . . , n do
            Update A(i, j) = A(i, j) − t × A(k, j);
        end for
    end for
end for

Algorithm 1: Gaussian elimination
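As an illustration of Algorithm 1, the following Python sketch follows the same steps with 0-based indices. It is a sketch under stated assumptions rather than the required MATLAB implementation: the function name is hypothetical, the matrix is a dense list of lists, the "stop" branch is expressed as an exception, and the 3 x 3 example is chosen only because its first pivot is zero and forces a row interchange.

```python
def gaussian_elimination(A, b):
    """Reduce A to upper triangular form in place and update b accordingly."""
    n = len(A)
    for k in range(n - 1):
        # Smallest row index p >= k with a nonzero entry in column k.
        p = next((r for r in range(k, n) if A[r][k] != 0.0), None)
        if p is None:
            raise ValueError("no nonzero pivot in column %d" % k)
        if p != k:
            # Row interchange (E_p) <-> (E_k), applied to A and b.
            A[p], A[k] = A[k], A[p]
            b[p], b[k] = b[k], b[p]
        for i in range(k + 1, n):
            t = A[i][k] / A[k][k]
            A[i][k] = 0.0
            b[i] -= t * b[k]
            for j in range(k + 1, n):
                A[i][j] -= t * A[k][j]
    return A, b

A = [[0.0, 2.0, 1.0],
     [1.0, 1.0, 1.0],
     [2.0, 1.0, 0.0]]
b = [3.0, 3.0, 3.0]
gaussian_elimination(A, b)
print(A)  # [[1.0, 1.0, 1.0], [0.0, 2.0, 1.0], [0.0, 0.0, -1.5]]
print(b)  # [3.0, 3.0, -1.5]
```

The reduced system has the same solution $[1, 1, 1]^T$ as the original one.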

Ensure: This algorithm implements the Gaussian elimination procedure to reduce A to upper triangular form and modify the entries of b accordingly.

for k = 1, . . . , n − 1 do
    for i = k + 1, . . . , n do
        Compute t = A(i, k)/A(k, k);
        Set A(i, k) = 0;
        Update b(i) = b(i) − t × b(k);
        Update A(i, k + 1 : n) = A(i, k + 1 : n) − t × A(k, k + 1 : n);
    end for
end for

Algorithm 2: Vector version of Gaussian elimination

Ensure: This algorithm implements the Gaussian elimination procedure to reduce A to upper triangular form and modify the entries of b accordingly.

for k = 1, . . . , n − 1 do
    Compute t = A(k + 1 : n, k)/A(k, k);
    Set A(k + 1 : n, k) = 0;
    Update A(k + 1 : n, k + 1 : n) = A(k + 1 : n, k + 1 : n) − t × A(k, k + 1 : n);
    Update b(k + 1 : n) = b(k + 1 : n) − b(k) × t.
end for

Algorithm 3: Matrix version of Gaussian elimination

(b) Use backward substitution to solve the upper triangular linear system obtained in (a). Plot the CPU times for solving this linear system with various n.

(c) Compare the CPU times for using left matrix divide “A \ b” in MATLAB with those in (a) and (b).

(d) Store the matrix A in sparse format. Plot the CPU times for generating the matrix A and solving the associated linear systems by left matrix divide “A \ b” with various n.
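Two pieces of the tasks above can be sketched briefly, again in Python as an illustration rather than the required MATLAB code. The first function is backward substitution for (b). The second lists the nonzero entries of A in (7) as (row, column, value) triplets with 1-based indices, which is the form accepted by MATLAB's `sparse(i, j, v, m, n)` constructor for (d). The function names are hypothetical.

```python
def back_substitution(U, c):
    """Solve U x = c for an upper triangular U with nonzero diagonal."""
    n = len(U)
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(U[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (c[i] - s) / U[i][i]
    return x

def poisson_triplets(n):
    """Nonzeros of A in (7) as 1-based (rows, cols, vals) lists."""
    rows, cols, vals = [], [], []

    def add(r, c, v):
        rows.append(r)
        cols.append(c)
        vals.append(v)

    for j in range(n):
        for i in range(n):
            r = j * n + i + 1            # 1-based index of unknown u_{i+1,j+1}
            add(r, r, 4.0)
            if i > 0:
                add(r, r - 1, -1.0)
            if i < n - 1:
                add(r, r + 1, -1.0)
            if j > 0:
                add(r, r - n, -1.0)
            if j < n - 1:
                add(r, r + n, -1.0)
    return rows, cols, vals

U = [[1.0, 1.0, 1.0],
     [0.0, 2.0, 1.0],
     [0.0, 0.0, -1.5]]
c = [3.0, 3.0, -1.5]
x = back_substitution(U, c)
print(x)  # [1.0, 1.0, 1.0]

rows, cols, vals = poisson_triplets(3)
print(len(vals))  # 33 nonzeros, i.e. 5*n^2 - 4*n for n = 3
```

The triplet count grows like $5n^2$, whereas a dense copy of A stores $n^4$ entries, which is the motivation for the sparse format in (d).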

3 Project for iterative method

(e) Use the Jacobi method to solve the linear system (6).

Given an initial vector $x^{(0)}$, rewrite the linear system as

\begin{align*}
a_{11}x_1^{(k)} + a_{12}x_2^{(k-1)} + a_{13}x_3^{(k-1)} + \cdots + a_{1n}x_n^{(k-1)} &= b_1, \\
a_{21}x_1^{(k-1)} + a_{22}x_2^{(k)} + a_{23}x_3^{(k-1)} + \cdots + a_{2n}x_n^{(k-1)} &= b_2, \\
&\;\;\vdots \\
a_{n1}x_1^{(k-1)} + a_{n2}x_2^{(k-1)} + a_{n3}x_3^{(k-1)} + \cdots + a_{nn}x_n^{(k)} &= b_n.
\end{align*}

If we decompose the coefficient matrix $A$ as

\[ A = L + D + U, \]

where $D$ is the diagonal part, $L$ is the strictly lower triangular part, and $U$ is the strictly upper triangular part of $A$, then we obtain the Jacobi iteration

\[ x^{(k)} = -D^{-1}(L + U)x^{(k-1)} + D^{-1}b. \]

• Use Algorithm 4 with initial vector $x^{(0)} = [1, \ldots, 1]^T$ to solve the linear system (6). Plot the CPU times and iteration numbers $k$ for solving this linear system with various $n$.

Require: Given x⁽⁰⁾, tolerance TOL, maximum number of iterations M.

Ensure: The solution x.

Set k = 1.
Compute x = −D⁻¹(L + U)x⁽⁰⁾ + D⁻¹b.
while k ≤ M and ‖x − x⁽⁰⁾‖₂ ≥ TOL do
    Set k = k + 1, x⁽⁰⁾ = x;
    Compute x = −D⁻¹(L + U)x⁽⁰⁾ + D⁻¹b;
end while

Algorithm 4: Jacobi method
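A minimal Python sketch of Algorithm 4 is given below, assuming a dense matrix stored as a list of lists. The function name, the 3 x 3 test system and the tolerance are illustrative choices; the update is written component by component, which is equivalent to $x = -D^{-1}(L + U)x^{(0)} + D^{-1}b$.

```python
import math

def jacobi(A, b, x0, tol, max_iter):
    """Jacobi iteration in the shape of Algorithm 4; returns (x, k)."""
    n = len(A)

    def sweep(x_old):
        # x_i = (b_i - sum_{j != i} a_ij * x_old_j) / a_ii
        return [(b[i] - sum(A[i][j] * x_old[j] for j in range(n) if j != i))
                / A[i][i] for i in range(n)]

    def distance(u, v):
        # Euclidean norm of u - v
        return math.sqrt(sum((p - q) ** 2 for p, q in zip(u, v)))

    k = 1
    x_prev = list(x0)
    x = sweep(x_prev)
    while k <= max_iter and distance(x, x_prev) >= tol:
        k += 1
        x_prev = x
        x = sweep(x_prev)
    return x, k

A = [[4.0, -1.0, 0.0],
     [-1.0, 4.0, -1.0],
     [0.0, -1.0, 4.0]]
b = [2.0, 4.0, 10.0]          # chosen so that the exact solution is [1, 2, 3]
x, k = jacobi(A, b, [1.0, 1.0, 1.0], 1e-10, 500)
print(k, x)
```

The returned k counts the sweeps performed, so it can be plotted directly as the iteration number.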

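(f) Use the Gauss-Seidel method to solve the linear system (6).

When $x_i^{(k)}$ is computed, the components $x_1^{(k)}, \ldots, x_{i-1}^{(k)}$ are already available, so they can be used in place of the old values:

\begin{align*}
a_{11}x_1^{(k)} + a_{12}x_2^{(k-1)} + a_{13}x_3^{(k-1)} + \cdots + a_{1n}x_n^{(k-1)} &= b_1, \\
a_{21}x_1^{(k)} + a_{22}x_2^{(k)} + a_{23}x_3^{(k-1)} + \cdots + a_{2n}x_n^{(k-1)} &= b_2, \\
a_{31}x_1^{(k)} + a_{32}x_2^{(k)} + a_{33}x_3^{(k)} + \cdots + a_{3n}x_n^{(k-1)} &= b_3, \\
&\;\;\vdots \\
a_{n1}x_1^{(k)} + a_{n2}x_2^{(k)} + a_{n3}x_3^{(k)} + \cdots + a_{nn}x_n^{(k)} &= b_n.
\end{align*}

This improvement leads to the Gauss-Seidel method, whose iteration is defined by

\[ x^{(k)} = -(D + L)^{-1}Ux^{(k-1)} + (D + L)^{-1}b. \]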

Require: Given x⁽⁰⁾, tolerance TOL, maximum number of iterations M.

Ensure: The solution x.

Set k = 1.
Compute x = −(D + L)⁻¹U x⁽⁰⁾ + (D + L)⁻¹b.
while k ≤ M and ‖x − x⁽⁰⁾‖₂ ≥ TOL do
    Set k = k + 1, x⁽⁰⁾ = x;
    Compute x = −(D + L)⁻¹U x⁽⁰⁾ + (D + L)⁻¹b;
end while

Algorithm 5: Gauss-Seidel method

1. Use MATLAB functions “triu(A,1)” and “tril(A,-1)” to extract the strictly upper and lower triangular parts of A, respectively.

2. Use Algorithm 5 with initial vector x⁽⁰⁾ = [1, · · · , 1]ᵀ to solve the linear system (6). Plot the CPU times and iteration numbers k for solving this linear system with various n.

3. Compare the results produced by Jacobi and Gauss-Seidel methods.
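The Gauss-Seidel update can be sketched in Python in the same shape as Algorithm 5. This is an illustrative sketch with a hypothetical function name: instead of forming $(D + L)^{-1}$ explicitly, each sweep overwrites the components in order, which solves the lower triangular system by forward substitution and gives the same iterates.

```python
import math

def gauss_seidel(A, b, x0, tol, max_iter):
    """Gauss-Seidel iteration in the shape of Algorithm 5; returns (x, k)."""
    n = len(A)

    def sweep(x_old):
        x_new = list(x_old)
        for i in range(n):
            # Entries j < i of x_new are already updated; entries j > i are old.
            s = sum(A[i][j] * x_new[j] for j in range(n) if j != i)
            x_new[i] = (b[i] - s) / A[i][i]
        return x_new

    def distance(u, v):
        # Euclidean norm of u - v
        return math.sqrt(sum((p - q) ** 2 for p, q in zip(u, v)))

    k = 1
    x_prev = list(x0)
    x = sweep(x_prev)
    while k <= max_iter and distance(x, x_prev) >= tol:
        k += 1
        x_prev = x
        x = sweep(x_prev)
    return x, k

A = [[4.0, -1.0, 0.0],
     [-1.0, 4.0, -1.0],
     [0.0, -1.0, 4.0]]
b = [2.0, 4.0, 10.0]          # chosen so that the exact solution is [1, 2, 3]
x, k = gauss_seidel(A, b, [1.0, 1.0, 1.0], 1e-10, 500)
print(k, x)
```

Running a Jacobi implementation on the same system with the same tolerance gives a direct comparison of the iteration numbers.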

(g) Use the SSOR method to solve the linear system (6).

Recall the Gauss-Seidel equations

\begin{align*}
a_{11}x_1^{(k)} + a_{12}x_2^{(k-1)} + a_{13}x_3^{(k-1)} + \cdots + a_{1n}x_n^{(k-1)} &= b_1, \\
a_{21}x_1^{(k)} + a_{22}x_2^{(k)} + a_{23}x_3^{(k-1)} + \cdots + a_{2n}x_n^{(k-1)} &= b_2, \\
a_{31}x_1^{(k)} + a_{32}x_2^{(k)} + a_{33}x_3^{(k)} + \cdots + a_{3n}x_n^{(k-1)} &= b_3, \\
&\;\;\vdots \\
a_{n1}x_1^{(k)} + a_{n2}x_2^{(k)} + a_{n3}x_3^{(k)} + \cdots + a_{nn}x_n^{(k)} &= b_n.
\end{align*}

Let the approximate solution $x^{(k,i)}$ produced by the Gauss-Seidel method be defined by

\[ x^{(k,i)} = \left[ x_1^{(k)}, \ldots, x_{i-1}^{(k)}, x_i^{(k-1)}, \ldots, x_n^{(k-1)} \right]^T \]

and let

\[ r_i^{(k)} = \left[ r_{1i}^{(k)}, r_{2i}^{(k)}, \ldots, r_{ni}^{(k)} \right]^T = b - Ax^{(k,i)} \]

be the corresponding residual vector. Then the $i$th component of $r_i^{(k)}$ is

\[ r_{ii}^{(k)} = b_i - \sum_{j=1}^{i-1} a_{ij}x_j^{(k)} - \sum_{j=i+1}^{n} a_{ij}x_j^{(k-1)} - a_{ii}x_i^{(k-1)}, \]

so

\[ a_{ii}x_i^{(k-1)} + r_{ii}^{(k)} = b_i - \sum_{j=1}^{i-1} a_{ij}x_j^{(k)} - \sum_{j=i+1}^{n} a_{ij}x_j^{(k-1)} = a_{ii}x_i^{(k)}. \]

Consequently, the Gauss-Seidel method can be characterized as choosing $x_i^{(k)}$ to satisfy

\[ x_i^{(k)} = x_i^{(k-1)} + \frac{r_{ii}^{(k)}}{a_{ii}}. \]

The relaxation method modifies the Gauss-Seidel procedure to

\begin{align}
x_i^{(k)} &= x_i^{(k-1)} + \omega\frac{r_{ii}^{(k)}}{a_{ii}} \notag \\
&= x_i^{(k-1)} + \frac{\omega}{a_{ii}}\left[ b_i - \sum_{j=1}^{i-1} a_{ij}x_j^{(k)} - \sum_{j=i+1}^{n} a_{ij}x_j^{(k-1)} - a_{ii}x_i^{(k-1)} \right] \notag \\
&= (1 - \omega)x_i^{(k-1)} + \frac{\omega}{a_{ii}}\left[ b_i - \sum_{j=1}^{i-1} a_{ij}x_j^{(k)} - \sum_{j=i+1}^{n} a_{ij}x_j^{(k-1)} \right] \tag{8}
\end{align}

for certain choices of positive $\omega$. These methods are called

$\omega < 1$: under-relaxation,
$\omega = 1$: Gauss-Seidel method,
$\omega > 1$: over-relaxation.

Over-relaxation methods are called SOR (successive over-relaxation). To determine the matrix of the SOR method, we rewrite (8) as

\[ a_{ii}x_i^{(k)} + \omega\sum_{j=1}^{i-1} a_{ij}x_j^{(k)} = (1 - \omega)a_{ii}x_i^{(k-1)} - \omega\sum_{j=i+1}^{n} a_{ij}x_j^{(k-1)} + \omega b_i, \]

so that if $A = L + D + U$, then we have

\[ (D + \omega L)x^{(k)} = [(1 - \omega)D - \omega U]x^{(k-1)} + \omega b. \]

Theorem 1 (Ostrowski-Reich) If A is symmetric positive definite and the relaxation parameter ω satisfies 0 < ω < 2, then the SOR iteration converges for any initial vector x⁽⁰⁾.
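The component form (8) translates almost line by line into code. The following Python sketch is an illustration with a hypothetical function name and a small symmetric positive definite test matrix; it performs SOR sweeps until two successive iterates differ by less than a tolerance. With ω = 1 it reduces to the Gauss-Seidel method.

```python
import math

def sor(A, b, x0, omega, tol, max_iter):
    """SOR sweeps based on the component form (8); returns (x, sweeps)."""
    n = len(A)
    x = list(x0)
    for k in range(1, max_iter + 1):
        x_old = list(x)
        for i in range(n):
            # x[j] is already updated for j < i and still old for j > i.
            s = sum(A[i][j] * x[j] for j in range(n) if j != i)
            x[i] = (1.0 - omega) * x_old[i] + omega * (b[i] - s) / A[i][i]
        change = math.sqrt(sum((p - q) ** 2 for p, q in zip(x, x_old)))
        if change < tol:
            return x, k
    return x, max_iter

A = [[4.0, -1.0, 0.0],
     [-1.0, 4.0, -1.0],
     [0.0, -1.0, 4.0]]
b = [2.0, 4.0, 10.0]          # chosen so that the exact solution is [1, 2, 3]
results = {}
for omega in (0.8, 1.0, 1.1):
    results[omega] = sor(A, b, [1.0, 1.0, 1.0], omega, 1e-10, 500)
    print(omega, results[omega][1])
```

The printed sweep counts show how the choice of ω affects the speed of convergence on this small example; all three values lie in the interval (0, 2).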

Let $A$ be symmetric and $A = D + L + L^T$. The idea is to apply the SOR formulation twice at each iteration, once forward and once backward. That is, the SSOR method defines

\begin{align}
(D + \omega L)x^{(k-\frac{1}{2})} &= [(1 - \omega)D - \omega L^T]x^{(k-1)} + \omega b, \tag{9} \\
(D + \omega L^T)x^{(k)} &= [(1 - \omega)D - \omega L]x^{(k-\frac{1}{2})} + \omega b. \tag{10}
\end{align}

Define

\[ M_\omega := D + \omega L, \qquad N_\omega := (1 - \omega)D - \omega L^T. \]

Then from the iterations (9) and (10), it follows that

\begin{align*}
x^{(k)} &= M_\omega^{-T}N_\omega^T M_\omega^{-1}N_\omega x^{(k-1)} + \omega\left( M_\omega^{-T}N_\omega^T M_\omega^{-1} + M_\omega^{-T} \right)b \\
&\equiv T(\omega)x^{(k-1)} + M(\omega)^{-1}b,
\end{align*}

where

\[ M(\omega) = \frac{1}{\omega(2 - \omega)}(D + \omega L)D^{-1}(D + \omega L^T). \]

1. Take $x^{(0)} = [1, \ldots, 1]^T$ as the initial vector.

2. Use the MATLAB functions “triu(A,1)” and “tril(A,-1)” to extract the strictly upper and lower triangular parts of A, respectively.

3. Fix n = 100 and take 40 uniformly spaced values of the parameter ω in the interval (0, 2). Show the iteration numbers and CPU times of the SSOR iterative method for each ω, and find the optimal value ω* of the parameter ω.

4. Compare the iteration numbers and CPU times for the Jacobi, Gauss-Seidel and SSOR(ω*) iterative methods with various n.
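The parameter scan in item 3 can be organised as in the sketch below. It is an illustration in Python under several assumptions: a dense pure-Python implementation is only practical for small grids, so n = 3 and nine values of ω are used instead of n = 100 and 40 values; the function names are hypothetical; and one SSOR step is realised as a forward SOR sweep (9) followed by a backward SOR sweep (10), both written component by component.

```python
import math

def poisson_system(n):
    """Dense A from (7) and b = h^2 f for a small n."""
    h = 1.0 / (n + 1)
    size = n * n
    A = [[0.0] * size for _ in range(size)]
    b = [0.0] * size
    for j in range(n):
        for i in range(n):
            r = j * n + i
            A[r][r] = 4.0
            if i > 0:
                A[r][r - 1] = -1.0
            if i < n - 1:
                A[r][r + 1] = -1.0
            if j > 0:
                A[r][r - n] = -1.0
            if j < n - 1:
                A[r][r + n] = -1.0
            b[r] = (h * h * 2.0 * math.pi**2
                    * math.sin(math.pi * (i + 1) * h)
                    * math.sin(math.pi * (j + 1) * h))
    return A, b

def ssor(A, b, x0, omega, tol, max_iter):
    """SSOR iteration: forward sweep (9), then backward sweep (10)."""
    n = len(A)
    x = list(x0)

    def relax(i):
        s = sum(A[i][j] * x[j] for j in range(n) if j != i)
        x[i] = (1.0 - omega) * x[i] + omega * (b[i] - s) / A[i][i]

    for k in range(1, max_iter + 1):
        x_old = list(x)
        for i in range(n):               # forward sweep
            relax(i)
        for i in reversed(range(n)):     # backward sweep
            relax(i)
        change = math.sqrt(sum((p - q) ** 2 for p, q in zip(x, x_old)))
        if change < tol:
            return x, k
    return x, max_iter

A, b = poisson_system(3)
x0 = [1.0] * len(b)
omegas = [0.2 * m for m in range(1, 10)]          # 0.2, 0.4, ..., 1.8
counts = {w: ssor(A, b, x0, w, 1e-8, 2000)[1] for w in omegas}
best_omega = min(counts, key=counts.get)
x_best, k_best = ssor(A, b, x0, best_omega, 1e-8, 2000)
for w in omegas:
    print(round(w, 1), counts[w])
print("fewest iterations at omega =", round(best_omega, 1))
```

For the actual project the same loop structure applies, with the sweeps replaced by sparse triangular solves in MATLAB and the iteration numbers and CPU times recorded for each ω.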

(h) Use the conjugate gradient method to solve the linear system (6).

1. Use the MATLAB function pcg without any preconditioner:

    [x, flag, relres, iter] = pcg(A, b, tol, maxit)

2. Use the MATLAB function pcg with a given preconditioner:

    [x, flag, relres, iter] = pcg(A, b, tol, maxit, M),
    [x, flag, relres, iter] = pcg(A, b, tol, maxit, M1, M2),
    [x, flag, relres, iter] = pcg(A, b, tol, maxit, [], M2),
    [x, flag, relres, iter] = pcg(A, b, tol, maxit, MFUN).

   (i) Jacobi method: $A = D + (L + U)$, $M = D$,

   \[ x_{k+1} = -D^{-1}(L + U)x_k + D^{-1}b. \]

   (ii) Gauss-Seidel: $A = (D + L) + U$, $M = D + L$,

   \[ x_{k+1} = -(D + L)^{-1}Ux_k + (D + L)^{-1}b. \]

   (iii) SSOR: $A = D + L + L^T$, $M = M(\omega)$,

   \[ x^{(k)} = M_\omega^{-T}N_\omega^T M_\omega^{-1}N_\omega x^{(k-1)} + M(\omega)^{-1}b, \]

   where

   \[ M(\omega) = \frac{1}{\omega(2 - \omega)}(D + \omega L)D^{-1}(D + \omega L^T). \]

   (iv) M may be a function handle MFUN returning M⁻¹x:

    [x, flag, relres, iter] = pcg(A, b, tol, maxit, ...
        @(x)precSSOR(x,omega,mtxLower,mtxdiag))

3. Compare the iteration numbers and CPU times for pcg using different preconditioners with various n.
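To show what a preconditioner passed as a function does inside the iteration, here is a minimal Python sketch of the preconditioned conjugate gradient method. It is not MATLAB's pcg and makes no claim about its internals: the name `pcg_sketch`, the argument `apply_minv` (a function returning $M^{-1}r$, in the spirit of MFUN) and the 3 x 3 test system are all assumptions for illustration. The matrix is assumed symmetric positive definite and the initial guess is the zero vector.

```python
import math

def dot(u, v):
    """Euclidean inner product of two lists."""
    return sum(p * q for p, q in zip(u, v))

def pcg_sketch(A, b, tol, max_iter, apply_minv=None):
    """Preconditioned CG for SPD A; apply_minv(r) should return M^{-1} r."""
    def matvec(v):
        return [dot(row, v) for row in A]

    if apply_minv is None:
        def apply_minv(r):               # no preconditioner: M = I
            return list(r)

    x = [0.0] * len(b)                   # zero initial guess
    r = list(b)                          # residual b - A x for x = 0
    z = apply_minv(r)
    p = list(z)
    rz = dot(r, z)
    b_norm = math.sqrt(dot(b, b))
    for k in range(1, max_iter + 1):
        Ap = matvec(p)
        alpha = rz / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        if math.sqrt(dot(r, r)) <= tol * b_norm:   # relative residual test
            return x, k
        z = apply_minv(r)
        rz_new = dot(r, z)
        beta = rz_new / rz
        p = [zi + beta * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter

A = [[4.0, -1.0, 0.0],
     [-1.0, 4.0, -1.0],
     [0.0, -1.0, 4.0]]
b = [2.0, 4.0, 10.0]          # chosen so that the exact solution is [1, 2, 3]
diag = [A[i][i] for i in range(len(A))]

def jacobi_preconditioner(r):
    """Apply M^{-1} with M = D (the Jacobi preconditioner)."""
    return [ri / di for ri, di in zip(r, diag)]

x_plain, k_plain = pcg_sketch(A, b, 1e-10, 50)
x_prec, k_prec = pcg_sketch(A, b, 1e-10, 50, jacobi_preconditioner)
print(k_plain, x_plain)
print(k_prec, x_prec)
```

A Gauss-Seidel or SSOR preconditioner would be supplied in the same way, with `apply_minv` performing the corresponding triangular solves instead of a division by the diagonal.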