Numerical Methods 2021 — Midterm 2

(1)

Numerical Methods 2021 — Midterm 2

Solutions

Problem 1 (20 pts). Consider an upper triangular matrix A stored in a compressed row format (CSR)

(a) (5 pts) Let

A =





5 2 8 0 4 1 0 0 3





Show how A is stored in compressed row format in the following table.

Here we consider array index beginning with 1.

For column indexes with the same row index, you should store them in an ascending order in the table.

array index 1 2 3 4 5 6 a

acol ind arow ptr

(b) (10 pts) Assume A is stored in the same way as in (a). Give a code to solve any upper triangular linear equation

Ax = b, where we assume A_ii 6= 0.

(c) (5 pts) Let

A =





5 2 8 0 4 1 0 0 3



 and b =



 68 11 21



 Run your code to solve the system and output your answer.

Solution.

(a) The CSR matrix format of A is

array index 1 2 3 4 5 6

a 5 2 8 4 1 3

acol ind 1 2 3 2 3 3 arow ptr 1 4 6 7

(2)

(b) function x = upper_linear_solver(A, b)

n = size(A, 2); % A is m, n dim array [a, acol_ind, arow_ptr] = getCSR(A);

for i = n:-1:1

Aijxj = 0; % sum of Aij * xj, where j = i+1, ..., n Aii = a(arow_ptr(i));

for k = (arow_ptr(i)+1):(arow_ptr(i+1)-1) j = acol_ind(k); % column c, value v Aijxj = Aijxj + a(k) * x(j);

end

x(i) = (b(i) - Aijxj) / Aii;

end end (c) iter 1

Aijxj = 0

x[3] = b[3] - Aijxj / 3 = 7 iter 2

Aijxj = 0

Aijxj = Aijxj + a[5] * x[acol_ind[5]]

= Aijxj + (a[5] * x[3])

= Aijxj + (1 * 7)

= 7

x[2] = b[2] - Aijxj / 4 = 1 iter 3

Aijxj = 0

= Aijxj + (a[2] * x[2])

= Aijxj + (2 * 1)

= 2

= Aijxj + (a[3] * x[3])

= Aijxj + (8 * 7)

= 58

x[1] = b[1] - Aijxj / 5 = 2 ans =

2 1 7

Problem 2 (30 pts). Consider the following linear equation, Ax = b,

where A is an n-by-n symmetric positive-definite matrix. Assume A is a sparse matrix stored in compressed column (CSC) format. For row indexes with the same column index, we require that the CSC format stores them in an ascending order.

(3)

(a) (10 pts) In the Gauss-Seidel method, in each iteration we update the i-th element of x, x_i, by a value δ

xi ← xi+ δ.

Suppose we already cached a vector

c = b − Ax,

How to represent the value δ by using values in c and diagonal values of A?

Suppose we would like to maintain the vector c by using the new x. How to update c by using δ and components in A?

(b) (15 pts) Give the code to do the Gauss-Seidel method by using the technique in the prob. (a).

We require that it can take any vector as the first iterate.

Note that you may need to extract diagonal elements to a dense vector first.

Also, you may need to initialize an dense vector c to adapt the technique derived in prob. (a).

(c) (5 pts) Let

A =





4 1 1 1 4 0 1 0 3



 and b =



 18 25 25





Show how A is stored in compressed column format (CSC) in the following table.

Here we consider array index beginning with 1.

1 2 3 4 5 6 7 a

arow ind acol ptr Given initial solution

x0 =



 0 14

0





Run your code to solve the system. You need to details such as δ, x and the vector c at each step.

Solution.

(a) The new value ¯x_i by using the Gauss-Seidel method is

¯

x_i = bi−P

j6=iAijxj

A_ii . If we minus both sides with the original value x_i, we then have

δ = ¯x_i− x_i = b_i−P

j6=iA_ijx_j

A_ii − x_i = b_i−P

jA_ijx_j A_ii = ci

A_ii. For c, we update it by

c = b − A(x + δe_i) = c − δ





 A_1i

... A_ni





, where ei = [0, · · · , 1, · · · , 0]^T is a standard unit vector.

(4)

(b) function y = Gauss_Seidel(A, x, b) n = size(A, 2);

[a, arow_ind, acol_ptr] = getCSC(A);

% Initialize b - Ax and Adiag for i = 1:n

c(i) = b(i);

for j = acol_ptr(i):(acol_ptr(i+1)-1) r = arow_ind(j); v = a(j);

c(r) = c(r) - v * x(i);

if i == r

Adiag(i) = v;

end end end

% Run Gauss_Seidel until b - Ax is zero vector while sum(c) ˜= 0

for i = 1:n

delta = c(i) / Adiag(i);

if delta ˜= 0

x(i) = x(i) + delta;

for j = acol_ptr(i):(acol_ptr(i+1)-1) r = arow_ind(j); v = a(j);

c(r) = c(r) - v * delta;

end end end end y = x;

end

(c) The CSC format of A is

1 2 3 4 5 6 7

a 4 1 1 1 4 1 3

arow ind 1 2 3 1 2 1 3 acol ptr 1 4 6 8

iter 1

update x[1]

delta = 1

x = [1 14 0]

c = [0 -32 24]

update x[2]

delta = -8

(5)

x = [1 6 0]

c = [8 0 24]

update x[3]

delta = 8

x = [1 6 8]

c = [0 0 0]

y = 1 6 8

Problem 3 (35 pts). Consider a linear system Ax = b:





2 0 2 0 1 0 2 0 3



x =





−1

−1 1





(a) (5 pts) Is the matrix A symmetric positive definite? If True, please prove it. Otherwise, give a counterexample to show that is False.

(b) (5 pts) Take x₀ =0 0 0^T. Do two CG iterations and show what x₁ and x₂ are.

(c) (5 pts) Is x₂ a solution or not?

(d) (10 pts) Let r1 and p₁ be vectors from the aforementioned procedures. Solve the following opti- mization problem with the variable p:

minp kp − r₁k²₂

s.t. p ∈ span{Ap₁}^⊥

(1)

How is the solution of this problem connected to p₂ obtained in the CG procedure?

(e) (5 pts) If we denote the solution of (1) as ¯p₂ and calculate ¯α2, x2 by

¯

α₂ = p¯^T₂r₁

¯

p^T₂A¯p₂, x₂ = x₁+ ¯α₂p¯₂. What are ¯α2, x2 ?

(f) (5 pts) In slides, we have a theorem by writing

A = I + B

and checking the relationship between rank(B) and the number of CG steps. From what you have observed in (a)-(d), can you say that rank(B) must be greater or equal to some value? Then check B to confirm the result.

One may indeed make mistakes in doing the calculation. However, if you understand the concept of CG, you should be able to easily validate your results.

(6)

Solution.

(a) Since

A = A^T, A is symmetric. Given a vector v = [v₁, v₂, v₃]^T,

v^TAv = 2v²₁+ 4v₁v₃+ 3v₃²+ v²₂ = 2(v₁+ v₃)²+ v²₃+ v₂² That implies

v^TAv = 0 iff v = 0.

Therefore, A is a symmetric positive definite matrix.

(b)

x₀ =



 0 0 0



, r₀ = b =





−1

−1 1



,

⇒p₁ =





−1

−1 1



, α₁ = r^T₀r₀ p^T₁Ap₁ = 3

2, x₁ = x₀ + α₁p₁ =





−3/2

−3/2 3/2



, r₁ = r₀− α₁Ap₁ =





−1 1/2

−1/2





⇒β₂ = r^T₁r₁ r^T₀r₀ = 1

2, p₂ = r₁+ β₂p₁ =





−3/2 0 0



, α₂ = r^T₁r₁ p^T₂Ap₂ = 1

3, x₂ = x₁+ α₂p₂ =





−2

−3/2 3/2



,

(c) Since

r₂ = r₁ − α₂Ap₂ =



 0 1/2 1/2



 is not zero, x₂ is not the solution.

(d) We can derive that

span{Ap₁} =



 0

−t t



, t ∈ R, so span{Ap₁}^⊥ is equivalent to

−p2 + p3 = 0 Therefore, (1) is equivalent to

minp





p1+ 1 p₂− 1/2 p₃+ 1/2





2

≡ min

p





p1+ 1 p₂− 1/2 p₂+ 1/2





2

≡ min

p (p₁+ 1)² + (p₂− 1/2)²+ (p₂+ 1/2)² s.t. − p₂+ p₃ = 0

and

(p₁+ 1)²+ (p₂− 1/2)²+ (p₂+ 1/2)² = (p₁+ 1)²+ 2p²₂+ 1/2.

Thereby, the solution is −1 0 0^T, and we can see that p is parallel to p₂.

(7)

(e)

¯ p₂ =





−1 0 0



, ¯α₂ = p¯^T₂r₁ p¯^T₂A¯p₂ = 1

2, x₂ = x₁+ ¯α₂p¯₂ =





−3/2

−3/2 3/2



+ 1 2





−1 0 0



=





−2

−3/2 3/2



. (f) From the theorem on page 6 of Conjugate gradient methods part4,

the rank of B is at least two (2)

because x₂ is not a solution. Let us check on the rank of B. The matrix B can be found by

A =





2 0 2 0 1 0 2 0 3



= I +





1 0 2 0 0 0 2 0 2



= I + B

The rank of B is two since there are only two linear independent rows in B. Thus, the result (2) holds.

Problem 4 (15 pts). In ours slides, we use

f (x) = cos(x)

as an example to run the Newton method. Now we would like to check if it satisfies the theorem proved in slides and have the quadratic convergence.

(a) (5 pts) Is f (x) Lipschitz continuous? If so, prove it and give an α.

(b) (5 pts) For this particular f (x), can you directly apply Taylor expansion to prove Lemma 3? (Hint:

Check the Taylor theorem.) (c) (5 pts) Now consider

x^∗ = π 2 and

{x^k} → x^∗.

Find ¯β and δ such that the second result of the theorem holds. That is,

|x^k+1− x^∗| ≤ δ|x^k− x^∗|², ∀k ≥ L.

Solution.

(a) We know that

f⁰(x) = − sin(x).

By Mean Value Theorem, for any x and y, there exists t between x and y such that sin(x) − sin(y) = (x − y) sin⁰(t).

We have

| sin⁰(t)| = | cos(t)| ≤ 1, so

| sin(x) − sin(y)| = |(x − y) sin⁰(t)| = |x − y|| cos(t)| ≤ |x − y|, ∀x, y.

Thus we can choose

α = 1.

(8)

(b) Taylor Theorem tells us that the Taylor expansion of f (y) at x:

cos(y) = cos(x) − sin(x)(y − x) − 1

2cos(t)(y − x)², where t is between x and y. Therefore, the error term is

e(x, y) = −1

2cos(t)(y − x)², and

|e(x, y)| = |1

2cos(t)(y − x)²|

= 1

2| cos(t)||(y − x)²|

≤ 1

2|y − x|² (c) Follow the slides, we have

β = |f¯ ⁰(x^∗)⁻¹| =

1 sin(π/2)

= 1 and

δ = α ¯β = 1.