•••• Why Studying NP-Completeness ? NP-Completeness : Concepts

(1)

NP-Completeness :

Concepts

•

• • Why Studying NP-Completeness ?

♣ Pursuing your Ph.D.

♣ Keeping your job

Before studying NP-completeness:

“I can’t find an efficient algorithm,I guess I’m just too dumb.”

(2)

After studying NP-completeness:

“I can’t find an efficient algorithm, because no such algorithm is possible!”

“I can’t find an efficient algorithm, but neither can all these famous people.”

(3)

•

• • Measure to Time Complexity

l: measure to the time complexity of an algorithm

The discussion of NP-completeness considers l the input size,i.e.,the total length of all inputs to the algorithm.

Two assumptions:

(1) all inputs are integers(a rational number can be represented by a pair of integers);

(2) each integer has a binary representation.

Ex.Sortinga₁, a₁, …, a_n.

l = ⁿ ( _i

)

i

a

=

 

+

∑ 

²



1

log 1 .

(4)

Ex.Consider the following procedure.

input(n);

s ←←←← 0;

fori ←←← 1 to n← do s ←←←← s+i;

output(s).

l = log₂n+1.

The procedure takesO(n)=O(2^l)time.

⇒ an exponential-time algorithm !

(5)

•

• • Polynomial-Time Algorithms

vs.

Exponential-Time Algorithms

Suppose that your computer takes 1 second to perform 10⁶ operations.

The following is the time requirement for your computer to performf(n)operations,where f(n) = n, n², n³, n⁵, 2ⁿ, 3ⁿandn = 10, 20, 30, 40, 50, 60.

(6)

The following shows the largest value ofnsuch thatf(n)operations can be performed in 1 hour on a faster computer.

(7)

An algorithm is referred to as apolynomial-time algorithm,if its time complexity can be bounded above by a polynomial function of input size.

An algorithm is referred to as anexponential-time algorithm,if its time complexity cannot be thus bounded(even if the function is not normally regarded as an exponential one,liken^logⁿ).

Usually,a problem is referred to as tractable if it can be solved with a polynomial-time algorithm, and intractable otherwise.

The two tables above give us a reason why polynomial-time algorithms are much more desirable than exponential-time algorithms.

They also motive us to study the theory of NP-completeness.

(8)

•

• • Maximal

vs.

Maximum

Ex.

maximal cliques :{1, 2, 3}, {2, 3, 4, 5}, {4, 6}

maximum cliques :{2, 3, 4, 5}

(9)

•

• • Decision Problems

vs.

Optimization Problems

A decision problem asks a solution of“yes”or

“no”.

An optimization problem asks a solution of an optimal value(a maximumora minimum).

Ex. The maximum clique problem can be expressed as a decision problem as follows.

Instance: An undirected graphG=(V, E)anda positive integerk≤≤≤≤|V|.

Question: Does G contain a clique of size≥≥≥≥k?

It can be also expressed as an optimization problem as follows.

Instance: An undirected graphG=(V, E).

Question: What is the size of a maximum clique of G?

(10)

Ex. The traveling salesman problem can be expressed as a decision problem as follows.

Instance: A set C of m cities,distances d_i,j>0 for all pairs of citiesi, j∈∈∈∈C,and a positive integer k.

Question: Is there a tour of length≤≤≤≤k that starts at any city,visits each of the other m−−−−1 cities exactly once,and returns to the initial city?

It can be also expressed as an optimization problem as follows.

Instance: A set C of m citiesanddistances d_i,j>0 for all pairs of citiesi, j∈∈∈∈C.

Question: What is the length of a shortest tour that starts at any city,visits each of the other m−−−−1 cities exactly once, and returns to the initial city?

(11)

Ex. The problem of sortinga₁, a₁, …, a_ncan be expressed as a decision problem as follows.

Instance: Givena₁, a₂, …, a_nand a positive integer k.

Question: Is there a permutation ofa₁, a₂,…,a_n, denoted bya’₁, a’₂,…,a’_n,such that

|a’₂−−−−a’₁|+|a’₃−−−−a’₂|+…+|a’_n−−−−a’_n−₋₋−1|≤≤≤≤k?

An optimization problem is “harder” than its corresponding decision problem.

Since the NP-completeness concerns whether or not a problem can be solved in polynomial time, the discussion of NP-completeness considers only decision problems.

(If a decision problem is not polynomial-time solvable,then its corresponding optimization problem is not polynomial-time solvable either.)

(12)

•

• • Problem Reduction

A problem P₁ reduces to another problem P₂, denoted by P₁ ∝∝∝∝ P₂, if any instance of P₁ can be transformed into an instance of P₂ such that the solution for P₁ can be obtained from the

solution for P₂.

T∝∝∝∝ : the reduction time.

T: the time required to obtain the solution for P₁ from the solution for P₂.

Since the NP-completeness concerns whether or not a problem can be solved in polynomial time, we consider only the reductions with both T∝∝∝∝ and T polynomial.

(Thus,P₂∈∈∈∈P⇒⇒⇒⇒ P₁∈∈∈∈P or P₁∉∉∉∉P⇒⇒⇒⇒ P₂∉∉∉∉P.) IfP₁∝∝∝∝P₂andP₂∝∝∝∝P₃,thenP₁∝∝∝∝P₃.

(13)

•

• • P,

NP,

and

NP-Complete

Three classes of decision problems: P,NP,andNP-complete.

P: the set of decision problems that can be solved in polynomial time by deterministic algorithms.

NP: the set of decision problems that can be solved in polynomial time by non-deterministic

algorithms.

Any non-deterministic algorithm consists of two phases:guessing and checking.

(14)

For the maximum clique problem,the guessing phase will return a clique,and the checking phase will decide whether or not the clique size is greater than or equal to k.

For the traveling salesman problem,the guessing phase will return a tour,and the checking phase will decide whether or not the tour length is greater than or equal to k.

A decision problem has an AFFIRMATIVE answer.

⇔⇔

⇔⇔ The guessing is SUCCESSFUL.

Notice that non-deterministic algorithms are imaginary.A more detailed description of non- deterministic algorithms and more illustrative examples can be found in Ref. (2).

(15)

Every decision problem in P is also in NP,i.e., P ⊆⊆⊆⊆ NP.

An NP problem isNP-completeif every NP problem can reduce to it in polynomial time.

⇒

⇒⇒

⇒ If any NP-complete problem can be solved in polynomial time,then every NP problem can be solved in polynomial time(i.e.,P=NP).

(Intuitively, NP-complete problems are the

“hardest” problems in NP.)

It is one of the most famous open problems in computer science whetherP≠≠≠≠NPorP=NP.

(16)

WhenP≠≠≠≠NP,

P

NP

NP-Complete

(There exist problems in NP that are neither in P, nor in NP-complete(see Chap.7 in Ref. (1).)

WhenP=NP,

P = NP = NP-Complete

Almost all people believeP≠≠≠≠NP.

(17)

A problem isNP-hardif an NP-complete problem can be reduced to it in polynomial time.

(Equivalently,a problem isNP-hardif every NP problem can be reduced to it in polynomial time.)

⇒

⇒⇒

⇒ If any NP-hard problem can be solved in polynomial time,then all NP-complete problems can be solved in polynomial time.

(Intuitively,NP-hard problems are “harder”

than NP-complete problems.)

NP NP-hard

NP-complete

The class of NP-hard problems contains both decision problems and optimization problems.

(18)

If an NP-hard problem is in NP,then it is an NP-complete problem.

(Intuitively,NP-complete problems are an “easier”

subclass of NP-hard problems.)

The corresponding optimization problems of NP-complete problems are NP-hard.

The well-known halting problem (a decision problem), which is to determine whether or not an algorithm will terminate with a given input, is NP-hard, but not NP-complete.

(19)

•

• • Pseudo-Polynomial Time Algorithms

Ex. Given a setS={a₁, a₁, …, a_n}of integers and an integer M>0,the sum-of-subset problem is to determine whether or not there exists a subset

of S whose sum is equal to M.

This problem can be solved inO(nM)time by dynamic programming as follows.

Lett(i, j)=true, if there exists a subset of {a₁, a₂, …, a_i}whose sum is equal to j, and falseelse.

Then,

t(i, j)=t(i−−−−1, j)+t(i−−−−1, j−−−−a_i),wherei>1.

Initially,t(1, j)=true, ifj=0orj=a₁, andfalse else.

The answer ist(n, M).

(20)

Although the time complexity is exponential with respect toM,the problem is considered polynomial-time solvable,ifMis bounded.

An algorithm like this is usually referred to as apseudo-polynomial time algorithm.

An NP-complete problem is in the strong sense if and only ifthere exists no pseudo-polynomial time algorithm for solving it(unlessP=NP).

Intuitively, NP-complete problems in the strong sense are “harder” NP-complete problems (refer to Ref.(1)).

(21)

•

• • The Satisfiability Problem and Cook’s Theorem

The satisfiability problem,which is the first NP-complete problem,is defined as follows.

Instance: A set U of Boolean variables and a collection C of clauses over U.

Question: Is there an assignment of U that can satisfy C?

Ex. WhenU={x₁, x₂, x₃}andC={x₁∨∨∨∨x₂∨∨∨∨x₃, x1, x

2},the assignment of U: x₁←←←←F, x₂←←←←Fandx₃←←←←T,can satisfy C(i.e., (x₁∨∨∨∨x₂∨∨∨∨x₃)∧∧∧∧(x

1)∧∧∧∧(x

2) = T).

(22)

Ex. WhenU={x₁, x₂}andC={x₁∨∨∨∨x₂,x₁∨∨∨∨ x

2, x1 ∨∨∨∨x₂, x

1 ∨∨∨∨ x

2},no assignment of U can satisfy C.

Cook’s Theorem: The satisfiability problem is NP-complete.

The proof ofCook’s Theorem, which is rather lengthy and complex, can be found inRef.(1) andRef.(2).

There is an informal proof ofCook’s Theorem in the textbook.

(23)

•

• • Six Basic NP-Complete Problems

(P1)3-Satisfiability

Instance: A set U of variables and a collection C={c₁, c₂,…,c_m} of clauses over U, where each clause of C contains three literals.

Question: Is there a satisfying truth assignment for C?

Ex. WhenU = {x₁, x₂, x₃}andC = {x₁∨∨∨∨x₂∨∨∨∨x₃, x1∨∨∨∨x

2∨∨∨∨x₃},the assignment of U: x₁←←←←T, x₂←←←←Fandx₃←←←←F,can satisfy C.

Ex. WhenU = {x₁, x₂, x₃}andC = {x₁∨∨∨∨x₂∨∨∨∨x₃, x

1∨∨∨∨x₂∨∨∨∨x₃, x₁∨∨∨∨x

2∨∨∨∨x₃, x₁∨∨∨∨x₂∨∨∨∨x

3, x

1∨∨∨∨x

2∨∨∨∨x₃, x

1∨∨∨∨x₂∨∨∨∨x

3, x₁∨∨∨∨x

2∨∨∨∨x

3, x

1∨∨∨∨x

2∨∨∨∨x

3}, no assignment of U can satisfy C.

(24)

(P2)Vertex Cover

Instance: An undirected graph G=(V, E) and a positive integer k≤≤≤≤|V|.

Question: Does G contain a vertex cover of size at mostk,i.e.,a subset V’⊆⊆⊆⊆V such that |V’|≤≤≤≤k and for each (u, v)∈∈∈∈E,at least one ofuandvbelongs to V’?

Ex.

|V’| = 4, 5⇒⇒⇒⇒ V’ is a vertex cover;

|V’| = 3: {1, 2, 3}, {1, 3, 4}, {1, 3, 5}, {2, 3, 4}, and {2, 3, 5} are vertex covers;

|V’| < 3⇒⇒⇒⇒ V’ is not a vertex cover.

(25)

(P3)3-Dimensional Matching

Instance: A set M⊆⊆⊆⊆W××××X××××Y,where W,Xand Yare three disjoint sets,each having qelements.

Question: DoesMcontain a matching,i.e.,a subset M’⊆⊆⊆⊆M such that each element of W,XandYappears inM’exactly

once (|M’|=q)?

Ex. Suppose W={a, b}, X={c, d}, and Y={e, f}.

If M={(a, c, f), (b, d, e), (a, d, f)}, thenM contains a matching M’={(a, c, f), (b, d, e)}.

If M={(a, c, f), (b, c, e), (b, d, f)}, then M

does not contain a matching.

(26)

(P4)Clique

Instance: An undirected graph G=(V, E) and a positive integer k≤≤≤≤|V|.

Question: DoesGcontain a clique of size at leastk, i.e.,a subset V’⊆⊆⊆⊆V such

that |V’|≥≥≥≥k and every two vertices ofV’are adjacent inG?

Ex.

|V’| = 4, 5 ⇒⇒⇒⇒ V’ is not a clique;

|V’| = 3: {1, 2, 3} is a clique;

|V’| = 2: {1, 2}, {1, 3}, {2, 3}, {3, 4} and {3, 5} are cliques.

(27)

(P5)Hamiltonian Cycle

Instance: An undirected graph G=(V, E).

Question: DoesGcontain a Hamiltonian cycle, i.e.,an ordering (v₁, v₂,…,v_|V|) of the vertices ofGsuch that (v₁, v_|V|)∈∈∈∈

E and (v_i, v_i+1)∈∈∈∈Efor all 1≤≤≤≤i<|V|?

Ex.

The left graph has a Hamiltonian cycle, but the right graph does not.

(28)

(P6)Partition

Instance: A multiset A={a₁, a₂,…,a_|A|} of positive integers.

Question: Does there exist A’⊆⊆⊆⊆A such that

i

a A' i

∑

a

∈

=

j

a A A' j

a

−

∑

∈

?

Ex. The multiset {2, 2, 4, 4, 8} can be divided into

{2, 4, 4} and {2, 8} whose sums are equal.

On the other hand, {2, 2, 4, 4, 7} cannot be divided similarly.

(29)

The six NP-complete problems above were shown in Ref.(1) in the following way,where each“→→→→” represents a reduction“∝∝∝∝”(for example, Vertex Cover∝∝∝∝ Clique).

Satisfiability

3-Satisfiability

3-Dimensional Matching

Partition

Vertex Cover

Clique Hamiltonian

Cycle

It is still possible to show these NP-complete problems(and others)in a different way,i.e., using different known NP-complete problems.

A list of NP-complete problems can be found in Appendix of Ref.(1).

(30)

•

• • Two-Sided Analysis of Problems

If some restrictions are imposed on a problemΠΠΠΠ, then a restricted subproblemΠΠΠΠ’ofΠΠΠΠresults.

SupposeΠΠΠΠ, ΠΠΠΠ’∈∈∈∈NPandP≠≠≠≠NP.

ΠΠΠ

Π’is NP-complete ⇒⇒⇒⇒ ΠΠΠΠis NP-complete.

Π ΠΠ

Πis NP-complete ⇒⇒⇒ ⇒ ΠΠΠΠ’is inP or NP-complete orneither.

Π Π Π Π Π

Π ΠΠ’

(31)

(“→→→→”means“a subproblem of”)

The frontier is narrowed down,if some open problems are shown to be inPor NP-complete.

Ex.Letdbe the maximal vertex degree inG.

BothVertex CoverandHamiltonian Cycle are inP ifd≤≤≤≤2,and NP-complete ifd≥≥≥≥3.

(32)

Ex. Graph 3-Colorability

Instance: An undirected graph G=(V, E).

Question: Is G 3-colorable,i.e.,does there exist a functionf: V →→→→ {1, 2, 3}

such thatf(u)≠≠≠≠f(v)for all edges (u, v)∈∈∈∈E?

Graph 3-Colorability is in Pifd≤≤≤≤3,and NP-completeifd≥≥≥≥4orG is planar.

Ex.

(33)

Ex. Precedence Constrained Scheduling Instance: A set T of“tasks”,each of

“length”1,a partial order p on T,a“deadline”d,and m“processors”.

Question: Is there a“schedule”f: T→→→→{0, 1, …, d}such thatf(t)<f(t’)iftp t’,

andfor eachi∈∈∈∈{0, 1, …, d},

|{t∈∈∈∈T: f(t)=i}|≤≤≤≤m?

(34)

•

• • Coping with NP-Hard Problems

optimal polynomial solution ? time ?

greedy (heuristic) not yes

algorithms guaranteed

dynamic yes experimentally

programming & efficient

branch-and-bound algorithms

genetic algorithms & not experimentally ant algorithms guaranteed efficient approximation a guaranteed yes

algorithms error bound (exclusive of approximation

schemes) randomized a high probability yes algorithms

or

yes a high probability

average polynomial yes in average case time algorithm