# The Proof (concluded)


### Unapproximability of tsp


Theorem 80 The approximation threshold of tsp is 1 unless P = NP.

Suppose there is a polynomial-time ε-approximation algorithm for tsp for some ε < 1.

We shall construct a polynomial-time algorithm for the NP-complete hamiltonian cycle.

Given any graph G = (V, E), construct a tsp instance with |V| cities and distances

$$d_{ij} = \begin{cases} 1, & \text{if } \{i, j\} \in E, \\ \dfrac{|V|}{1-\varepsilon}, & \text{otherwise.} \end{cases}$$

Sahni and Gonzalez (1976).

### The Proof (concluded)

Run the alleged approximation algorithm on this tsp.

Suppose a tour of cost |V | is returned.

– This tour must be a Hamiltonian cycle.

Suppose a tour with at least one edge of length |V|/(1 − ε) is returned.

The total length of this tour is > |V|/(1 − ε).

Because the algorithm is ε-approximate, the optimum is at least (1 − ε) times the returned tour's length.

So the optimum tour has a cost exceeding (1 − ε) · |V|/(1 − ε) = |V|.

Hence G has no Hamiltonian cycles.
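The reduction can be exercised end to end on a tiny graph. The sketch below is an illustration added here, not part of the slides: it builds the distance matrix of the reduction and finds the optimal tour by brute force. The 4-cycle C4 is an assumed toy input.

```python
from itertools import permutations

def tsp_instance(n, edges, eps):
    # Distance matrix of the reduction: 1 for edges of G,
    # |V|/(1 - eps) for non-edges.
    big = n / (1 - eps)
    return [[0.0 if i == j else
             (1.0 if frozenset((i, j)) in edges else big)
             for j in range(n)] for i in range(n)]

def optimal_tour_cost(d):
    # Brute-force search over all tours (fine for tiny n); city 0 is fixed.
    n = len(d)
    return min(sum(d[a][b] for a, b in zip((0,) + p + (0,), p + (0,)))
               for p in permutations(range(1, n)))

# C4 (a 4-cycle) has a Hamiltonian cycle, so the optimum is exactly |V| = 4.
edges = {frozenset(e) for e in [(0, 1), (1, 2), (2, 3), (3, 0)]}
print(optimal_tour_cost(tsp_instance(4, edges, 0.5)))   # 4.0
# Drop one edge: every tour now uses a long edge, so the optimum exceeds
# |V|/(1 - eps) = 8 and can no longer be confused with |V|.
print(optimal_tour_cost(tsp_instance(4, edges - {frozenset((3, 0))}, 0.5)))  # 11.0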


### knapsack Has an Approximation Threshold of Zero


Theorem 81 For any ε, there is a polynomial-time ε-approximation algorithm for knapsack.

We have n weights w1, w2, . . . , wn ∈ Z⁺, a weight limit W, and n values v1, v2, . . . , vn ∈ Z⁺.

We must find an S ⊆ {1, 2, . . . , n} such that Σ_{i∈S} wi ≤ W and Σ_{i∈S} vi is the largest possible.

Ibarra and Kim (1975).

If the values are fractional, the result is slightly messier, but the main conclusion remains correct. Contributed by Mr. Jr-Ben Tian (R92922045) on December 29, 2004.


### The Proof (continued)

Let

V = max{v1, v2, . . . , vn}.

Clearly, Σ_{i∈S} vi ≤ nV.

Let 0 ≤ i ≤ n and 0 ≤ v ≤ nV .

W (i, v) is the minimum weight attainable by selecting some of the first i items with a total value of v.

Set W(0, v) = ∞ for v ∈ {1, 2, . . . , nV} and W(i, 0) = 0 for i = 0, 1, . . . , n.

Contributed by Mr. Ren-Shuo Liu (D98922016) and Mr. Yen-Wei Wu (D98922013) on December 28, 2009.


### The Proof (continued)

Then, for 0 ≤ i < n,

W(i + 1, v) = min{W(i, v), W(i, v − v_{i+1}) + w_{i+1}}.

Finally, pick the largest v such that W (n, v) ≤ W .

The running time is O(n²V), which is not polynomial in the input length because V can be exponential in the number of input bits.
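The W(i, v) table can be sketched as follows; collapsing the table into a single rolling array is an implementation choice added here, not part of the slides.

```python
def knapsack_exact(w, v, W):
    # Dynamic program of the slides, with W(i, v) collapsed into one
    # rolling array: best[val] = minimum weight achieving total value val.
    n, V = len(w), max(v)
    INF = float("inf")
    best = [0] + [INF] * (n * V)          # best[0] = 0, the rest infinity
    for i in range(n):
        # Traverse values downward so each item is used at most once.
        for val in range(n * V, v[i] - 1, -1):
            if best[val - v[i]] + w[i] < best[val]:
                best[val] = best[val - v[i]] + w[i]
    # Finally, pick the largest value whose minimum weight fits the limit W.
    return max(val for val in range(n * V + 1) if best[val] <= W)

print(knapsack_exact([3, 4, 5], [4, 5, 6], 8))  # 10: the items of weight 3 and 5
```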

Key idea: Limit the number of precision bits.


### The Proof (continued)

Define

$$v_i' = 2^b \left\lfloor \frac{v_i}{2^b} \right\rfloor.$$

This is equivalent to zeroing each vi’s last b bits.

From the original instance x = (w1, . . . , wn, W, v1, . . . , vn), define the approximate instance x′ = (w1, . . . , wn, W, v′1, . . . , v′n).


### The Proof (continued)

Solving x′ takes time O(n²V/2^b).

– The algorithm only performs subtractions on the vi-related values.

– So the last b bits can be removed from the calculations.

– That is, use ⌊vi/2^b⌋ in the calculations.

– Then multiply the returned value by 2^b.

The solution S′ is close to the optimum solution S:

$$\sum_{i\in S'} v_i \;\ge\; \sum_{i\in S'} v_i' \;\ge\; \sum_{i\in S} v_i' \;\ge\; \sum_{i\in S} (v_i - 2^b) \;\ge\; \sum_{i\in S} v_i - n2^b.$$


### The Proof (continued)

Hence

$$\sum_{i\in S'} v_i \;\ge\; \sum_{i\in S} v_i - n2^b.$$

Without loss of generality, assume wi ≤ W for all i.

Otherwise item i is redundant.

V is a lower bound on the optimum value.

– Picking the single item of value V alone is a feasible choice (its weight is at most W).

The relative error from the optimum is ≤ n2^b/V:

$$\frac{\sum_{i\in S} v_i - \sum_{i\in S'} v_i}{\sum_{i\in S} v_i} \;\le\; \frac{\sum_{i\in S} v_i - \sum_{i\in S'} v_i}{V} \;\le\; \frac{n2^b}{V}.$$


### The Proof (concluded)

Suppose we pick b = ⌊log₂(εV/n)⌋.

The algorithm becomes ε-approximate (see Eq. (10) on p. 639).

The running time is then O(n²V/2^b) = O(n³/ε), a polynomial in n and 1/ε.

It hence depends on the value of 1/ε. Thanks to a lively class discussion on December 20, 2006. If we fix ε and let the problem size increase, then the complexity is cubic. Contributed by Mr. Ren-Shan Luoh (D97922014) on December 23, 2008.
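The whole rounding-and-DP scheme can be sketched in one function. This is an added illustration, not the slides' own code; remembering the chosen item set in `pick` is extra bookkeeping so the answer can be reported in the original (unscaled) values.

```python
import math

def knapsack_fptas(w, v, W, eps):
    # Drop the last b bits of each value, b = floor(log2(eps*V/n)), run the
    # exact dynamic program on the scaled values, then report the chosen
    # set's value in the original instance.
    n, V = len(w), max(v)
    b = max(0, math.floor(math.log2(eps * V / n)))   # b = 0 when eps*V < n
    scaled = [x >> b for x in v]
    top = n * max(scaled)
    INF = float("inf")
    best = [0] + [INF] * top
    pick = [frozenset()] * (top + 1)                 # chosen item sets
    for i in range(n):
        for val in range(top, scaled[i] - 1, -1):
            cand = best[val - scaled[i]] + w[i]
            if cand < best[val]:
                best[val] = cand
                pick[val] = pick[val - scaled[i]] | {i}
    v_star = max(val for val in range(top + 1) if best[val] <= W)
    return sum(v[i] for i in pick[v_star])

# Optimum here is 80 (the items of weight 2 and 4); the guarantee is only
# that the answer is at least (1 - eps) times the optimum.
print(knapsack_fptas([2, 3, 4], [30, 40, 50], 6, 0.5))  # 80
```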


### Pseudo-Polynomial-Time Algorithms

Consider problems whose inputs consist of a collection of integer parameters (tsp, knapsack, etc.).

An algorithm for such a problem whose running time is a polynomial of the input length and the value (not length) of the largest integer parameter is a pseudo-polynomial-time algorithm.

On p. 665, we presented a pseudo-polynomial-time algorithm for knapsack that runs in time O(n2V ).

How about tsp (d), another NP-complete problem?

Garey and Johnson (1978).


### No Pseudo-Polynomial-Time Algorithms for tsp (d)

By definition, a pseudo-polynomial-time algorithm becomes polynomial-time if each integer parameter is limited to having a value polynomial in the input length.

Corollary 43 (p. 344) showed that hamiltonian path is reducible to tsp (d) with weights 1 and 2.

As hamiltonian path is NP-complete, tsp (d) cannot have pseudo-polynomial-time algorithms unless P = NP.

tsp (d) is said to be strongly NP-hard.

Many weighted versions of NP-complete problems are strongly NP-hard.


### Polynomial-Time Approximation Scheme

Algorithm M is a polynomial-time approximation scheme (PTAS) for a problem if:

For each ε > 0 and each instance x of the problem, M runs in time polynomial (depending on ε) in |x|.

– Think of ε as a constant.

M is an ε-approximation algorithm for every ε > 0.


### Fully Polynomial-Time Approximation Scheme

A polynomial-time approximation scheme is fully polynomial (FPTAS) if the running time depends polynomially on |x| and 1/ε.

– Maybe the best result for a “hard” problem.

– For instance, knapsack is fully polynomial with a running time of O(n³/ε) (p. 663).


### Square of G

Let G = (V, E) be an undirected graph.

G² has node set {(v1, v2) : v1, v2 ∈ V} and edge set

{{(u, u′), (v, v′)} : (u = v ∧ {u′, v′} ∈ E) ∨ {u, v} ∈ E}.

[Figure: the path G on nodes 1, 2, 3, and its square G², whose nodes are the pairs (i, j) for 1 ≤ i, j ≤ 3.]
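The definition of G² is direct to program. The sketch below is an added illustration: it builds G² for the 3-node path of the figure (0-indexed) and checks that a maximum independent set of G lifts to one of squared size, as Lemma 82 below asserts.

```python
from itertools import combinations

def square(n, edges):
    # Nodes of G^2 are the pairs (v1, v2); {(u,u'),(v,v')} is an edge iff
    # u = v and {u',v'} in E, or {u,v} in E.
    E = {frozenset(e) for e in edges}
    nodes = [(a, b) for a in range(n) for b in range(n)]
    sq = set()
    for (u, u2), (v, v2) in combinations(nodes, 2):
        if (u == v and frozenset((u2, v2)) in E) or frozenset((u, v)) in E:
            sq.add(frozenset(((u, u2), (v, v2))))
    return sq

# The path 1-2-3 of the figure, 0-indexed as 0-1-2:
E2 = square(3, [(0, 1), (1, 2)])
# The maximum independent set {0, 2} of G lifts to the 4 pairs below,
# which are pairwise non-adjacent in G^2 (Lemma 82 with k = 2).
I = {(a, b) for a in (0, 2) for b in (0, 2)}
print(len(E2), all(frozenset(p) not in E2 for p in combinations(I, 2)))
```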


### Independent Sets of G and G²

Lemma 82 G = (V, E) has an independent set of size k if and only if G² has an independent set of size k².

Suppose G has an independent set I ⊆ V of size k.

{(u, v) : u, v ∈ I} is an independent set of size k² of G².



### The Proof (continued)

Suppose G² has an independent set I₂ of size k².

U ≡ {u : ∃v ∈ V such that (u, v) ∈ I₂} is an independent set of G.


| U | is the number of “rows” that the nodes in I₂ occupy.


### The Proof (concluded)


If | U | ≥ k, then we are done.

Now assume | U | < k.

As the k² nodes in I₂ cover fewer than k “rows,” there must be a “row” containing more than k nodes of I₂.

Those > k nodes are independent in G because each “row” is a copy of G; hence G has an independent set of size ≥ k.

Thanks to a lively class discussion on December 29, 2004.


### Approximability of independent set

The approximation threshold of the maximum independent set problem is either zero or one (it is in fact one!).

Theorem 83 If there is a polynomial-time ε-approximation algorithm for independent set for any 0 < ε < 1, then there is a polynomial-time approximation scheme.

Let G be a graph with a maximum independent set of size k.

Suppose there is an O(n^i)-time ε-approximation algorithm for independent set.

We seek a polynomial-time ε′-approximation algorithm for any 0 < ε′ < ε.


### The Proof (continued)

By Lemma 82 (p. 675), the maximum independent set of G2 has size k2.

Apply the algorithm to G2.

The running time is O(n^{2i}).

The resulting independent set has size ≥ (1 − ε)k².

By the construction in Lemma 82 (p. 675), we can obtain an independent set of size ≥ √((1 − ε)k²) = √(1 − ε) · k for G.

Hence there is a (1 − √(1 − ε))-approximation algorithm for independent set by Eq. (11) on p. 640.


### The Proof (concluded)

In general, we can apply the algorithm to G^{2^ℓ} to obtain a (1 − (1 − ε)^{2^{−ℓ}})-approximation algorithm for independent set.

The running time is n^{2^ℓ i}.

Now pick ℓ = ⌈log₂(log(1 − ε)/log(1 − ε′))⌉.

The running time becomes n^{i log(1 − ε)/log(1 − ε′)}.

It is an ε′-approximation algorithm for independent set.

It is not fully polynomial.


independent set and node cover are reducible to each other (Corollary 40, p. 309).

node cover has an approximation threshold at most 0.5 (p. 645).

But independent set is unapproximable (see the textbook).

independent set limited to graphs with degree ≤ k is called k-degree independent set.

k-degree independent set is approximable (see the textbook).


## On P vs. NP


### Density

The density of a language L ⊆ Σ* is defined as densL(n) = |{x ∈ L : | x | ≤ n}|.

If L = {0, 1}*, then densL(n) = 2^{n+1} − 1.

So the density function grows at most exponentially.

For a unary language L ⊆ {0}*,

densL(n) ≤ n + 1.

– Because L ⊆ {ε, 0, 00, . . . , 0ⁿ, . . .}.

Berman and Hartmanis (1977).


### Sparsity

Sparse languages are languages with polynomially bounded density functions.

Dense languages are languages with superpolynomial density functions.


### Self-Reducibility for sat

An algorithm exhibits self-reducibility if it finds a certificate by exploiting algorithms for the decision version of the same problem.

Let φ be a boolean expression in n variables x1, x2, . . . , xn.

t ∈ {0, 1}^j is a partial truth assignment for x1, x2, . . . , xj.

φ[ t ] denotes the expression after substituting the truth values of t for x1, x2, . . . , x| t | in φ.


### An Algorithm for sat with Self-Reduction

We call the algorithm below with empty t.

1: if | t | = n then

2: return φ[ t ];

3: else

4: return φ[ t0 ] ∨ φ[ t1 ];

5: end if

The above algorithm runs in exponential time, by visiting all the partial assignments (or nodes on a depth-n binary tree).
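The self-reducing search can be sketched directly; representing φ as a Python predicate on complete assignments is an assumption of this illustration, not the slides' formulation.

```python
def sat_value(phi, t, n):
    # phi is a predicate on complete truth assignments (tuples of n bools);
    # t is the partial assignment for x1 .. x|t|.
    if len(t) == n:
        return phi(t)
    # "phi[t0] or phi[t1]": extend t by 0, then by 1.
    return sat_value(phi, t + (False,), n) or sat_value(phi, t + (True,), n)

# (x1 or x2) and not x3: satisfiable, e.g. by x1 = True, x3 = False.
phi = lambda t: (t[0] or t[1]) and not t[2]
print(sat_value(phi, (), 3))  # True
```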


### NP-Completeness and Density

Theorem 84 If a unary language U ⊆ {0}* is NP-complete, then P = NP.

Suppose there is a reduction R from sat to U .

We use R to find a truth assignment that satisfies boolean expression φ with n variables if it is satisfiable.

Specifically, we use R to prune the exponential-time exhaustive search on p. 686.

The trick is to keep the already discovered results φ[ t ] in a table H.

Berman (1978).


1: if | t | = n then

2: return φ[ t ];

3: else

4: if (R(φ[ t ]), v) is in table H then

5: return v;

6: else

7: if φ[ t0 ] = “satisfiable” or φ[ t1 ] = “satisfiable” then

8: Insert (R(φ[ t ]), “satisfiable”) into H;

9: return “satisfiable”;

10: else

11: Insert (R(φ[ t ]), “unsatisfiable”) into H;

12: return “unsatisfiable”;

13: end if

14: end if

15: end if
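A runnable sketch of this pruned search follows. Since the reduction R is only assumed to exist, this illustration substitutes a canonical form of the restricted formula for R(φ[ t ]): any key satisfying "equal keys imply equal satisfiability" keeps the pruning sound, while the unary range of the real R is what bounds the table size in the proof below.

```python
def restrict(cnf, t):
    # Substitute the partial assignment t (values of x1 .. x|t|) into a CNF
    # given as a frozenset of clauses, each a frozenset of DIMACS literals.
    out = set()
    for clause in cnf:
        rest = set()
        satisfied = False
        for lit in clause:
            var = abs(lit)
            if var <= len(t):
                if (lit > 0) == t[var - 1]:
                    satisfied = True
                    break
            else:
                rest.add(lit)
        if satisfied:
            continue
        if not rest:
            return None               # an empty clause: no extension works
        out.add(frozenset(rest))
    return frozenset(out)

def sat_memo(cnf, n, t=(), H=None):
    # The pruned search: the canonical restricted CNF plays the role of
    # R(phi[t]) -- equal keys guarantee equal satisfiability.
    if H is None:
        H = {}
    key = restrict(cnf, t)
    if key is None:
        return False
    if key in H:
        return H[key]
    if len(t) == n:
        ans = True                    # every clause already satisfied
    else:
        ans = (sat_memo(cnf, n, t + (False,), H)
               or sat_memo(cnf, n, t + (True,), H))
    H[key] = ans
    return ans

phi = {frozenset(c) for c in [(1, 2), (-1, 3), (-2, -3)]}
print(sat_memo(phi, 3))  # True (e.g. x1 = x3 = True, x2 = False)
```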


### The Proof (continued)

Since R is a reduction, R(φ[ t ]) = R(φ[ t′ ]) implies that φ[ t ] and φ[ t′ ] are both satisfiable or both unsatisfiable.

R(φ[ t ]) has polynomial length ≤ p(n) because R runs in log space.

As R maps to unary numbers of length at most p(n), there are only polynomially many possible values of R(φ[ t ]).

How many nodes of the complete binary tree (of invocations/truth assignments) need to be visited?

If that number is a polynomial, the overall algorithm runs in polynomial time and we are done.


### The Proof (continued)

A search of the table takes time O(p(n)) in the random access memory model.

The running time is O(M p(n)), where M is the total number of invocations of the algorithm.

The invocations of the algorithm form a binary tree of depth at most n.


### The Proof (continued)

There is a set T = {t1, t2, . . .} of invocations (i.e., partial truth assignments) such that:

1. |T | ≥ (M − 1)/(2n).

2. All invocations in T are recursive (nonleaves).

3. None of the elements of T is a prefix of another.


[Figure: a tree of recursive invocations; the two marked nodes h and j form T = { h, j }.]


### The Proof (continued)

All invocations t ∈ T have different R(φ[ t ]) values.

None of h, j ∈ T is a prefix of the other.

– The invocation of one started after the invocation of the other had terminated.

– If they had the same value, the one that was invoked second would have looked it up, and therefore would not be recursive, a contradiction.

The existence of T implies that there are at least (M − 1)/(2n) different R(φ[ t ]) values in the table.


### The Proof (concluded)

We already know that there are at most p(n) such values.

Hence (M − 1)/(2n) ≤ p(n).

Thus M ≤ 2np(n) + 1.

The running time is therefore O(M p(n)) = O(n p²(n)).

We comment that this theorem holds for any sparse language, not just unary ones.a

Mahaney (1980).


### coNP-Completeness and Density

Theorem 85 (Fortune (1979)) If a unary language U ⊆ {0}* is coNP-complete, then P = NP.

Suppose there is a reduction R from sat complement to U .

The rest of the proof is basically identical except that, now, we want to make sure a formula is unsatisfiable.


### Exponential Circuit Complexity

Almost all boolean functions require 2ⁿ/(2n) gates to compute (generalized Theorem 14 on p. 164).

Progress of using circuit complexity to prove exponential lower bounds for NP-complete problems has been slow.

– As of January 2006, the best lower bound is 5n − o(n).a

Iwama and Morizumi (2002).


### Exponential Circuit Complexity for NP-Complete Problems

We shall prove exponential lower bounds for NP-complete problems using monotone circuits.

Monotone circuits are circuits without ¬ gates.

Note that this does not settle the P vs. NP problem or any of the conjectures on p. 545.


### The Power of Monotone Circuits

Monotone circuits can only compute monotone boolean functions.

They are powerful enough to solve a P-complete problem, monotone circuit value (p. 257).

There are NP-complete problems that are not monotone; they cannot be computed by monotone circuits at all.

There are NP-complete problems that are monotone; they can be computed by monotone circuits.

– hamiltonian path and clique.


### cliquen,k

cliquen,k is the boolean function deciding whether a graph G = (V, E) with n nodes has a clique of size k.

The input gates are the $\binom{n}{2}$ entries of the adjacency matrix of G.

Gate gij is set to true if the associated undirected edge { i, j } exists.

cliquen,k is a monotone function.

Thus it can be computed by a monotone circuit.

This does not rule out that nonmonotone circuits for cliquen,k may use fewer gates.

(41)

### Crude Circuits

One possible circuit for cliquen,k does the following.

1. For each S ⊆ V with |S| = k, there is a subcircuit with O(k2) ∧-gates testing whether S forms a clique.

2. We then take an or of the outcomes over all the $\binom{n}{k}$ subsets S1, S2, . . . , S_{\binom{n}{k}}.

This is a monotone circuit with $O(k^2 \binom{n}{k})$ gates, which is exponentially large unless k or n − k is a constant.

A crude circuit CC(X1, X2, . . . , Xm) tests whether any of X1, X2, . . . , Xm ⊆ V forms a clique.

The above-mentioned circuit is CC(S1, S2, . . . , S_{\binom{n}{k}}).
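The crude circuit's behavior can be simulated directly; the sketch below is an added illustration evaluating CC(S1, . . . , S_{C(n,k)}) on a tiny graph rather than a gate-level construction.

```python
from itertools import combinations

def crude_circuit(n, k):
    # CC(S1, ..., S_{C(n,k)}): for each k-subset S, AND together the
    # C(k,2) adjacency inputs of S; then OR all the subcircuits.
    subsets = list(combinations(range(n), k))
    def circuit(adj):                 # adj: set of frozenset({i, j}) edges
        return any(all(frozenset(e) in adj for e in combinations(S, 2))
                   for S in subsets)
    return circuit

c = crude_circuit(4, 3)
triangle = {frozenset(e) for e in [(0, 1), (1, 2), (0, 2)]}
print(c(triangle))                         # True: {0, 1, 2} is a 3-clique
print(c(triangle - {frozenset((0, 1))}))   # False
```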


### Sunflowers

Fix p ∈ Z⁺ and ℓ ∈ Z⁺.

A sunflower is a family of p sets {P1, P2, . . . , Pp}, called petals, each of cardinality at most ℓ.

All pairs of sets in the family must have the same intersection (called the core of the sunflower).


### A Sample Sunflower

{{1, 2, 3, 5}, {1, 2, 6, 9}, {0, 1, 2, 11}, {1, 2, 12, 13}, {1, 2, 8, 10}, {1, 2, 4, 7}}





### The Erdős–Rado Lemma

Lemma 86 Let Z be a family of more than M = (p − 1)^ℓ ℓ! nonempty sets, each of cardinality ℓ or less. Then Z must contain a sunflower (of size p).

Induction on ℓ.

For ℓ = 1, p different singletons form a sunflower (with an empty core).

Suppose ℓ > 1.

Consider a maximal subset D ⊆ Z of pairwise disjoint sets.

Every set in Z − D intersects some set in D.


### The Proof of the Erdős–Rado Lemma (continued)

Suppose D contains at least p sets.

D constitutes a sunflower with an empty core.

Suppose D contains fewer than p sets.

Let C be the union of all sets in D.

| C | ≤ (p − 1)ℓ, and C intersects every set in Z.

There is a d ∈ C that belongs to more than M/((p − 1)ℓ) = (p − 1)^{ℓ−1}(ℓ − 1)! sets in Z.

Consider Z′ = {Z − {d} : Z ∈ Z, d ∈ Z}.

Z′ has more than M′ = (p − 1)^{ℓ−1}(ℓ − 1)! sets.


### The Proof of the Erdős–Rado Lemma (concluded)

M′ is just M with ℓ replaced by ℓ − 1.

Z0 contains a sunflower by induction, say {P1, P2, . . . , Pp}.

– Now, {P1 ∪ {d}, P2 ∪ {d}, . . . , Pp ∪ {d}} is a sunflower in Z.


A family of more than M sets must contain a sunflower.

Plucking a sunflower entails replacing the sets in the sunflower by its core.

By repeatedly finding a sunflower and plucking it, we can reduce a family with more than M sets to a family with at most M sets.

If Z is a family of sets, the above result is denoted by pluck(Z).

Note: pluck(Z) is not unique.


### An Example of Plucking

Recall the sunflower on p. 703:

Z = {{1, 2, 3, 5}, {1, 2, 6, 9}, {0, 1, 2, 11}, {1, 2, 12, 13}, {1, 2, 8, 10}, {1, 2, 4, 7}}

Then

pluck(Z) = {{1, 2}}.
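Finding and plucking sunflowers can be sketched by brute force; this added illustration uses the example family above with p = 6 (all six sets share the pairwise intersection {1, 2}), and is not the efficient procedure the lemma's proof suggests.

```python
from itertools import combinations

def find_sunflower(family, p):
    # Brute force: any p sets whose pairwise intersections all coincide
    # form a sunflower; the common intersection is the core.
    for cand in combinations(family, p):
        cores = {a & b for a, b in combinations(cand, 2)}
        if len(cores) == 1:
            return set(cand), set(cores.pop())
    return None

def pluck(family, p):
    # Repeatedly replace the sets of a sunflower by their common core.
    fam = {frozenset(s) for s in family}
    while True:
        found = find_sunflower(fam, p)
        if found is None:
            return fam
        petals, core = found
        fam = (fam - petals) | {frozenset(core)}

Z = [{1, 2, 3, 5}, {1, 2, 6, 9}, {0, 1, 2, 11},
     {1, 2, 12, 13}, {1, 2, 8, 10}, {1, 2, 4, 7}]
print(pluck(Z, 6))  # {frozenset({1, 2})}
```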


### Razborov’s Theorem

Theorem 87 (Razborov (1985)) There is a constant c such that, for large enough n, all monotone circuits for cliquen,k with k = n^{1/4} have size at least n^{c n^{1/8}}.

We shall approximate any monotone circuit for cliquen,k by a restricted kind of crude circuit.

The approximation will proceed in steps: one step for each gate of the monotone circuit.

Each step introduces few errors (false positives and false negatives).

But the resulting crude circuit has exponentially many errors.

