The divide-and-conquer approach - Designing algorithms

Third Edition

2.3 Designing algorithms

2.3.1 The divide-and-conquer approach

Many useful algorithms are recursive in structure: to solve a given problem, they call themselves recursively one or more times to deal with closely related sub-problems. These algorithms typically follow a divide-and-conquer approach: they break the problem into several subproblems that are similar to the original prob-lem but smaller in size, solve the subprobprob-lems recursively, and then combine these solutions to create a solution to the original problem.

The divide-and-conquer paradigm involves three steps at each level of the recur-sion:

Divide the problem into a number of subproblems that are smaller instances of the same problem.

Conquer the subproblems by solving them recursively. If the subproblem sizes are small enough, however, just solve the subproblems in a straightforward manner.

Combine the solutions to the subproblems into the solution for the original prob-lem.

The merge sort algorithm closely follows the divide-and-conquer paradigm. In-tuitively, it operates as follows.

Divide: Divide the n-element sequence to be sorted into two subsequences of n=2 elements each.

Conquer: Sort the two subsequences recursively using merge sort.

Combine: Merge the two sorted subsequences to produce the sorted answer.

The recursion “bottoms out” when the sequence to be sorted has length 1, in which case there is no work to be done, since every sequence of length 1 is already in sorted order.

The key operation of the merge sort algorithm is the merging of two sorted sequences in the “combine” step. We merge by calling an auxiliary procedure MERGE.A; p; q; r/, where A is an array and p, q, and r are indices into the array such that p q < r. The procedure assumes that the subarrays AŒp : : q and AŒq C 1 : : r are in sorted order. It merges them to form a single sorted subarray that replaces the current subarray AŒp : : r.

Our MERGE procedure takes time ‚.n/, where n D r p C 1 is the total number of elements being merged, and it works as follows. Returning to our card-playing motif, suppose we have two piles of cards face up on a table. Each pile is sorted, with the smallest cards on top. We wish to merge the two piles into a single sorted output pile, which is to be face down on the table. Our basic step consists of choosing the smaller of the two cards on top of the face-up piles, removing it from its pile (which exposes a new top card), and placing this card face down onto

the output pile. We repeat this step until one input pile is empty, at which time we just take the remaining input pile and place it face down onto the output pile.

Computationally, each basic step takes constant time, since we are comparing just the two top cards. Since we perform at most n basic steps, merging takes ‚.n/

time.

The following pseudocode implements the above idea, but with an additional twist that avoids having to check whether either pile is empty in each basic step.

We place on the bottom of each pile a sentinel card, which contains a special value that we use to simplify our code. Here, we use 1 as the sentinel value, so that whenever a card with 1 is exposed, it cannot be the smaller card unless both piles have their sentinel cards exposed. But once that happens, all the nonsentinel cards have already been placed onto the output pile. Since we know in advance that exactly r p C 1 cards will be placed onto the output pile, we can stop once we have performed that many basic steps.

MERGE.A; p; q; r/

1 n1 D q p C 1 2 n₂ D r q

3 let LŒ1 : : n1C 1 and RŒ1 : : n2C 1 be new arrays 4 fori D 1 to n₁

5 LŒi D AŒp C i 1

6 forj D 1 to n2

7 RŒj D AŒq C j 8 LŒn1C 1 D 1 9 RŒn₂C 1 D 1 10 i D 1

11 j D 1

12 fork D p to r 13 ifLŒi RŒj

14 AŒk D LŒi

15 i D i C 1

16 elseAŒk D RŒj

17 j D j C 1

In detail, the MERGEprocedure works as follows. Line 1 computes the length n1

of the subarray AŒp : : q, and line 2 computes the length n2 of the subarray AŒq C 1 : : r. We create arrays L and R (“left” and “right”), of lengths n1C 1 and n2 C 1, respectively, in line 3; the extra position in each array will hold the sentinel. The for loop of lines 4–5 copies the subarray AŒp : : q into LŒ1 : : n1, and the for loop of lines 6–7 copies the subarray AŒq C 1 : : r into RŒ1 : : n2.

Lines 8–9 put the sentinels at the ends of the arrays L and R. Lines 10–17,

illus-A

Figure 2.3 The operation of lines 10–17 in the call MERGE.A; 9; 12; 16/, when the subarray AŒ9 : : 16 contains the sequence h2; 4; 5; 7; 1; 2; 3; 6i. After copying and inserting sentinels, the array L contains h2; 4; 5; 7; 1i, and the array R contains h1; 2; 3; 6; 1i. Lightly shaded positions in A contain their ﬁnal values, and lightly shaded positions in L and R contain values that have yet to be copied back into A. Taken together, the lightly shaded positions always comprise the values originally in AŒ9 : : 16, along with the two sentinels. Heavily shaded positions in A contain values that will be copied over, and heavily shaded positions in L and R contain values that have already been copied back into A. (a)–(h) The arrays A, L, and R, and their respective indices k, i , and j prior to each iteration of the loop of lines 12–17.

trated in Figure 2.3, perform the r p C 1 basic steps by maintaining the following loop invariant:

At the start of each iteration of the for loop of lines 12–17, the subarray AŒp : : k 1 contains the k p smallest elements of LŒ1 : : n₁ C 1 and RŒ1 : : n2 C 1, in sorted order. Moreover, LŒi and RŒj are the smallest elements of their arrays that have not been copied back into A.

We must show that this loop invariant holds prior to the ﬁrst iteration of the for loop of lines 12–17, that each iteration of the loop maintains the invariant, and that the invariant provides a useful property to show correctness when the loop terminates.

Initialization: Prior to the ﬁrst iteration of the loop, we have k D p, so that the subarray AŒp : : k 1 is empty. This empty subarray contains the k p D 0 smallest elements of L and R, and since i D j D 1, both LŒi and RŒj are the smallest elements of their arrays that have not been copied back into A.

Figure 2.3, continued (i) The arrays and indices at termination. At this point, the subarray in AŒ9 : : 16 is sorted, and the two sentinels in L and R are the only two elements in these arrays that have not been copied into A.

Maintenance: To see that each iteration maintains the loop invariant, let us ﬁrst suppose that LŒi RŒj . Then LŒi is the smallest element not yet copied back into A. Because AŒp : : k 1 contains the k p smallest elements, after line 14 copies LŒi into AŒk, the subarray AŒp : : k will contain the k p C 1 smallest elements. Incrementing k (in the for loop update) and i (in line 15) reestablishes the loop invariant for the next iteration. If instead LŒi > RŒj , then lines 16–17 perform the appropriate action to maintain the loop invariant.

Termination: At termination, k D r C 1. By the loop invariant, the subarray AŒp : : k 1, which is AŒp : : r, contains the k p D r p C 1 smallest elements of LŒ1 : : n1 C 1 and RŒ1 : : n₂ C 1, in sorted order. The arrays L and R together contain n1C n2C 2 D r p C 3 elements. All but the two largest have been copied back into A, and these two largest elements are the sentinels.

To see that the MERGE procedure runs in ‚.n/ time, where n D r p C 1, observe that each of lines 1–3 and 8–11 takes constant time, the for loops of lines 4–7 take ‚.n1C n2/ D ‚.n/ time,⁷ and there are n iterations of the for loop of lines 12–17, each of which takes constant time.

We can now use the MERGE procedure as a subroutine in the merge sort al-gorithm. The procedure MERGE-SORT.A; p; r/ sorts the elements in the subar-ray AŒp : : r. If p r, the subarsubar-ray has at most one element and is therefore already sorted. Otherwise, the divide step simply computes an index q that par-titions AŒp : : r into two subarrays: AŒp : : q, containing dn=2e elements, and AŒq C 1 : : r, containing bn=2c elements.⁸

MERGE-SORT.A; p; r/

1 ifp < r

2 q D b.p C r/=2c 3 MERGE-SORT.A; p; q/

4 MERGE-SORT.A; q C 1; r/

5 MERGE.A; p; q; r/

To sort the entire sequence A D hAŒ1; AŒ2; : : : ; AŒni, we make the initial call MERGE-SORT.A; 1; A:length/, where once again A:length D n. Figure 2.4 il-lustrates the operation of the procedure bottom-up when n is a power of 2. The algorithm consists of merging pairs of 1-item sequences to form sorted sequences of length 2, merging pairs of sequences of length 2 to form sorted sequences of length 4, and so on, until two sequences of length n=2 are merged to form the ﬁnal sorted sequence of length n.

在文檔中 ALGORITHMS INTRODUCTION TO (頁 51-55)