Appending the small file to the larger one and then sorting will take about c1(N + M)lg(N + M) seconds; if N is much larger than M, then this will be about c1N lg N seconds.
Sorting the small file first and then merging the two ordered files will take about c1M lg M + c2(N + M) seconds. Again, if N is much larger than M, then this will be about c1M lg M + c2N; without knowing more about the values of M and N, we can't say whether c1M lg M or c2N will be dominant.
The ratio of the running times will therefore be about

    c1 N lg N / (c1 M lg M + c2 N),

which is about N lg N / (M lg M + N) if c1 and c2 are comparable. We may make the following table of this ratio, using the approximation lg 10^3 = 10:
M    | N = 10^3 | N = 10^6 | N = 10^9
1    |    10    |    20    |    30
4    |    10    |    20    |    30
16   |     9    |    20    |    30
64   |     7    |    20    |    30
256  |     3    |    20    |    30
1K   |          |    20    |    30
4K   |          |    19    |    30
16K  |          |    16    |    30
64K  |          |    10    |    30
256K |          |     4    |    30
1M   |          |          |    29
4M   |          |          |    28
16M  |          |          |    21
64M  |          |          |    11
256M |          |          |     4
Let us see how many times faster even the O(N lg N) sort is than insertion sort (so the merge-based algorithm would be even better). Inserting each of the M items requires a pass through about half of the large file, for a total of about c3 M N / 2 seconds, so the ratio we are interested in is

    c1 N lg N / (c3 M N / 2),

which is about 2 lg N / M if c1 and c3 are comparable.
When N = 10^3, this becomes 20/M, so insertion sort will be slowest for any M > 20. For M ≤ 20, insertion will be faster than resorting, but in this range the merge method is faster than resorting by a factor of 10, so insertion only wins when M = 1 or 2.
When N = 10^6, the ratio becomes 40/M, so insertion sort will be slowest for any M > 40. For M ≤ 40, insertion will beat resorting, but now the merge is better by a factor of 20, so again insertion is only a win for M = 1 or 2.
Finally, when N = 10^9, the ratio is 60/M; just as above, insertion is slowest for M > 60 and resorting is slowest for M ≤ 60. Merge is better than resorting by a factor of 30 when M is very small, so it will again be better than insertion for any M > 2.
It is reasonable that the break-even point between insertion and merge is at about M = 2, independent of N, since insertion makes one pass through (roughly half of) the large file for each of the M items, while merge can do the entire job in one pass (assuming the time to sort the M items is negligible).
Recall that Program 8.2 first arranges the keys in a bitonic sequence, by copying the second subfile in reverse order after the first, so that the merge can be performed without sentinels or end-of-file tests.
Suppose that the largest value in the first array is equal to both of the last two elements of the second array. When the second array is reversed, this will produce three adjacent equal elements in the middle of the combined array. Program 8.2 will move all three of these elements back into the merged array in order from left to right; this will have the effect of reversing the previous order of the two equal elements from the end of the second array. For example, let the first subfile be A C and the second be B C C, and write C1, C2, C3 for the three equal keys in their original left-to-right order, so the subfiles are A C1 and B C2 C3. Copying the second subfile in reverse order yields the bitonic array A C1 C3 C2 B, and the merge then produces A B C1 C3 C2: the two equal keys from the second subfile come out in the order C3 C2, the reverse of their original order, so the sort is not stable.
One way to fix this is to change the test in the inner loop to if (aux[j] < aux[i] || i > m); this will force the elements from the second half to be copied using j instead of i. This extra test slows down the inner loop, however, so an even better solution is to realize that we can avoid copying the problem elements in the first place: if the largest elements of the second array are (at least) as large as the largest element of the first, then they can stay right where they are and not participate in the merge. For symmetry, we can do the same with the smallest elements of the first array. Here is the code:
template <class Item>
void merge(Item a[], int l, int m, int r)
  { int i, j; static Item aux[maxN];
    // Skip elements that are already in place:
    while (r > m && a[r] >= a[m]) r--;     // largest of second subfile
    while (l <= m && a[l] <= a[m+1]) l++;  // smallest of first subfile
    for (i = m+1; i > l; i--) aux[i-1] = a[i-1];   // copy first subfile
    for (j = m; j < r; j++) aux[r+m-j] = a[j+1];   // copy second, reversed
    for (int k = l; k <= r; k++)
      if (aux[j] < aux[i]) a[k] = aux[j--];
      else a[k] = aux[i++];
  }
Note that the merge in the last step is just the one that was performed in Exercise 8.5.
The sequence will be R R P O T Y I I U Q E U. The E that was inserted at the end will be the only item remaining in the queue.
The sequence P R I O * will return R, leaving a priority queue containing P, I, and O.
The sequence R * I T * Y * will return R T Y, leaving a priority queue containing I.
Joining these two queues and removing three maximum elements will return P O I and leave a queue containing I.
The sequence Q U E * * * U * E will return U Q E U and leave a priority queue containing E.
Finally, joining these last two queues will produce a priority queue containing I and E.
In a stack, the most recently inserted item will be the first to be removed. Therefore, to implement this with a priority queue, we just need to make sure that more recently inserted elements have higher priorities than older elements. This is easy to arrange; just assign priority 1 to the first item inserted, then priority 2 to the next, etc.
In a queue, the oldest item will be the first to be removed. We may implement this as above for the stack ADT, except the priorities of successive insertions need to be smaller. For example, we could give the first item a priority of -1, then priority -2 for the second, etc.
This would be fine if all we ever wanted to do was find the maximum value; of course, if that were all we needed, then we wouldn't have to save any of the other values at all.
If we also want to be removing items, then when the current maximum value is removed, we will need to examine all of the remaining items to determine the largest remaining value; therefore, while insert and find the maximum will take constant time, remove and remove the maximum will both take linear time. This is almost the same behavior as using an unordered collection (array or linked list), except the running times of remove and find the maximum have been swapped.