Part II While HeapSort (as we saw in Studio 6) is more amenable than MergeSort to an in-place implementation, MergeSort has its own advantages. For this problem, suppose you want to perform MergeSort on a really huge array A. The array is so big that it doesn't fit in your computer's memory and has to be stored in the cloud. More specifically, assume that our computer has enough memory to hold 36 elements, for some constant b, but A has size n much greater than b. We can call read (X, i, B) to read a chunk of b elements from an array X (in the cloud) starting at index i into a local array B. Similarly, we can call write (C, X, i) to write a chunk of b elements stored in local array C to a cloud array X starting at index i. Here's a proposed (incomplete!) implementation of the merge operation that merges cloud arrays X and Y into cloud array Z. The code uses local arrays A, B, and C, each of size b, to cache X, Y, and Z. For simplicity, we assume that the input arrays X and Y have sizes a multiple of b, and that reading past the end of either X or Y returns values ∞ as in the studio. "mod" is the integer modulo operator (% in Java). MERGE(X, Y, Z) i +0 j+0 k +0 READ(X, 0, A) READ(Y, 0, B) while A[i mod b] <∞ or B[j mod b] < ∞ do u ← A[i mod b] v← B[j mod b] C[k mod b] = min(u, v) if u

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
Question

10, 11, 12

Part II
While HeapSort (as we saw in Studio 6) is more amenable than MergeSort to an in-place implementation,
MergeSort has its own advantages. For this problem, suppose you want to perform MergeSort on a really
huge array A. The array is so big that it doesn't fit in your computer's memory and has to be stored in the
cloud.
More specifically, assume that our computer has enough memory to hold 36 elements, for some constant
b, but A has size n much greater than b. We can call read (X, i, B) to read a chunk of b elements from an
array X (in the cloud) starting at index i into a local array B. Similarly, we can call write (C, X, i) to
write a chunk of 6 elements stored in local array C to a cloud array X starting at index i.
Here's a proposed (incomplete!) implementation of the merge operation that merges cloud arrays X and
Y into cloud array Z. The code uses local arrays A, B, and C, each of size b, to cache X, Y, and Z. For
simplicity, we assume that the input arrays X and Y have sizes a multiple of b, and that reading past the
end of either X or Y returns values ∞o as in the studio. "mod" is the integer modulo operator (% in Java).
MERGE(X, Y, Z)
i +0
j40
k 0
READ(X, 0, A)
READ(Y, 0, B)
while A[i mod b] <∞ or B[j mod b] <∞ do
u ← A[i mod b]
v← Bj mod b]
C[k mod b] = min(u, v)
if u < v
BLOCK1
else
BLOCK2
k++
if k mod b = 0
WRITE(C, Z, k-b)
10. Supply blocks of pseudocode to replace the lines "BLOCK1" and "BLOCK2" to complete the imple-
mentation of the merge operation. Consider which parts of the regular merge algorithm are missing,
and when/how to fetch more data from the cloud.
11. Exactly how many times must the merge function call each of read and write to merge two arrays
of size n/2 into an array of size n, assuming that n/2 is divisible by b? (Hint: Beware off-by-1 errors
when thinking about how many elements must be examined!)
12. What is the total number of cloud read and write operations performed by MergeSort to sort an array
A of size n stored in the cloud? Give an asymptotic answer in terms of n, the number of elements in
A, and b, the chunk size. How does this cost compare to the asymptotic cost of merge-sorting an array
of size n held entirely in your computer's memory?
13. Why might MergeSort be preferable to HeapSort in the cloud sorting model, where the cost is based
on the number of chunk reads and writes? If there is an asymptotic difference between MergeSort and
HeapSort in the cloud, include it in your argument.
Transcribed Image Text:Part II While HeapSort (as we saw in Studio 6) is more amenable than MergeSort to an in-place implementation, MergeSort has its own advantages. For this problem, suppose you want to perform MergeSort on a really huge array A. The array is so big that it doesn't fit in your computer's memory and has to be stored in the cloud. More specifically, assume that our computer has enough memory to hold 36 elements, for some constant b, but A has size n much greater than b. We can call read (X, i, B) to read a chunk of b elements from an array X (in the cloud) starting at index i into a local array B. Similarly, we can call write (C, X, i) to write a chunk of 6 elements stored in local array C to a cloud array X starting at index i. Here's a proposed (incomplete!) implementation of the merge operation that merges cloud arrays X and Y into cloud array Z. The code uses local arrays A, B, and C, each of size b, to cache X, Y, and Z. For simplicity, we assume that the input arrays X and Y have sizes a multiple of b, and that reading past the end of either X or Y returns values ∞o as in the studio. "mod" is the integer modulo operator (% in Java). MERGE(X, Y, Z) i +0 j40 k 0 READ(X, 0, A) READ(Y, 0, B) while A[i mod b] <∞ or B[j mod b] <∞ do u ← A[i mod b] v← Bj mod b] C[k mod b] = min(u, v) if u < v BLOCK1 else BLOCK2 k++ if k mod b = 0 WRITE(C, Z, k-b) 10. Supply blocks of pseudocode to replace the lines "BLOCK1" and "BLOCK2" to complete the imple- mentation of the merge operation. Consider which parts of the regular merge algorithm are missing, and when/how to fetch more data from the cloud. 11. Exactly how many times must the merge function call each of read and write to merge two arrays of size n/2 into an array of size n, assuming that n/2 is divisible by b? (Hint: Beware off-by-1 errors when thinking about how many elements must be examined!) 12. What is the total number of cloud read and write operations performed by MergeSort to sort an array A of size n stored in the cloud? Give an asymptotic answer in terms of n, the number of elements in A, and b, the chunk size. How does this cost compare to the asymptotic cost of merge-sorting an array of size n held entirely in your computer's memory? 13. Why might MergeSort be preferable to HeapSort in the cloud sorting model, where the cost is based on the number of chunk reads and writes? If there is an asymptotic difference between MergeSort and HeapSort in the cloud, include it in your argument.
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 4 steps

Blurred answer
Knowledge Booster
Arrays
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Similar questions
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education