Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Distribution sort with randomized cycle
Vitter J., Hutchinson D.  Discrete algorithms (Proceedings of the Twelfth Annual ACM-SIAM Symposium on Discrete Algorithms, Washington, D.C, Jan 7-9, 2001)77-86.2001.Type:Proceedings
Date Reviewed: Jul 25 2006

Efficiently managing parallel resources is the natural and, with the disappearance of Moore’s Law, possibly the only way to increase the computational power of computing systems. Since disk access is slow, this predicament has been understood for quite some time in the storage community, where large clusters of parallel independent disks are routinely used to handle ever-increasing data volumes. Adapting classical in-memory algorithms, such as sorting, to such architectures while avoiding disk access bottlenecks is difficult; this paper introduces a new out-of-core randomized sorting algorithm: randomized cyclic distribution (RCD) sort.

In RCD, the data is partitioned into several buckets, using judiciously chosen pivot values. These buckets are stored into the memory blocks of parallel disks, and the overall process is recursively applied until the dataset is small enough to be handled in memory. The mapping of buckets to blocks lies at the core of the algorithm and its efficiency; here, randomization is used to pick a cycle order of the disk numbers, and blocks are allocated in a round-robin manner accordingly. The paper studies the theoretical probabilistic properties of RCD and its variants, showing its expected input/output optimality, and its practical performance, providing experimental data supporting these claims, particularly when dealing with small numbers of disks. This disk allocation approach is a vast improvement over a straightforward static disk striping solution. This somewhat mathematical paper should be of interest to scientists and engineers interested in optimizing algorithms based on external memory.

Reviewer:  P. Jouvelot Review #: CR133106 (0707-0697)
Bookmark and Share
 
Sorting And Searching (F.2.2 ... )
 
 
Parallelism And Concurrency (F.1.2 ... )
 
 
General (G.2.0 )
 
 
General (D.4.0 )
 
 
Modes Of Computation (F.1.2 )
 
Would you recommend this review?
yes
no
Other reviews under "Sorting And Searching": Date
Binary search trees of almost optimal height
Anderson A., Icking C., Klein R., Ottmann T. (ed) Acta Informatica 28(2): 165-178, 1990. Type: Article
May 1 1992
More nearly optimal algorithms for unbounded searching, part II
Reingold E., Shen X. SIAM Journal on Computing 20(1): 184-208, 1991. Type: Article
Apr 1 1992
A simple linear-time algorithm for in situ merging
Mannila H., Ukkonen E. Information Processing Letters 18(4): 203-208, 1984. Type: Article
Feb 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy