Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Dynamic load balancing for I/O-intensive applications on clusters
Qin X., Jiang H., Manzanares A., Ruan X., Yin S. ACM Transactions on Storage5 (3):1-38,2009.Type:Article
Date Reviewed: Feb 18 2010

The domain of optimizing process execution for high-performance computing (HPC) applications is explored in this paper. These applications execute many smaller processes in parallel on large computing clusters. The challenge in this paper is to schedule the processes on the available hardware in a way that minimizes runtime. Runtime performance depends on the hardware on which the process runs and the location of the input data for that process. The scheduling algorithm’s choices include where to run each process, and whether to move a running process from one computation node to another. This approach considers not only the memory and central processing unit (CPU) requirements of each process, but also its input/output (I/O) footprint. Previous scheduling algorithms were oblivious to this aspect of process execution.

The authors correctly note that modern clusters are not homogeneous. As hardware ages, the operations team performs selective upgrades to machines, networks, and disks. A heterogeneous platform introduces further complications. On the one hand, the system wants to use the highest performance components, while, on the other hand, idle resources should be utilized so as to improve the overall throughput.

The algorithms developed herein are not particularly complicated. They define thresholds for determining placement or migration of a process. The paper continues with simulations of clusters and applications, strongly suggesting that this algorithm is significantly better than others.

This paper is appropriate for those interested in the impact of I/O on large-scale parallel applications. The specific results form a good starting point for new system design and for distributed computational platforms such as Map/Reduce, Hadoop, and Condor.

Reviewer:  Elliot Jaffe Review #: CR137737 (1006-0594)
Bookmark and Share
  Featured Reviewer  
 
Scheduling (D.4.1 ... )
 
 
Simulation (D.4.8 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Scheduling": Date
The gradient model load balancing method
Lin F., Keller R. (ed) IEEE Transactions on Software Engineering 13(1): 32-38, 1987. Type: Article
Sep 1 1987
Preemptive scheduling of a multiprocessor system with memories to minimize maximum lateness
Lai T., Sahni S. SIAM Journal on Computing 13(4): 690-704, 1984. Type: Article
Jul 1 1985
Scheduling independent tasks on uniform processors
Dobson G. SIAM Journal on Computing 13(4): 705-716, 1984. Type: Article
Apr 1 1986
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy