Computing Reviews
Enabling efficient multithreaded MPI communication through a library-based implementation of MPI endpoints
Sridharan S., Dinan J., Kalamkar D. SC 2014 (Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, New Orleans, LA, Nov 16-21, 2014), 487-498. 2014. Type: Proceedings
Date Reviewed: Feb 19 2015

This paper presents an implementation of the new endpoint proposal in the message passing interface (MPI) standard. The objective of the proposal is to allow threads to efficiently access network resources and communicate. The paper proposes EP-lib, a library-based implementation that can be used by any MPI library to support the endpoint proposal. The library's central idea is to use additional processes (proxies) that intercept endpoint-based communication, access the network efficiently, and perform the transfer. Using an Ohio State University (OSU) micro-benchmark as well as a lattice quantum chromodynamics (QCD) application kernel and a fast Fourier transform (FFT) communication benchmark, the authors evaluate the impact of the new design on the performance of the hybrid MPI+OpenMP programming model. The evaluation also shows that the overhead of EP-lib is a negligible constant value, mainly because of an efficient implementation that maps user-space data to the proxy space.
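To make the proxy idea concrete, here is a toy sketch of the general pattern: application threads, each acting as an endpoint, hand their messages to a separate proxy that performs the transfer on their behalf. It is not EP-lib's design or API. The names `Proxy`, `send`, `recv`, and `run_ring` are hypothetical, the proxy is a Python thread rather than a separate process, and in-memory queues stand in for the network and for the user-space-to-proxy mapping.

```python
import queue
import threading


class Proxy:
    """Toy stand-in for a proxy: it alone touches the 'network' (the mailboxes)."""

    def __init__(self, num_endpoints):
        # Requests posted by endpoint threads; assumption: one shared request queue.
        self.requests = queue.Queue()
        # One mailbox per endpoint, standing in for the receiving side of the network.
        self.mailboxes = [queue.Queue() for _ in range(num_endpoints)]
        self.worker = threading.Thread(target=self._serve)
        self.worker.start()

    def _serve(self):
        # Drain requests and perform each transfer until a shutdown marker arrives.
        while True:
            request = self.requests.get()
            if request is None:
                break
            src, dst, payload = request
            self.mailboxes[dst].put((src, payload))

    def send(self, src, dst, payload):
        # The endpoint thread only enqueues; the proxy does the actual transfer.
        self.requests.put((src, dst, payload))

    def recv(self, endpoint):
        # Block until a message for this endpoint has been delivered.
        return self.mailboxes[endpoint].get()

    def shutdown(self):
        self.requests.put(None)
        self.worker.join()


def run_ring(num_endpoints=4):
    """Each endpoint thread sends to its right neighbor and receives from its left."""
    proxy = Proxy(num_endpoints)
    received = [None] * num_endpoints

    def endpoint_thread(rank):
        proxy.send(rank, (rank + 1) % num_endpoints, f"hello from {rank}")
        received[rank] = proxy.recv(rank)

    threads = [
        threading.Thread(target=endpoint_thread, args=(rank,))
        for rank in range(num_endpoints)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    proxy.shutdown()
    return received
```

Calling `run_ring(4)` returns, for each endpoint, the sender's rank and the payload it received. In this sketch the endpoint threads never share a communication structure with one another; they only post requests to the proxy, which is the property the proposal is after.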

The paper is well written and flows well. The authors discuss their designs with regard to the different MPI operations and their semantics. The paper’s contributions and comparisons to related work are highlighted.

I have a few concerns that the paper does not discuss in sufficient depth:

(1) As jobs are automatically changed to surrogate jobs (with more processes), how is the number of processes computed?
(2) Can the number of processes change dynamically during execution if the number of OpenMP threads varies between the different sections?
(3) What exactly is the mapping between the number of threads and proxies? If the proxies are multithreaded, how do they efficiently access the network without sharing structures?
Reviewer:  Khaled Hamidouche Review #: CR143197 (1506-0480)
Parallel Programming (D.1.3 ... )
Message Sending (D.4.4 ... )
Other reviews under "Parallel Programming": Date
How to write parallel programs: a first course
Carriero N. (ed), Gelernter D. (ed), MIT Press, Cambridge, MA, 1990. Type: Book (9780262031714)
Jul 1 1992
Parallel computer systems
Koskela R., Simmons M., ACM Press, New York, NY, 1990. Type: Book (9780201509373)
May 1 1992
Parallel functional languages and compilers
Szymanski B. (ed), ACM Press, New York, NY, 1991. Type: Book (9780201522433)
Sep 1 1993
