Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies
Girbal S., Vasilache N., Bastoul C., Cohen A., Parello D., Sigler M., Temam O. International Journal of Parallel Programming34 (3):261-317,2006.Type:Article
Date Reviewed: Feb 6 2007

The research presented in this paper is important and fruitful. However, it is not very accessible due to its length (56 pages) and presentation.

As the increasing complexity of hardware architectures moves them away from the usual programming languages, the importance of the optimizing parts of compilers grows. In order to achieve only a fraction of the peak performance theoretically possible on a recent 64-bit architecture, the compiler must do extremely complicated work. Worse, this work should be tailored to the program to be compiled. This is true for the specific optimizations to be done and for their specific order.

According to the authors, this is because optimizations occur on a syntactic level. They propose a new program representation, called polyhedral, which is claimed to circumvent the limitations of the syntactic representations. They consider the classical loop transformations for automatic parallelization and locality enhancement, and generalize them in their new framework. They show how complex compositions of these optimizations can be necessary in order to achieve the performance possible on the architecture. They also show how these compositions can be easily implemented using their framework.

An application of these ideas on a specific SPEC benchmark leads to very convincing results: compared to the peak performance attainable on the best available compiler, their tool achieves a 32 to 38 percent speedup. Compared to the base SPEC performance numbers, they achieve a 51 to 92 percent speedup for Athlon XP or 64 architectures. However, one should note that these results are obtained using what they call a semi-automatic optimization: in fact, the sequence of transformations is specifically aimed at the chosen benchmark. Thus, although this research already results in distributable GPL software, additional efforts are clearly needed. People interested in this line of work should certainly read this paper.

Reviewer:  O. Lecarme Review #: CR133894
Bookmark and Share
  Featured Reviewer  
 
Compilers (D.3.4 ... )
 
 
Operational Semantics (F.3.2 ... )
 
 
Optimization (D.3.4 ... )
 
 
Program Transformation (I.2.2 ... )
 
 
Automatic Programming (I.2.2 )
 
 
Processors (D.3.4 )
 
  more  
Would you recommend this review?
yes
no
Other reviews under "Compilers": Date
An architecture for combinator graph reduction
Philip John J., Academic Press Prof., Inc., San Diego, CA, 1990. Type: Book (9780124192409)
Feb 1 1992
Crafting a compiler with C
Fischer C., Richard J. J., Benjamin-Cummings Publ. Co., Inc., Redwood City, CA, 1991. Type: Book (9780805321661)
Feb 1 1992
A methodology and notation for compiler front end design
Brown C., Paul W. J. Software--Practice & Experience 14(4): 335-346, 1984. Type: Article
Jun 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy