Computing Reviews, the leading online review service for computing literature.

Search

Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies
Girbal S., Vasilache N., Bastoul C., Cohen A., Parello D., Sigler M., Temam O. International Journal of Parallel Programming34 (3):261-317,2006.Type:Article

Date Reviewed: Feb 6 2007

The research presented in this paper is important and fruitful. However, it is not very accessible due to its length (56 pages) and presentation. As the increasing complexity of hardware architectures moves them away from the usual programming languages, the importance of the optimizing parts of compilers grows. In order to achieve only a fraction of the peak performance theoretically possible on a recent 64-bit architecture, the compiler must do extremely complicated work. Worse, this work should be tailored to the program to be compiled. This is true for the specific optimizations to be done and for their specific order. According to the authors, this is because optimizations occur on a syntactic level. They propose a new program representation, called polyhedral, which is claimed to circumvent the limitations of the syntactic representations. They consider the classical loop transformations for automatic parallelization and locality enhancement, and generalize them in their new framework. They show how complex compositions of these optimizations can be necessary in order to achieve the performance possible on the architecture. They also show how these compositions can be easily implemented using their framework. An application of these ideas on a specific SPEC benchmark leads to very convincing results: compared to the peak performance attainable on the best available compiler, their tool achieves a 32 to 38 percent speedup. Compared to the base SPEC performance numbers, they achieve a 51 to 92 percent speedup for Athlon XP or 64 architectures. However, one should note that these results are obtained using what they call a semi-automatic optimization: in fact, the sequence of transformations is specifically aimed at the chosen benchmark. Thus, although this research already results in distributable GPL software, additional efforts are clearly needed. People interested in this line of work should certainly read this paper.

Reviewer: O. Lecarme	Review #: CR133894

Compilers (D.3.4 ... )

Operational Semantics (F.3.2 ... )

Optimization (D.3.4 ... )

Program Transformation (I.2.2 ... )

Automatic Programming (I.2.2 )

Processors (D.3.4 )

Would you recommend this review?

yes

Other reviews under "Compilers":	Date

An architecture for combinator graph reduction Philip John J., Academic Press Prof., Inc., San Diego, CA, 1990. Type: Book (9780124192409)	Feb 1 1992

Crafting a compiler with C Fischer C., Richard J. J., Benjamin-Cummings Publ. Co., Inc., Redwood City, CA, 1991. Type: Book (9780805321661)	Feb 1 1992

A methodology and notation for compiler front end design Brown C., Paul W. J. Software--Practice & Experience 14(4): 335-346, 1984. Type: Article	Jun 1 1985

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy