Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Vitruvius+: an area-efficient RISC-V decoupled vector coprocessor for high performance computing applications
Minervini F., Palomar O., Unsal O., Reggiani E., Quiroga J., Marimon J., Rojas C., Figueras R. ACM Transactions on Architecture and Code Optimization20 (2):1-25,2023.Type:Article
Date Reviewed: Aug 2 2023

Vector processors had their heyday in the 1980s, before classical supercomputers were mostly replaced by multiprocessors. Today, however, vector processors are experiencing a renaissance: their efficient exploitation of data-level parallelism has the potential for superior energy efficiency, which is the top priority in the pursuit of exascale performance. Toward this goal, this paper introduces the Vitruvius+ engine, a vector coprocessor that “implements the RISC-V vector extension (RVV) 0.7.1 and can be easily connected to a scalar [RISC-V] core using the Open Vector Interface standard.”

The core of the paper describes the four notable features of Vitruvius+:

  • The “out-of-order chaining” of memory-to-arithmetic instructions allows for overlapping the arrival of groups of vector elements in the vector register file with their further processing in the pipelined functional units;
  • “Fast moves” replace the execution of vector-vector move operations by renaming, such that multiple logical registers can be associated to the same physical register;
  • “Switched ring reconfiguration” optimizes data shifts between the eight vector processor “lanes” by reverting their ring connections; and
  • The execution of “vector reduction” instructions are enhanced by utilizing the eight lanes for tree-structured parallel reductions.

The Vitrivius+ design is experimentally evaluated in great detail by logic gate synthesis, demonstrating a higher peak efficiency than other vector processors.

The paper is very well written and systematically structured, which enables the reader to process the material step-by-step, understand the rationale of the design decisions, and get an idea of the potential of the architecture. In its next generation, Vitruvius+ will support the latest version RVV-1.0, for which the main challenges and their solutions are sketched.

Reviewer:  Wolfgang Schreiner Review #: CR147625 (2309-0121)
Bookmark and Share
  Editor Recommended
Featured Reviewer
 
 
Efficiency (G.4 ... )
 
 
Computing Equipment Management (K.6.2 ... )
 
 
Processors (B.4.1 ... )
 
 
Vector Display Devices (I.3.1 ... )
 
 
Performance (D.4.8 )
 
 
Processors (D.3.4 )
 
  more  
Would you recommend this review?
yes
no
Other reviews under "Efficiency": Date
A new algorithm for the evaluation of the incomplete gamma function on vector computers
Früchtl H., Otto P. ACM Transactions on Mathematical Software 20(4): 436-446, 1994. Type: Article
Nov 1 1995
QR-like algorithms for the nonsymmetric eigenvalue problem
Haag J., Watkins D. ACM Transactions on Mathematical Software 19(3): 407-418, 1993. Type: Article
Jun 1 1994
Inter-process communications in MVS/XA and applications for scientific and engineering information processing
Marinescu D. Software--Practice & Experience 16(5): 489-501, 1986. Type: Article
Dec 1 1986
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy