Computing Reviews, the leading online review service for computing literature.

Vitruvius+: an area-efficient RISC-V decoupled vector coprocessor for high performance computing applications
Minervini F., Palomar O., Unsal O., Reggiani E., Quiroga J., Marimon J., Rojas C., Figueras R. ACM Transactions on Architecture and Code Optimization 20(2): 1-25, 2023. Type: Article

Date Reviewed: Aug 2 2023

Vector processors had their heyday in the 1980s, before classical supercomputers were mostly replaced by multiprocessors. Today, however, vector processors are experiencing a renaissance: their efficient exploitation of data-level parallelism has the potential for superior energy efficiency, which is the top priority in the pursuit of exascale performance. Toward this goal, this paper introduces the Vitruvius+ engine, a vector coprocessor that “implements the RISC-V vector extension (RVV) 0.7.1 and can be easily connected to a scalar [RISC-V] core using the Open Vector Interface standard.”

The core of the paper describes four notable features of Vitruvius+: (1) the “out-of-order chaining” of memory-to-arithmetic instructions allows the arrival of groups of vector elements in the vector register file to overlap with their further processing in the pipelined functional units; (2) “fast moves” replace the execution of vector-vector move operations with renaming, such that multiple logical registers can be associated with the same physical register; (3) “switched ring reconfiguration” optimizes data shifts between the eight vector processor “lanes” by reverting their ring connections; and (4) the execution of “vector reduction” instructions is enhanced by utilizing the eight lanes for tree-structured parallel reductions.

The Vitruvius+ design is experimentally evaluated in great detail by logic gate synthesis, demonstrating a higher peak efficiency than other vector processors. The paper is very well written and systematically structured, which enables the reader to process the material step by step, understand the rationale of the design decisions, and get an idea of the potential of the architecture. In its next generation, Vitruvius+ will support the latest version, RVV 1.0, for which the main challenges and their solutions are sketched.
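
To give a feel for the tree-structured parallel reduction mentioned above, the following is a minimal software sketch in Python. It is not taken from the paper and does not model the Vitruvius+ hardware: the interleaved assignment of elements to lanes, the function names (`lane_partials`, `tree_reduce`, `vector_reduce`), and the step counting are all assumptions made for illustration. Only the lane count of eight comes from the review.

```python
from operator import add

NUM_LANES = 8  # the review mentions eight lanes; everything else here is assumed


def lane_partials(vector, op=add, num_lanes=NUM_LANES):
    """Reduce the elements held by each lane to one partial result.

    Assumption: element i lives in lane i % num_lanes (interleaved layout).
    Lanes that hold no elements contribute nothing.
    """
    partials = []
    for lane in range(num_lanes):
        elems = vector[lane::num_lanes]
        if not elems:
            continue
        acc = elems[0]
        for x in elems[1:]:
            acc = op(acc, x)
        partials.append(acc)
    return partials


def tree_reduce(values, op=add):
    """Combine values pairwise, level by level.

    Returns (result, levels), where levels is the number of pairwise
    combining steps, i.e. the depth of the reduction tree.
    """
    if not values:
        raise ValueError("cannot reduce an empty sequence")
    level = list(values)
    levels = 0
    while len(level) > 1:
        # Pair up neighbours; an unpaired last value is carried upward.
        nxt = [op(level[i], level[i + 1]) for i in range(0, len(level) - 1, 2)]
        if len(level) % 2:
            nxt.append(level[-1])
        level = nxt
        levels += 1
    return level[0], levels


def vector_reduce(vector, op=add):
    """Per-lane partial reduction followed by a tree across the lanes."""
    return tree_reduce(lane_partials(vector, op), op)


if __name__ == "__main__":
    total, depth = vector_reduce(list(range(1, 65)))
    print(total, depth)
```

With eight lane partials, the tree needs three combining levels (8 to 4 to 2 to 1) instead of seven sequential steps, which is the general appeal of a tree-shaped reduction. The sketch assumes an associative operator such as integer addition or `max`; with floating-point addition the result can depend on the combining order.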

Reviewer: Wolfgang Schreiner	Review #: CR147625 (2309-0121)

Efficiency (G.4 ... )

Computing Equipment Management (K.6.2 ... )

Processors (B.4.1 ... )

Vector Display Devices (I.3.1 ... )

Performance (D.4.8 )

Processors (D.3.4 )

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud®