Computing Reviews, the leading online review service for computing literature.

A comparison of CPUs, GPUs, FPGAs, and massively parallel processor arrays for random number generation
Thomas D., Howes L., Luk W. FPGA 2009 (Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays, Monterey, California, Feb 22-24, 2009), 63-72. 2009. Type: Proceedings

Date Reviewed: May 1 2009

Random number generators (RNGs) are widely used in Monte Carlo methods and signal processing, so high-performance RNGs have been extensively researched, both as algorithms and as hardware architectures. On the hardware side, the performance and clock frequency of single central processing units (CPUs) are still increasing, but power limitations, the memory wall, and other difficulties constrain further gains. At the same time, new architectures, such as field-programmable gate arrays (FPGAs), graphics processing units (GPUs), and multicore processors, are being adopted for high-performance applications. This paper identifies “the most appropriate RNG for generating the uniform, Gaussian, and exponential distribution, taking into account the characteristics and architecture of each device.” Because they are embarrassingly parallel, RNG algorithms are ideal candidates for these architectures. The authors show that, because of their intrinsic fine-grained parallelism and reconfigurability, FPGAs achieve an impressive 30 times the performance and 175 times the power efficiency of traditional CPUs. It should be noted, however, that RNGs are very rarely implemented alone. When an RNG is embedded in a system, overall performance is also determined by other factors, such as the performance of the other algorithms and communication overheads. The results of this paper provide useful information for evaluating RNG algorithms on different platforms and for choosing suitable platforms for systems that include RNGs.
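The caveat about embedded RNGs can be made concrete with Amdahl's law. The sketch below is an illustration only, not something taken from the paper: the function name `overall_speedup` and the runtime fractions are hypothetical, and the model ignores the communication overheads mentioned above, which would reduce the gain further. Only the 30 times figure comes from the reported FPGA result.

```python
def overall_speedup(rng_fraction: float, rng_speedup: float) -> float:
    """Amdahl's law: whole-system speedup when only the RNG part is accelerated.

    rng_fraction: share of the original runtime spent in the RNG (0.0 to 1.0).
    rng_speedup:  factor by which the RNG alone becomes faster.
    """
    if not 0.0 <= rng_fraction <= 1.0:
        raise ValueError("rng_fraction must lie between 0 and 1")
    if rng_speedup <= 0.0:
        raise ValueError("rng_speedup must be positive")
    # The non-RNG part is unchanged; the RNG part shrinks by rng_speedup.
    return 1.0 / ((1.0 - rng_fraction) + rng_fraction / rng_speedup)


if __name__ == "__main__":
    # Hypothetical workloads: the RNG takes 10%, 50%, or 90% of the runtime.
    for fraction in (0.1, 0.5, 0.9):
        gain = overall_speedup(fraction, 30.0)
        print(f"RNG share {fraction:.0%}: overall speedup {gain:.2f}x")
```

Under these assumptions, a system that spends half of its time generating random numbers gains just under a factor of two overall, despite the 30 times faster generator.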

Reviewer: Junqing Sun	Review #: CR136772 (1007-0691)

Performance Analysis And Design Aids (B.8.2)


Other reviews under "Performance Analysis And Design Aids":	Date

Circuit design with VHDL Pedroni V., MIT Press, Cambridge, MA, 2004. Type: Book (9780262162241)	Dec 20 2004

An Algorithm for Optimally Exploiting Spatial and Temporal Locality in Upper Memory Levels Temam O. IEEE Transactions on Computers 48(2): 150-158, 1999. Type: Article	Aug 1 1999

Understanding Why Correlation Profiling Improves the Predictability of Data Cache Misses in Nonnumeric Applications Mowry T., Luk C. IEEE Transactions on Computers 49(4): 369-384, 2000. Type: Article	Jun 1 2001

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud®