Accelerating Compute-Intensive Applications with GPUs and FPGAs
Top Cited Papers
- 1 June 2008
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Accelerators are special purpose processors designed to speed up compute-intensive sections of applications. Two extreme endpoints in the spectrum of possible accelerators are FPGAs and GPUs, which can often achieve better performance than CPUs on certain workloads. FPGAs are highly customizable, while GPUs provide massive parallel execution resources and high memory bandwidth. Applications typically exhibit vastly different performance characteristics depending on the accelerator. This is an inherent problem attributable to architectural design, middleware support and programming style of the target platform. For the best application-to-accelerator mapping, factors such as programmability, performance, programming cost and sources of overhead in the design flows must be all taken into consideration. In general, FPGAs provide the best expectation of performance, flexibility and low overhead, while GPUs tend to be easier to program and require less hardware resources. We present a performance study of three diverse applications - Gaussian elimination, data encryption standard (DES), and Needleman-Wunsch - on an FPGA, a GPU and a multicore CPU system. We perform a comparative study of application behavior on accelerators considering performance and code complexity. Based on our results, we present an application characteristic to accelerator platform mapping, which can aid developers in selecting an appropriate target architecture for their chosen application.Keywords
This publication has 11 references indexed in Scilit:
- NVIDIA Tesla: A Unified Graphics and Computing ArchitectureIEEE Micro, 2008
- Scalable Parallel Programming with CUDAQueue, 2008
- FPGA accelerator for real-time skin segmentationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Comparing FPGAs to Graphics Accelerators and the Playstation 2 Using a Unified Source DescriptionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Have GPUs made FPGAs redundant in the field of video processing?Published by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Efficient Hardware Data Mining with the Apriori Algorithm on FPGAsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Introduction to the Cell multiprocessorIBM Journal of Research and Development, 2005
- Fast computation of database operations using graphics processorsPublished by Association for Computing Machinery (ACM) ,2004
- From VHDL register transfer level to SystemC transaction level modeling: a comparative case studyPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Linear algebra operators for GPU implementation of numerical algorithmsACM Transactions on Graphics, 2003