SPLG: A Tuned Signal Processing Library for GPU Architectures
Abstract
In order to increase the efficiency of existing software many works are incorporating GPU processing. However, despite the current advances in GPU languages and tools, taking advantage of their parallel architecture is still far more complex than programming standard multi-core CPUs. Performance profiling and analysis of known applications provides a useful insight of the hardware architecture and memory hierarchy. Afterwards, this analysis can be used to identify potential bottlenecks and tune other software so it can make a more efficient usage of the available resources. In this work we implement a small signal processing library which will be used to characterize the performance of most recent NVIDIA GPU architectures. The methodology used in our signal processing library is based on a series of building blocks that enable us to easily design several well-known algorithms with little effort. The library was built paying special attention to flexibility and adaptability. In this work we also show how a generic approach can be used to easily design these GPU algorithms while obtaining competitive performance, which results specially interesting from the productivity standpoint.
Keywords:
Graphics processing units, Registers, Libraries, Transforms, Arrays, Instruction sets, Signal processing, FFT, DCT, Hartley, GPGPU, CUDA, tuned library
Published
2013-10-23
How to Cite
BLANCO, Jacobo Lobeiras; AMOR, Margarita; DOALLO, Ramón.
SPLG: A Tuned Signal Processing Library for GPU Architectures. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 25. , 2013, Porto de Galinhas/PE.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2013
.
p. 184-191.
