Sciweavers

458 search results - page 1 / 92
» Performance study of mapping irregular computations on GPUs
Sort
View
IPPS
2010
IEEE
13 years 2 months ago
Optimization of linked list prefix computations on multithreaded GPUs using CUDA
We present a number of optimization techniques to compute prefix sums on linked lists and implement them on multithreaded GPUs using CUDA. Prefix computations on linked structures ...
Zheng Wei, Joseph JáJá
SASP
2008
IEEE
162views Hardware» more  SASP 2008»
13 years 11 months ago
Accelerating Compute-Intensive Applications with GPUs and FPGAs
—Accelerators are special purpose processors designed to speed up compute-intensive sections of applications. Two extreme endpoints in the spectrum of possible accelerators are F...
Shuai Che, Jie Li, Jeremy W. Sheaffer, Kevin Skadr...
EGH
2009
Springer
13 years 2 months ago
Understanding the efficiency of ray traversal on GPUs
We discuss the mapping of elementary ray tracing operations-acceleration structure traversal and primitive intersection--onto wide SIMD/SIMT machines. Our focus is on NVIDIA GPUs,...
Timo Aila, Samuli Laine
HIPC
2005
Springer
13 years 10 months ago
Performance Study of LU Decomposition on the Programmable GPU
With the increasing programmability of GPUs (graphics processing units), these units are emerging as an attractive computing platform not only for traditional graphics computation ...
Fumihiko Ino, Manabu Matsui, Keigo Goda, Kenichi H...