Improving Cache Locality for Ray Casting with CUDA

13 years 7 months ago

Download www-hagi.ist.osaka-u.ac.jp

Abstract: In this paper, we present an acceleration method for texture-based ray casting on the compute unified device architecture (CUDA) compatible graphics processing unit (GPU). Since ray casting is a memory-intensive application, our method increases the hit rate of the texture cache during rendering. To achieve this, our method dynamically selects the width and height of thread blocks (TBs) such that each warp, which is a series of 32 threads simultaneously processed on the GPU, can achieve high data locality for specific viewpoints. The objective of this selection is to allow every warp rather than every thread to access data with a small stride, because the GPU executes multiple threads at the same time. In experiments using a GeForce GTX 480 card (i.e., the latest Fermi architecture), we find that the speedup of our method ranges
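The link between TB shape and warp-level locality can be illustrated with a small sketch. It is not the selection procedure from the paper, which the abstract does not describe. It only assumes one thread per pixel and CUDA's standard linearization of thread IDs within a block (threadIdx.x + threadIdx.y * blockDim.x, with consecutive groups of 32 IDs forming a warp). The function name `warp_footprint` and the candidate shapes are hypothetical, chosen for illustration.

```python
WARP_SIZE = 32  # threads per warp on CUDA GPUs


def warp_footprint(block_w, block_h, warp_index=0):
    """Return (width, height) in pixels of the screen region covered by one warp.

    Assumes one thread per pixel and CUDA's thread ID linearization:
    tid = threadIdx.x + threadIdx.y * blockDim.x.
    """
    first = warp_index * WARP_SIZE
    last = min(first + WARP_SIZE, block_w * block_h)
    if first >= last:
        raise ValueError("warp_index is outside the thread block")
    # Recover the 2D thread coordinates of every thread in this warp.
    xs = [tid % block_w for tid in range(first, last)]
    ys = [tid // block_w for tid in range(first, last)]
    return (max(xs) - min(xs) + 1, max(ys) - min(ys) + 1)


if __name__ == "__main__":
    # Hypothetical candidate TB shapes, all with 256 threads.
    for w, h in [(32, 8), (16, 16), (8, 32), (4, 64)]:
        print((w, h), "->", warp_footprint(w, h))
```

All four shapes hold 256 threads, yet the first warp covers a 32x1, 16x2, 8x4, or 4x8 pixel region respectively. Since the rays of neighboring pixels sample the volume along a direction that depends on the viewpoint, the shape of this region determines how far apart the texture accesses of one warp fall, which is presumably why a viewpoint-dependent choice of TB width and height can affect the texture cache hit rate.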

Yuki Sugimoto, Fumihiko Ino, Kenichi Hagihara
