Sciweavers

931 search results - page 187 / 187
» Compiling for vector-thread architectures
ICFP
2012
ACM
11 years 7 months ago
Nested data-parallelism on the GPU
Graphics processing units (GPUs) provide both memory bandwidth and arithmetic performance far greater than that available on CPUs but, because of their Single-Instruction-Multiple...
Lars Bergstrom, John H. Reppy