Sciweavers

83
Voted
PPOPP
2015
ACM
9 years 11 months ago
GPU-SM: shared memory multi-GPU programming
Discrete GPUs in modern multi-GPU systems can transparently access each other’s memories through the PCIe interconnect. Future systems will improve this capability by including ...
Javier Cabezas, Marc Jordà, Isaac Gelado, N...
PPOPP
2015
ACM
9 years 11 months ago
Barrier elision for production parallel programs
Large scientific code bases are often composed of several layers of runtime libraries, implemented in multiple programming languages. In such situation, programmers often choose ...
Milind Chabbi, Wim Lavrijsen, Wibe de Jong, Koushi...
89
Voted
PPOPP
2015
ACM
9 years 11 months ago
The SprayList: a scalable relaxed priority queue
High-performance concurrent priority queues are essential for applications such as task scheduling and discrete event simulation. Unfortunately, even the best performing implement...
Dan Alistarh, Justin Kopinsky, Jerry Li, Nir Shavi...
92
Voted
PPOPP
2015
ACM
9 years 11 months ago
A performance study of Java garbage collectors on multicore architectures
In the last few years, managed runtime environments such as the Java Virtual Machine (JVM) are increasingly used on large-scale multicore servers. The garbage collector (GC) repre...
Maria Carpen Amarie, Patrick Marlier, Pascal Felbe...