Sciweavers

115
Voted
PPOPP
2015
ACM
10 years 2 months ago
GPU-SM: shared memory multi-GPU programming
Discrete GPUs in modern multi-GPU systems can transparently access each other’s memories through the PCIe interconnect. Future systems will improve this capability by including ...
Javier Cabezas, Marc Jordà, Isaac Gelado, N...
133
Voted
PPOPP
2015
ACM
10 years 2 months ago
Barrier elision for production parallel programs
Large scientific code bases are often composed of several layers of runtime libraries, implemented in multiple programming languages. In such situation, programmers often choose ...
Milind Chabbi, Wim Lavrijsen, Wibe de Jong, Koushi...
122
Voted
PPOPP
2015
ACM
10 years 2 months ago
The SprayList: a scalable relaxed priority queue
High-performance concurrent priority queues are essential for applications such as task scheduling and discrete event simulation. Unfortunately, even the best performing implement...
Dan Alistarh, Justin Kopinsky, Jerry Li, Nir Shavi...
124
Voted
PPOPP
2015
ACM
10 years 2 months ago
A performance study of Java garbage collectors on multicore architectures
In the last few years, managed runtime environments such as the Java Virtual Machine (JVM) are increasingly used on large-scale multicore servers. The garbage collector (GC) repre...
Maria Carpen Amarie, Patrick Marlier, Pascal Felbe...