Sciweavers

135
Voted
PPOPP
2015
ACM
10 years 1 months ago
Distributed memory code generation for mixed Irregular/Regular computations
Many applications feature a mix of irregular and regular computational structures. For example, codes using adaptive mesh refinement (AMR) typically use a collection of regular b...
Mahesh Ravishankar, Roshan Dathathri, Venmugil Ela...
132
Voted
PPOPP
2015
ACM
10 years 1 months ago
Optimization of asynchronous graph processing on GPU with hybrid coloring model
Modern GPUs have been widely used to accelerate the graph processing for complicated computational problems regarding graph theory. Many parallel graph algorithms adopt the asynch...
Xuanhua Shi, Junling Liang, Sheng Di, Bingsheng He...
126
Voted
PPOPP
2015
ACM
10 years 1 months ago
Low-overhead software transactional memory with progress guarantees and strong semantics
Software transactional memory offers an appealing alternative to locks by improving programmability, reliability, and scalability. However, existing STMs are impractical because t...
Minjia Zhang, Jipeng Huang, Man Cao, Michael D. Bo...
126
Voted
PPOPP
2015
ACM
10 years 1 months ago
Barrier elision for production parallel programs
Large scientific code bases are often composed of several layers of runtime libraries, implemented in multiple programming languages. In such situation, programmers often choose ...
Milind Chabbi, Wim Lavrijsen, Wibe de Jong, Koushi...
118
Voted
PPOPP
2015
ACM
10 years 1 months ago
Optimization for performance and energy for batched matrix computations on GPUs
As modern hardware keeps evolving, an increasingly effective approach to develop energy efficient and high-performance solvers is to design them to work on many small size indepe...
Azzam Haidar, Tingxing Dong, Piotr Luszczek, Stani...
Distributed And Parallel Computing
Top of PageReset Settings