Sciweavers

1141 search results - page 12 / 229
» Compiler-Directed Performance Model Construction for Paralle...
Sort
View
PPOPP
2009
ACM
15 years 7 months ago
NePalTM: design and implementation of nested parallelism for transactional memory systems
Abstract. Transactional memory (TM) promises to simplify construction of parallel applications by allowing programmers to reason about interactions between concurrently executing c...
Haris Volos, Adam Welc, Ali-Reza Adl-Tabatabai, Ta...
ICCS
2005
Springer
15 years 6 months ago
Bulk Synchronous Parallel ML: Modular Implementation and Performance Prediction
Abstract. BSML is a library for parallel programming with the functional language Objective Caml. It is based on an extension of the λcalculus by parallel operations on a parallel...
Frédéric Loulergue, Fréd&eacu...
97
Voted
EUROPAR
2006
Springer
15 years 4 months ago
Model-Based Performance Diagnosis of Master-Worker Parallel Computations
Parallel performance tuning naturally involves a diagnosis process to locate and explain sources of program inefficiency. Proposed is an approach that exploits parallel computation...
Li Li, Allen D. Malony
88
Voted
IPPS
2007
IEEE
15 years 6 months ago
Automatic Performance Diagnosis of Parallel Computations with Compositional Models
Performance tuning involves a diagnostic process to locate and explain sources of program inefficiency. A performance diagnosis system can leverage knowledge of performance cause...
Li Li, Allen D. Malony
97
Voted
IPPS
2009
IEEE
15 years 7 months ago
Phaser accumulators: A new reduction construct for dynamic parallelism
A reduction is a computation in which a common operation, such as a sum, is to be performed across multiple pieces of data, each supplied by a separate task. We introduce phaser a...
Jun Shirako, David M. Peixotto, Vivek Sarkar, Will...