Sciweavers

332 search results - page 6 / 67
» T: integrated building blocks for parallel computing
Sort
View
78
Voted
PPOPP
2010
ACM
15 years 20 days ago
Helper locks for fork-join parallel programming
Helper locks allow programs with large parallel critical sections, called parallel regions, to execute more efficiently by enlisting processors that might otherwise be waiting on ...
Kunal Agrawal, Charles E. Leiserson, Jim Sukha
79
Voted
EUROPAR
2009
Springer
14 years 8 months ago
Parallel Variable-Length Encoding on GPGPUs
Variable-Length Encoding (VLE) is a process of reducing input data size by replacing fixed-length data words with codewords of shorter length. As VLE is one of the main building bl...
Ana Balevic
HPCA
2008
IEEE
15 years 11 months ago
Runahead Threads to improve SMT performance
In this paper, we propose Runahead Threads (RaT) as a valuable solution for both reducing resource contention and exploiting memory-level parallelism in Simultaneous Multithreaded...
Tanausú Ramírez, Alex Pajuelo, Olive...
115
Voted
ISCA
1995
IEEE
93views Hardware» more  ISCA 1995»
15 years 2 months ago
Optimizing Memory System Performance for Communication in Parallel Computers
Communicationin aparallel systemfrequently involvesmoving data from the memory of one node to the memory of another; this is the standard communication model employedin message pa...
Thomas Stricker, Thomas R. Gross
113
Voted
EUROPAR
2008
Springer
15 years 20 days ago
Load-Balancing for a Block-Based Parallel Adaptive 4D Vlasov Solver
This work is devoted to the numerical resolution of the 4D Vlasov equation using an adaptive mesh of phase space. We previously proposed a parallel algorithm designed for distribut...
Olivier Hoenen, Eric Violard