Sciweavers

3656 search results - page 687 / 732
» Parallel programming in Split-C
Sort
View
193
Voted
TOG
2012
230views Communications» more  TOG 2012»
13 years 5 months ago
Decoupling algorithms from schedules for easy optimization of image processing pipelines
Using existing programming tools, writing high-performance image processing code requires sacrificing readability, portability, and modularity. We argue that this is a consequenc...
Jonathan Ragan-Kelley, Andrew Adams, Sylvain Paris...
121
Voted
ASPLOS
1998
ACM
15 years 6 months ago
Accelerating Multi-Media Processing by Implementing Memoing in Multiplication and Division Units
This paper proposes a technique that enables performing multi-cycle (multiplication, division, square-root ...) computations in a single cycle. The technique is based on the notio...
Daniel Citron, Dror G. Feitelson, Larry Rudolph
PLDI
1996
ACM
15 years 6 months ago
A Reduced Multipipeline Machine Description that Preserves Scheduling Constraints
High performance compilers increasingly rely on accurate modeling of the machine resources to efficiently exploit the instruction level parallelism of an application. In this pape...
Alexandre E. Eichenberger, Edward S. Davidson
127
Voted
ASPLOS
1992
ACM
15 years 6 months ago
Access Normalization: Loop Restructuring for NUMA Compilers
: In scalable parallel machines, processors can make local memory accesses much faster than they can make remote memory accesses. In addition, when a number of remote accesses must...
Wei Li, Keshav Pingali
145
Voted
CAV
2010
Springer
239views Hardware» more  CAV 2010»
15 years 6 months ago
Model Checking of Linearizability of Concurrent List Implementations
Abstract. Concurrent data structures with fine-grained synchronization are notoriously difficult to implement correctly. The difficulty of reasoning about these implementations do...
Pavol Cerný, Arjun Radhakrishna, Damien Zuf...