Sciweavers

201 search results - page 11 / 41
» Estimating the Parallel Start-Up Overhead for Parallelizing ...
Sort
View
83
Voted
SC
1991
ACM
15 years 3 months ago
Interprocedural transformations for parallel code generation
We present a new approach that enables compiler optimization of procedure calls and loop nests containing procedure calls. We introduce two interprocedural transformationsthat mov...
Mary W. Hall, Ken Kennedy, Kathryn S. McKinley
DATE
2005
IEEE
125views Hardware» more  DATE 2005»
15 years 5 months ago
Lightweight Multitasking Support for Embedded Systems using the Phantom Serializing Compiler
Embedded software continues to play an ever increasing role in the design of complex embedded applications. In part, the elevel of abstraction provided by a high-level programming...
André C. Nácul, Tony Givargis
EUROPAR
2004
Springer
15 years 5 months ago
Exploiting Spatial Store Locality Through Permission Caching in Software DSMs
Abstract. Fine-grained software-based distributed shared memory (SWDSM) systems typically maintain coherence with in-line checking code at load and store operations to shared memor...
Håkan Zeffer, Zoran Radovic, Oskar Grenholm,...
110
Voted
IWMM
2010
Springer
118views Hardware» more  IWMM 2010»
15 years 4 months ago
Speculative parallelization using state separation and multiple value prediction
With the availability of chip multiprocessor (CMP) and simultaneous multithreading (SMT) machines, extracting thread level parallelism from a sequential program has become crucial...
Chen Tian, Min Feng, Rajiv Gupta
94
Voted
ISLPED
2006
ACM
105views Hardware» more  ISLPED 2006»
15 years 5 months ago
Reducing power through compiler-directed barrier synchronization elimination
Interprocessor synchronization, while extremely important for ensuring execution correctness, can be very costly in terms of both power and performance overheads. Unfortunately, m...
Mahmut T. Kandemir, Seung Woo Son