Sciweavers

1141 search results - page 176 / 229
» Compiler-Directed Performance Model Construction for Paralle...
PODC
2011
ACM
14 years 3 months ago
From bounded to unbounded concurrency objects and back
We consider the power of objects in the unbounded concurrency shared memory model, where there is an infinite set of processes and the number of processes active concurrently may...
Yehuda Afek, Adam Morrison, Guy Wertheim
PPAM
2007
Springer
15 years 6 months ago
Cloth Simulation in the SILC Matrix Computation Framework: A Case Study
This paper presents a case study of numerical simulations in an easy-to-use matrix computation framework named Simple Interface for Library Collections (SILC), which allows users t...
Tamito Kajiyama, Akira Nukada, Reiji Suda, Hidehik...
ISCA
1993
IEEE
15 years 4 months ago
An Adaptive Cache Coherence Protocol Optimized for Migratory Sharing
Parallel programs that use critical sections and are executed on a shared-memory multiprocessor with a write-invalidate protocol result in invalidation actions that could be elimin...
Per Stenström, Mats Brorsson, Lars Sandberg
CODES
2005
IEEE
15 years 6 months ago
SOMA: a tool for synthesizing and optimizing memory accesses in ASICs
Arbitrary memory dependencies and variable latency memory systems are major obstacles to the synthesis of large-scale ASIC systems in high-level synthesis. This paper presents SOM...
Girish Venkataramani, Tiberiu Chelcea, Seth Copen ...
ASPLOS
2004
ACM
15 years 6 months ago
Low-overhead memory leak detection using adaptive statistical profiling
Sampling has been successfully used to identify performance optimization opportunities. We would like to apply similar techniques to check program correctness. Unfortunately, samp...
Matthias Hauswirth, Trishul M. Chilimbi