Sciweavers

474 search results - page 86 / 95
» Parallel Data Flow Analysis for OpenMP Programs
Sort
View
EUROPAR
2001
Springer
15 years 1 months ago
Performance of High-Accuracy PDE Solvers on a Self-Optimizing NUMA Architecture
High-accuracy PDE solvers use multi-dimensional fast Fourier transforms. The FFTs exhibits a static and structured memory access pattern which results in a large amount of communic...
Sverker Holmgren, Dan Wallin
ESCIENCE
2006
IEEE
15 years 3 months ago
A Middleware Framework for Maximum Likelihood Evaluation over Dynamic Grids
We have designed a maximum likelihood fitter using the actor model to distribute the computation over a heterogeneous network. The prototype implementation uses the SALSA program...
Wei-Jen Wang, Kaoutar El Maghraoui, John Cummings,...
ICPPW
2002
IEEE
15 years 2 months ago
Near-Optimal Loop Tiling by Means of Cache Miss Equations and Genetic Algorithms
The effectiveness of the memory hierarchy is critical for the performance of current processors. The performance of the memory hierarchy can be improved by means of program transf...
Jaume Abella, Antonio González, Josep Llosa...
SOSP
1989
ACM
14 years 10 months ago
Performance of Firefly RPC
In this paper, we report on the performance of the remote procedure call implementation for the Firefly multiprocessor and analyze the implementation to account precisely for all ...
Michael D. Schroeder, Michael Burrows
DSD
2010
IEEE
161views Hardware» more  DSD 2010»
14 years 9 months ago
Design of Trace-Based Split Array Caches for Embedded Applications
—Since many embedded systems execute a predefined set of programs, tuning system components to application programs and data is the approach chosen by many design techniques to o...
Alice M. Tokarnia, Marina Tachibana