Sciweavers

2609 search results - page 14 / 522
» Optimizing for parallelism and data locality
Sort
View
EUROPAR
2007
Springer
15 years 11 months ago
Locality Optimized Shared-Memory Implementations of Iterated Runge-Kutta Methods
Iterated Runge-Kutta (IRK) methods are a class of explicit solution methods for initial value problems of ordinary differential equations (ODEs) which possess a considerable poten...
Matthias Korch, Thomas Rauber
VLDB
1989
ACM
155views Database» more  VLDB 1989»
15 years 9 months ago
Parallel Processing of Recursive Queries in Distributed Architectures
This paper presents a parallel algorithm for recursive query processing and shows how it can be efficiently implemented in a local computer network. The algorithm relies on an int...
Guy Hulin
LCR
2000
Springer
121views System Software» more  LCR 2000»
15 years 9 months ago
Optimizing Mutual Exclusion Synchronization in Explicitly Parallel Programs
Abstract. We present two new compiler optimizations for explicitly parallel programs based on the CSSAME form: Lock-Independent Code Motion (LICM) and Mutex Body Localization (MBL)...
Diego Novillo, Ronald C. Unrau, Jonathan Schaeffer
ICS
2003
Tsinghua U.
15 years 10 months ago
Estimating cache misses and locality using stack distances
Cache behavior modeling is an important part of modern optimizing compilers. In this paper we present a method to estimate the number of cache misses, at compile time, using a mac...
Calin Cascaval, David A. Padua
MICRO
2003
IEEE
121views Hardware» more  MICRO 2003»
15 years 10 months ago
Exploiting Value Locality in Physical Register Files
The physical register file is an important component of a dynamically-scheduled processor. Increasing the amount of parallelism places increasing demands on the physical register...
Saisanthosh Balakrishnan, Gurindar S. Sohi