Sciweavers

115 search results - page 17 / 23
» Fusion of Loops for Parallelism and Locality
Sort
View
SAC
2010
ACM
15 years 5 months ago
Haptic manipulation of rational parametric planar cubics using shape constraints
In this paper, we show how to deform a planar rational cubic based on a local interpolation constraint while retaining the qualitative shape of the curve. An impedance-type, paral...
Christoph Fünfzig, Philippe Thomin, Gudrun Al...
LCTRTS
2005
Springer
15 years 5 months ago
Cache aware optimization of stream programs
Effective use of the memory hierarchy is critical for achieving high performance on embedded systems. We focus on the class of streaming applications, which is increasingly preval...
Janis Sermulins, William Thies, Rodric M. Rabbah, ...
IPPS
1997
IEEE
15 years 3 months ago
The Sparse Cyclic Distribution against its Dense Counterparts
Several methods have been proposed in the literature for the distribution of data on distributed memory machines, either oriented to dense or sparse structures. Many of the real a...
Gerardo Bandera, Manuel Ujaldon, María A. T...
IPPS
2009
IEEE
15 years 6 months ago
High-order stencil computations on multicore clusters
Stencil computation (SC) is of critical importance for broad scientific and engineering applications. However, it is a challenge to optimize complex, highorder SC on emerging clus...
Liu Peng, Richard Seymour, Ken-ichi Nomura, Rajiv ...
ICS
2003
Tsinghua U.
15 years 4 months ago
Estimating cache misses and locality using stack distances
Cache behavior modeling is an important part of modern optimizing compilers. In this paper we present a method to estimate the number of cache misses, at compile time, using a mac...
Calin Cascaval, David A. Padua