Sciweavers

45 search results - page 8 / 9
» Cache Performance Optimizations for Parallel Lattice Boltzma...
Sort
View
IEEEPACT
2000
IEEE
13 years 10 months ago
The Effect of Code Reordering on Branch Prediction
Branch prediction accuracy is a very important factor for superscalarprocessor performance. The ability topredict the outcome of a branch allows the processor to effectively use a...
Alex Ramírez, Josep-Lluis Larriba-Pey, Mate...
IPPS
2005
IEEE
13 years 11 months ago
Automated Analysis of Memory Access Behavior
Abstract— We developed an automated environment to measure the memory access behavior of applications on high performance clusters. Code optimization for processor caches is cruc...
Michael Gerndt, Tianchao Li
IEEEPACT
2009
IEEE
14 years 13 days ago
Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling
—Analytical models have been used to estimate optimal values for parameters such as tile sizes in the context of loop nests. However, important algorithms such as fast Fourier tr...
Basilio B. Fraguela, Yevgen Voronenko, Markus P&uu...
MICRO
1997
IEEE
116views Hardware» more  MICRO 1997»
13 years 10 months ago
Tuning Compiler Optimizations for Simultaneous Multithreading
Compiler optimizations are often driven by specific assumptions about the underlying architecture and implementation of the target machine. For example, when targeting shared-mem...
Jack L. Lo, Susan J. Eggers, Henry M. Levy, Sujay ...
ICDE
2004
IEEE
85views Database» more  ICDE 2004»
14 years 7 months ago
GODIVA: Lightweight Data Management for Scientific Visualization Applications
Scientific visualization applications are very dataintensive, with high demands for I/O and data management. Developers of many visualization tools hesitate to use traditional DBM...
Xiaosong Ma, Marianne Winslett, John Norris, Xiang...