Sciweavers

19 search results - page 1 / 4
» Automatic Tuning Matrix Multiplication Performance on Graphi...
Sort
View
EUROPAR
2005
Springer
13 years 10 months ago
Automatic Tuning of PDGEMM Towards Optimal Performance
Sophisticated parallel matrix multiplication algorithms like PDGEMM exhibit a complex structure and can be controlled by a large set of parameters including blocking factors and bl...
Sascha Hunold, Thomas Rauber
SAIG
2000
Springer
13 years 8 months ago
Code Generators for Automatic Tuning of Numerical Kernels: Experiences with FFTW
Achieving peak performance in important numerical kernels such as dense matrix multiply or sparse-matrix vector multiplication usually requires extensive, machine-dependent tuning ...
Rich Vuduc, James Demmel
PARA
2004
Springer
13 years 10 months ago
A Family of High-Performance Matrix Multiplication Algorithms
During the last half-decade, a number of research efforts have centered around developing software for generating automatically tuned matrix multiplication kernels. These include ...
John A. Gunnels, Fred G. Gustavson, Greg Henry, Ro...
ICCS
2001
Springer
13 years 9 months ago
Statistical Models for Automatic Performance Tuning
Achieving peak performance from library subroutines usually requires extensive, machine-dependent tuning by hand. Automatic tuning systems have emerged in response, and they typic...
Rich Vuduc, James Demmel, Jeff Bilmes