Sciweavers

11 search results - page 3 / 3
» A high-performance, low-power linear algebra core
Sort
View
PPOPP
2010
ACM
14 years 2 months ago
Scaling LAPACK panel operations using parallel cache assignment
In LAPACK many matrix operations are cast as block algorithms which iteratively process a panel using an unblocked algorithm and then update a remainder matrix using the high perf...
Anthony M. Castaldo, R. Clint Whaley