Free Online Productivity Tools
i2Speak
i2Symbol
i2OCR
iTex2Img
iWeb2Print
iWeb2Shot
i2Type
iPdf2Split
iPdf2Merge
i2Bopomofo
i2Arabic
i2Style
i2Image
i2PDF
iLatex2Rtf
Sci2ools

IPPS

2008

IEEE

2008

IEEE

—The performance bottleneck for many scientiﬁc applications is the cost of memory access inside linear algebra kernels. Tuning such kernels for memory efﬁciency is a complex task that degrades the productivity of computational scientists. Software libraries such as the Basic Linear Algebra Subprograms (BLAS) ameliorate this problem by providing a standard interface for which computer scientists and hardware vendors have created highly-tuned implementations. Scientiﬁc applications often require a sequence of BLAS operations, which presents further opportunities for memory optimization. However, because BLAS are tuned in isolation they do not take advantage of these opportunities. This phenomenon motivated the recent addition of several routines to the BLAS that each perform a sequence operations. Unfortunately, the exact sequence of operations needed in a given situation is highly application dependent, so many more such routines are needed. In this paper we present preliminary ...

Related Content

Added |
31 May 2010 |

Updated |
31 May 2010 |

Type |
Conference |

Year |
2008 |

Where |
IPPS |

Authors |
Jeremy G. Siek, Ian Karlin, Elizabeth R. Jessup |

Comments (0)