Search Sciweavers | Sciweavers

39 search results - page 2 / 8

» Optimized Dense Matrix Multiplication on a Many-Core Archite...

EUROPAR
2006
Springer

Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences

13 years 9 months ago

Download www.capsl.udel.edu

Abstract. This paper presents a study of performance optimization of dense matrix multiplication on IBM Cyclops-64 (C64) chip architecture. Although much has been published on how t...

Ziang Hu, Juan del Cuvillo, Weirong Zhu, Guang R. ...

MST
2010

Optimal Sparse Matrix Dense Vector Multiplication in the I/O-Model

13 years 12 days ago

Download www.win.tue.nl

Michael A. Bender, Gerth Stølting Brodal, R...

EUROPAR
2007
Springer

Toward Scalable Matrix Multiply on Multithreaded Architectures

13 years 11 months ago

Download userweb.cs.utexas.edu

We show empirically that some of the issues that affected the design of linear algebra libraries for distributed memory architectures will also likely affect such libraries for s...

Bryan Marker, Field G. Van Zee, Kazushige Goto, Gr...

ICCS
2001
Springer

Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY

13 years 10 months ago

Download bebop.cs.berkeley.edu

Abstract. Sparse matrix-vector multiplication is an important computational kernel that tends to perform poorly on modern processors, largely because of its high ratio of memory op...

Eun-Jin Im, Katherine A. Yelick

PCI
2005
Springer

Tuning Blocked Array Layouts to Exploit Memory Hierarchy in SMT Architectures

13 years 11 months ago

Download www.cslab.ece.ntua.gr

Cache misses form a major bottleneck for memory-intensive applications, due to the significant latency of main memory accesses. Loop tiling, in conjunction with other program tran...

Evangelia Athanasaki, Kornilios Kourtis, Nikos Ana...
