Search Sciweavers | Sciweavers


» Optimizing for parallelism and data locality


PARLE
1994

153views Distributed And Parallel Com...» more PARLE 1994»

Run-Time Optimization of Sparse Matrix-Vector Multiplication on SIMD Machines

15 years 9 months ago

Download cgi2.cs.rpi.edu

Sparse matrix-vector multiplication forms the heart of iterative linear solvers used widely in scientific computations (e.g., finite element methods). In such solvers, the matrix-v...

Louis H. Ziantz, Can C. Özturan, Boleslaw K. ...



LCTRTS
2010
Springer

173views System Software» more LCTRTS 2010»

Operation and data mapping for CGRAs with multi-bank memory

16 years 28 days ago

Download www2.ee.kth.se

Coarse Grain Reconfigurable Architectures (CGRAs) promise high performance at high power efficiency. They fulfil this promise by keeping the hardware extremely simple, and movi...

Yongjoo Kim, Jongeun Lee, Aviral Shrivastava, Yunh...



KES
2005
Springer

139views Information Technology» more KES 2005»

Learning Method for Automatic Acquisition of Translation Knowledge

15 years 11 months ago

Download sig.media.eng.hokudai.ac.jp

This paper presents a new learning method for automatic acquisition of translation knowledge from parallel corpora. We apply this learning method to automatic extraction of bilingu...

Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi



EUROPAR
2011
Springer

244views Distributed And Parallel Com...» more EUROPAR 2011»

Model-Driven Tile Size Selection for DOACROSS Loops on GPUs

14 years 5 months ago

Download www.cse.unsw.edu.au

DOALL loops are tiled to exploit DOALL parallelism and data locality on GPUs. In contrast, due to loop-carried dependences, DOACROSS loops must be skewed first in order to make ti...

Peng Di, Jingling Xue



IPPS
2008
IEEE

174views Distributed And Parallel Com...» more IPPS 2008»

Design and optimization of a distributed, embedded speech recognition system

16 years 15 days ago

Download www.ece.umd.edu

In this paper, we present the design and implementation of a distributed sensor network application for embedded, isolated-word, real-time speech recognition. In our system design...

Chung-Ching Shen, William Plishker, Shuvra S. Bhat...


