Search Sciweavers | Sciweavers

5171 search results - page 544 / 1035

» Deterministic Parallel Processing

178

click to vote

PVM
2010
Springer

176views Distributed And Parallel Com...» more PVM 2010»

Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues

15 years 3 months ago

Download www.unixer.de

Abstract. Designing and tuning parallel applications with MPI, particularly at large scale, requires understanding the performance implications of diﬀerent choices of algorithms ...

Torsten Hoefler, William Gropp, Rajeev Thakur, Jes...

claim paper

Read More »

125

click to vote

IPPS
2010
IEEE

134views Distributed And Parallel Com...» more IPPS 2010»

Optimization of linked list prefix computations on multithreaded GPUs using CUDA

15 years 2 months ago

Download www.umiacs.umd.edu

We present a number of optimization techniques to compute prefix sums on linked lists and implement them on multithreaded GPUs using CUDA. Prefix computations on linked structures ...

Zheng Wei, Joseph JáJá

claim paper

Read More »

175

click to vote

SASP
2009
IEEE

291views Hardware» more SASP 2009»

FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs

15 years 11 months ago

Download www.icims.csl.uiuc.edu

— As growing power dissipation and thermal effects disrupted the rising clock frequency trend and threatened to annul Moore’s law, the computing industry has switched its route...

Alexandros Papakonstantinou, Karthik Gururaj, John...

claim paper

Read More »

172

click to vote

ICPP
2009
IEEE

185views Distributed And Parallel Com...» more ICPP 2009»

Accelerating Checkpoint Operation by Node-Level Write Aggregation on Multicore Systems

15 years 11 months ago

Download nowlab.cse.ohio-state.edu

—Clusters and applications continue to grow in size while their mean time between failure (MTBF) is getting smaller. Checkpoint/Restart is becoming increasingly important for lar...

Xiangyong Ouyang, Karthik Gopalakrishnan, Dhabales...

claim paper

Read More »

157

click to vote

IPPS
2009
IEEE

163views Distributed And Parallel Com...» more IPPS 2009»

Accelerating HMMer on FPGAs using systolic array based architecture

15 years 11 months ago

Download www.hicomb.org

HMMer is a widely-used bioinformatics software package that uses profile HMMs (Hidden Markov Models) to model the primary structure consensus of a family of protein or nucleic aci...

Yanteng Sun, Peng Li, Guochang Gu, Yuan Wen, Yuan ...

claim paper

Read More »

« Prev « First page 544 / 1035 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers