Search Sciweavers | Sciweavers

52 search results - page 2 / 11

» Strategies and Implementation for Translating OpenMP Code fo...

click to vote

HPCA
2002
IEEE

158views Distributed And Parallel Com...» more HPCA 2002»

CableS: Thread Control and Memory Management Extensions for Shared Virtual Memory Clusters

14 years 4 months ago

Download www.ics.forth.gr

Clusters of high-end workstations and PCs are currently used in many application domains to perform large-scale computations or as scalable servers for I/O bound tasks. Although c...

Peter Jamieson, Angelos Bilas

claim paper

Read More »

click to vote

SC
2000
ACM

166views Applied Computing» more SC 2000»

Performance of Hybrid Message-Passing and Shared-Memory Parallelism for Discrete Element Modeling

13 years 8 months ago

Download www.ukhec.ac.uk

The current trend in HPC hardware is towards clusters of shared-memory (SMP) compute nodes. For applications developers the major question is how best to program these SMP cluster...

D. S. Henty

claim paper

Read More »

click to vote

ICCS
2005
Springer

128views Applied Computing» more ICCS 2005»

Fast Expression Templates

13 years 10 months ago

Download www10.informatik.uni-erlangen.de

Abstract. Expression templates (ET) can signiﬁcantly reduce the implementation eﬀort of mathematical software. For some compilers, especially for those of supercomputers, it ca...

Jochen Härdtlein, Alexander Linke, Christoph ...

claim paper

Read More »

click to vote

FPL
2009
Springer

172views Hardware» more FPL 2009»

Performance comparison of single-precision SPICE Model-Evaluation on FPGA, GPU, Cell, and multi-core processors

13 years 9 months ago

Download ic.ese.upenn.edu

Automated code generation and performance tuning techniques for concurrent architectures such as GPUs, Cell and FPGAs can provide integer factor speedups over multi-core processor...

Nachiket Kapre, André DeHon

claim paper

Read More »

click to vote

IPPS
2009
IEEE

93views Distributed And Parallel Com...» more IPPS 2009»

Phaser accumulators: A new reduction construct for dynamic parallelism

13 years 11 months ago

Download www.cs.rice.edu

A reduction is a computation in which a common operation, such as a sum, is to be performed across multiple pieces of data, each supplied by a separate task. We introduce phaser a...

Jun Shirako, David M. Peixotto, Vivek Sarkar, Will...

claim paper

Read More »

« Prev « First page 2 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers