Sciweavers

420 search results - page 51 / 84
» Scalable Parallel Programming with CUDA
CORR
2011
Springer
14 years 8 months ago
An Overview of Portable Distributed Techniques
In this paper, we review several portable parallel programming paradigms for use in a distributed programming environment. The techniques reviewed here are portable. These ar...
Sanjay Bansal, Nirved Pandey
IPPS
1996
IEEE
15 years 8 months ago
An Element-Based Concurrent Partitioner for Unstructured Finite Element Meshes
A concurrent partitioner for partitioning unstructured finite element meshes on distributed memory architectures is developed. The partitioner uses an element-based partitioning st...
Hong Q. Ding, Robert D. Ferraro
PPL
2008
15 years 4 months ago
On Implementing the Farm Skeleton
Algorithmic skeletons intend to simplify parallel programming by providing a higher abstraction compared to the usual message passing. Task and data parallel skeletons can be dist...
Michael Poldner, Herbert Kuchen
IEEEPACT
2005
IEEE
15 years 10 months ago
Communication Optimizations for Fine-Grained UPC Applications
Global address space languages like UPC exhibit high performance and portability on a broad class of shared and distributed memory parallel architectures. The most scalable applic...
Wei-Yu Chen, Costin Iancu, Katherine A. Yelick
PLDI
2012
ACM
13 years 7 months ago
Speculative separation for privatization and reductions
Automatic parallelization is a promising strategy to improve application performance in the multicore era. However, common programming practices such as the reuse of data structur...
Nick P. Johnson, Hanjun Kim, Prakash Prabhu, Ayal ...