Sciweavers

102 search results - page 21 / 21
» Debugging Dynamic Distributed Programs Using Global Predicat...
Sort
View
IEEEPACT
2007
IEEE
14 years 21 days ago
Performance Portable Optimizations for Loops Containing Communication Operations
Effective use of communication networks is critical to the performance and scalability of parallel applications. Partitioned Global Address Space languages like UPC bring the pro...
Costin Iancu, Wei Chen, Katherine A. Yelick
IPPS
2010
IEEE
13 years 4 months ago
Inter-block GPU communication via fast barrier synchronization
The graphics processing unit (GPU) has evolved from a fixedfunction processor with programmable stages to a programmable processor with many fixed-function components that deliver...
Shucai Xiao, Wu-chun Feng