Sciweavers

458 search results - page 17 / 92
» Potential Performance Improvement of Collective Operations i...
CLUSTER
2007
IEEE
15 years 4 months ago
Efficient asynchronous memory copy operations on multi-core systems and I/OAT
Bulk memory copies incur large overheads such as CPU stalling (i.e., no overlap of computation with memory copy operation), small register-size data movement, cache pollution, etc...
Karthikeyan Vaidyanathan, Lei Chai, Wei Huang, Dha...
PPOPP
2003
ACM
15 years 5 months ago
Improving server software support for simultaneous multithreaded processors
Simultaneous multithreading (SMT) represents a fundamental shift in processor capability. SMT's ability to execute multiple threads simultaneously within a single CPU offers ...
Luke McDowell, Susan J. Eggers, Steven D. Gribble
CLUSTER
2009
IEEE
15 years 7 months ago
Combining I/O operations for multiple array variables in parallel netCDF
Parallel netCDF (PnetCDF) is a popular library used in many scientific applications to store scientific datasets. It provides high-performance parallel I/O while maintaining ...
Kui Gao, Wei-keng Liao, Alok N. Choudhary, Robert ...
EPIA
2007
Springer
15 years 6 months ago
Improving Evolutionary Algorithms with Scouting
The goal of an Evolutionary Algorithm (EA) is to find the optimal solution to a given problem by evolving a set of initial potential solutions. When the problem is multi-modal, an ...
Konstantinos Bousmalis, Gillian M. Hayes, Jeffrey ...
ICPP
2003
IEEE
15 years 5 months ago
High Performance and Reliable NIC-Based Multicast over Myrinet/GM-2
Multicast is an important collective operation for parallel programs. Some Network Interface Cards (NICs), such as Myrinet, have programmable processors that can be programmed to ...
Weikuan Yu, Darius Buntinas, Dhabaleswar K. Panda