Sciweavers

2609 search results - page 341 / 522
» Optimizing for parallelism and data locality
Sort
View
IPPS
2005
IEEE
15 years 10 months ago
Performance Analysis of MPI Collective Operations
Previous studies of application usage show that the performance of collective communications are critical for high-performance computing and are often overlooked when compared to ...
Jelena Pjesivac-Grbovic, Thara Angskun, George Bos...
ICPPW
2000
IEEE
15 years 9 months ago
Active Streaming in Transport Delay Minimization
In this paper we present a technique for reducing response delay for web systems, which is based on a proactive cache scheme. It combines predictive pre-fetching and streaming to ...
Javed I. Khan
WSC
2008
15 years 7 months ago
A flexible and scalable experimentation layer
Modeling and simulation frameworks for use in different application domains, throughout the complete development process, and in different hardware environments need to be highly ...
Jan Himmelspach, Roland Ewald, Adelinde M. Uhrmach...
ICASSP
2010
IEEE
15 years 5 months ago
Simultaneous search for all modes in multilinear models
Parallel factor (PARAFAC) analysis is an extension of a low rank decomposition to higher way arrays, usually called tensors. Most of existing methods are based on an alternating l...
Petr Tichavský, Zbynek Koldovský
CIKM
2009
Springer
15 years 11 months ago
Packing the most onto your cloud
Parallel dataflow programming frameworks such as Map-Reduce are increasingly being used for large scale data analysis on computing clouds. It is therefore becoming important to a...
Ashraf Aboulnaga, Ziyu Wang, Zi Ye Zhang