Sciweavers

» Low Cost Cluster Architectures for Parallel and Distributed ...
ASPLOS
2009
ACM
15 years 10 months ago
QR decomposition on GPUs
QR decomposition is a computationally intensive linear algebra operation that factors a matrix A into the product of a unitary matrix Q and an upper triangular matrix R. Adaptive sys...
Andrew Kerr, Dan Campbell, Mark Richards
SPAA
2005
ACM
15 years 3 months ago
Dynamic circular work-stealing deque
The non-blocking work-stealing algorithm of Arora, Blumofe, and Plaxton (henceforth ABP work-stealing) is on its way to becoming the multiprocessor load balancing technology of ch...
David Chase, Yossi Lev
CCGRID
2005
IEEE
15 years 3 months ago
Co-reservation with the concept of virtual resources
We present an architectural framework for specifying and processing co-reservations in Grid environments. Compared to other approaches, our co-reservation framework is more genera...
Thomas Röblitz, Alexander Reinefeld
IPPS
2003
IEEE
15 years 2 months ago
Applying Aspect-Orient Programming Concepts to a Component-Based Programming Model
The execution environments for scientific applications have evolved significantly over the years. Vector and parallel architectures have provided significantly faste...
Thomas Eidson, Jack Dongarra, Victor Eijkhout
PVM
2005
Springer
15 years 3 months ago
Implementing Byte-Range Locks Using MPI One-Sided Communication
We present an algorithm for implementing byte-range locks using MPI passive-target one-sided communication. This algorithm is useful in any scenario in which multiple processes of ...
Rajeev Thakur, Robert B. Ross, Robert Latham