Sciweavers

5588 search results - page 227 / 1118
» Parallel Implementation of Bags
Sort
View
113
Voted
JPDC
2008
135views more  JPDC 2008»
15 years 3 months ago
Parallel block tridiagonalization of real symmetric matrices
Two parallel block tridiagonalization algorithms and implementations for dense real symmetric matrices are presented. Block tridiagonalization is a critical pre-processing step for...
Yihua Bai, Robert C. Ward
133
Voted
IJPP
2007
92views more  IJPP 2007»
15 years 3 months ago
Supporting Nested OpenMP Parallelism in the TAU Performance System
Nested OpenMP parallelism allows an application to spawn teams of nested threads. This hierarchical nature of thread creation and usage poses problems for performance measurement t...
Alan Morris, Allen D. Malony, Sameer Shende
TJS
2002
94views more  TJS 2002»
15 years 3 months ago
A Fully Implicit Parallel Ocean Model Using MUMPS
The formulation, implementation and performance of a new fully implicit parallel model of the ocean circulation is presented. Within this model, steady states can be traced in one ...
Jos de Kloe, Aad J. van der Steen, Hakan Öks&...
IPPS
2010
IEEE
15 years 1 months ago
An auto-tuning framework for parallel multicore stencil computations
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural resources, it has hitherto been limited to single kernel instantiations; in addi...
Shoaib Kamil, Cy Chan, Leonid Oliker, John Shalf, ...
136
Voted
JPDC
2010
133views more  JPDC 2010»
14 years 10 months ago
Lock-free parallel dynamic programming
We show a method for parallelizing top down dynamic programs in a straightforward way by a careful choice of a lock-free shared hash table implementation and randomization of the ...
Alex Stivala, Peter J. Stuckey, Maria Garcia de la...