Sciweavers

224 search results - page 28 / 45
» A Flexible Class of Parallel Matrix Multiplication Algorithm...
Sort
View
JPDC
2006
120views more  JPDC 2006»
14 years 9 months ago
HeteroMPI: Towards a message-passing library for heterogeneous networks of computers
The paper presents Heterogeneous MPI (HeteroMPI), an extension of MPI for programming high-performance computations on heterogeneous networks of computers. It allows the applicati...
Alexey L. Lastovetsky, Ravi Reddy
ARC
2012
Springer
317views Hardware» more  ARC 2012»
13 years 5 months ago
A High Throughput FPGA-Based Implementation of the Lanczos Method for the Symmetric Extremal Eigenvalue Problem
Iterative numerical algorithms with high memory bandwidth requirements but medium-size data sets (matrix size ∼ a few 100s) are highly appropriate for FPGA acceleration. This pap...
Abid Rafique, Nachiket Kapre, George A. Constantin...
ICPADS
2006
IEEE
15 years 3 months ago
Scalable Hybrid Designs for Linear Algebra on Reconfigurable Computing Systems
—Recently, high-end reconfigurable computing systems that employ Field-Programmable Gate Arrays (FPGAs) as hardware accelerators for general-purpose processors have been built. T...
Ling Zhuo, Viktor K. Prasanna
SASP
2009
IEEE
291views Hardware» more  SASP 2009»
15 years 4 months ago
A parameterisable and scalable Smith-Waterman algorithm implementation on CUDA-compatible GPUs
—This paper describes a multi-threaded parallel design and implementation of the Smith-Waterman (SM) algorithm on compute unified device architecture (CUDA)-compatible graphic pr...
Cheng Ling, Khaled Benkrid, Tsuyoshi Hamada
SBACPAD
2007
IEEE
157views Hardware» more  SBACPAD 2007»
15 years 3 months ago
Exploring Novel Parallelization Technologies for 3-D Imaging Applications
Multi-dimensional imaging techniques involve the processing of high resolution images commonly used in medical, civil and remote-sensing applications. A barrier commonly encounter...
Diego Rivera, Dana Schaa, Micha Moffie, David R. K...