Sciweavers

ISPDC
2008
IEEE
15 years 11 months ago
Scalable Dense Factorizations for Heterogeneous Computational Clusters
This paper discusses the design and the implementation of the LU factorization routines included in the Heterogeneous ScaLAPACK library, which is built on top of ScaLAPACK. These ...
Ravi Reddy Manumachu, Alexey L. Lastovetsky, Pedro...
DSD
2007
IEEE
15 years 11 months ago
A Serial Logarithmic Number System ALU
Serial arithmetic uses less hardware than parallel arithmetic. Serial floating point (FP) is slower than parallel FP. The Logarithmic Number System (LNS) simplifies operations, ...
Mark G. Arnold, Panagiotis D. Vouzis
IPPS
2005
IEEE
15 years 10 months ago
Experiences with Soft-Core Processor Design
Soft-core processors exploit the flexibility of Field Programmable Gate Arrays (FPGAs) to allow a system designer to customize the processor to the needs of a target application....
Franjo Plavec, Blair Fort, Zvonko G. Vranesic, Ste...
EUROPAR
2004
Springer
15 years 10 months ago
Understanding the Behavior and Performance of Non-blocking Communications in MPI
The behavior and performance of MPI non-blocking message passing operations are sensitive to implementation specifics as they are heavily dependent on available system level buff...
Taher Saif, Manish Parashar
ML
2007
ACM
15 years 4 months ago
Status report: the Manticore project
The Manticore project is an effort to design and implement a new functional language for parallel programming. Unlike many earlier parallel languages, Manticore is a heterogeneous...
Matthew Fluet, Nic Ford, Mike Rainey, John H. Repp...