Sciweavers

224 search results - page 33 / 45
» A Flexible Class of Parallel Matrix Multiplication Algorithm...
MICRO
2010
IEEE
14 years 9 months ago
Improving SIMT Efficiency of Global Rendering Algorithms with Architectural Support for Dynamic Micro-Kernels
Wide Single Instruction, Multiple Thread (SIMT) architectures often require a static allocation of thread groups that are executed in lockstep throughout the entire application ker...
Michael Steffen, Joseph Zambreno
SIAMSC
2010
14 years 9 months ago
Analysis of Block Parareal Preconditioners for Parabolic Optimal Control Problems
In this paper, we describe block matrix algorithms for the iterative solution of large scale linear-quadratic optimal control problems arising from the optimal control of parabolic...
Tarek P. Mathew, Marcus Sarkis, Christian E. Schae...
ICCS
2007
Springer
15 years 5 months ago
A Combined Hardware/Software Optimization Framework for Signal Representation and Recognition
This paper describes a signal recognition system that is jointly optimized from mathematical representation, algorithm design and final implementation. The goal is to exploit sign...
Melina Demertzi, Pedro C. Diniz, Mary W. Hall, Ann...
CVPR
2012
IEEE
13 years 1 months ago
A theory of multi-layer flat refractive geometry
Flat refractive geometry corresponds to a perspective camera looking through single/multiple parallel flat refractive mediums. We show that the underlying geometry of rays corres...
Amit Agrawal, Srikumar Ramalingam, Yuichi Taguchi,...
CCGRID
2001
IEEE
15 years 2 months ago
XtremWeb: A Generic Global Computing System
Global Computing achieves high throughput computing by harvesting a very large number of unused computing resources connected to the Internet. This parallel computing model target...
Gilles Fedak, Cécile Germain, Vincent Né...