Sciweavers

242 search results - page 26 / 49
» Fast computation of general Fourier Transforms on GPUS
Sort
View
IPPS
2000
IEEE
15 years 4 months ago
A Novel Superscalar Architecture for Fast DCT Implementation
This paper presents a new superscalar architecture for fast discrete cosine transform (DCT). Comparing with the general SIMD architecture, it speeds up the DCT computation by a fac...
Zhang Yong, Min Zhang
ICS
2009
Tsinghua U.
15 years 6 months ago
Fast and scalable list ranking on the GPU
General purpose programming on the graphics processing units (GPGPU) has received a lot of attention in the parallel computing community as it promises to offer the highest perfo...
M. Suhail Rehman, Kishore Kothapalli, P. J. Naraya...
ICCV
2003
IEEE
16 years 1 months ago
Improved Fast Gauss Transform and Efficient Kernel Density Estimation
Evaluating sums of multivariate Gaussians is a common computational task in computer vision and pattern recognition, including in the general and powerful kernel density estimatio...
Changjiang Yang, Ramani Duraiswami, Nail A. Gumero...
87
Voted
CGF
2005
136views more  CGF 2005»
14 years 11 months ago
Practical CFD Simulations on Programmable Graphics Hardware using SMAC
The explosive growth in integration technology and the parallel nature of rasterization-based graphics APIs changed the panorama of consumer-level graphics: today, GPUs are cheap,...
Carlos Eduardo Scheidegger, João Luiz Dihl ...
JISE
2006
85views more  JISE 2006»
14 years 11 months ago
A Programming Methodology for Designing Block Recursive Algorithms
In this paper, we use the tensor product notation as the framework of a programming methodology for designing block recursive algorithms. We first express a computational problem ...
Min-Hsuan Fan, Chua-Huang Huang, Yeh-Ching Chung, ...