A Parallel Implementation of the 2D Wavelet Transform Using CUDA

15 years 10 months ago

Download ditec.um.es

There is a multicore platform that is currently concentrating an enormous attention due to its tremendous potential in terms of sustained performance: the NVIDIA Tesla boards. These cards intended for general-purpose computing on graphic processing units (GPGPUs) are used as dataparallel computing devices. They are based on the Computed Uniﬁed Device Architecture (CUDA) which is common to the latest NVIDIA GPUs. The bottom line is a multicore platform which provides an enormous potential performance beneﬁt driven by a non-traditional programming model. In this paper we try to provide some insight into the peculiarities of CUDA in order to target scientiﬁc computing by means of a speciﬁc example. In particular, we show that the parallelization of the two-dimensional fast wavelet transform for the NVIDIA Tesla C870 achieves a speedup of 20.8 for an image size of 8192x8192, when compared with the fastest host-only version implementation using OpenMP and including the data transfe...

Joaquín Franco, Gregorio Bernabé, Ju

Real-time Traffic

Multicore Platform | NVIDIA Tesla | NVIDIA Tesla Boards | Parallel Computing | PDP 2009 |

claim paper

» HighSpeed VLSI Implementation of 2D Discrete Wavelet Transform

» A SingleLoop Approach to SIMD Parallelization of 2D Wavelet Lifting

» A LowPower Pipelined Implementation of 2D Discrete Wavelet Transform

» Performance Comparison of SIMD Implementations of the Discrete Wavelet Transform

» A VLSI Architecture for a Fast Computation of the 2D Discrete Wavelet Transform

» SIMD Architectural Enhancements to Improve the Performance of the 2D Discrete Wavelet Tran...

» A new combination of 1D and 2D filter banks for effective multiresolution image representa...

» Image Segmentation Based on 2D Otsu Method with Histogram Analysis

Post Info
More Details (n/a)

Added	19 May 2010
Updated	19 May 2010
Type	Conference
Year	2009
Where	PDP
Authors	Joaquín Franco, Gregorio Bernabé, Juan Fernández, Manuel E. Acacio

Comments (0)

Sciweavers

A Parallel Implementation of the 2D Wavelet Transform Using CUDA

Multicore Platform | NVIDIA Tesla | NVIDIA Tesla Boards | Parallel Computing | PDP 2009 |

Explore & Download

Productivity Tools

Sciweavers