Sciweavers

26 search results - page 1 / 6
» Generating SIMD Vectorized Permutations
Sort
View
89
Voted
LCTRTS
2005
Springer
15 years 3 months ago
Generation of permutations for SIMD processors
Short vector (SIMD) instructions are useful in signal processing, multimedia, and scientific applications. They offer higher performance, lower energy consumption, and better res...
Alexei Kudriavtsev, Peter M. Kogge
96
Voted
CC
2008
Springer
15 years 6 days ago
Generating SIMD Vectorized Permutations
Abstract. This paper introduces a method to generate efficient vectorized implementations of small stride permutations using only vector load and vector shuffle instructions. These...
Franz Franchetti, Markus Püschel
96
Voted
PLDI
2006
ACM
15 years 4 months ago
Optimizing data permutations for SIMD devices
The widespread presence of SIMD devices in today’s microprocessors has made compiler techniques for these devices tremendously important. One of the most important and difficul...
Gang Ren, Peng Wu, David A. Padua
79
Voted
ASAP
2003
IEEE
99views Hardware» more  ASAP 2003»
15 years 1 months ago
Using Group Theory to Specify Application Specific Interconnection Networks for SIMD DSPs
We introduce another view of group theory in the field of interconnection networks. With this approach it is possible to specify application specific network topologies for permut...
Thorsten Dräger, Gerhard Fettweis
97
Voted
APPT
2009
Springer
15 years 4 months ago
Performance Improvement of Multimedia Kernels by Alleviating Overhead Instructions on SIMD Devices
SIMD extension is one of the most common and effective technique to exploit data-level parallelism in today’s processor designs. However, the performance of SIMD architectures i...
Asadollah Shahbahrami, Ben H. H. Juurlink