Sciweavers

5523 search results - page 194 / 1105
» Improving application performance with hardware data structu...
Sort
View
119
Voted
ISCA
1997
IEEE
113views Hardware» more  ISCA 1997»
15 years 8 months ago
Effects of Communication Latency, Overhead, and Bandwidth in a Cluster Architecture
This work provides a systematic study of the impact of communication performance on parallelapplications in a high performance network of workstations. We develop an experimental ...
Richard P. Martin, Amin Vahdat, David E. Culler, T...
128
Voted
ISCA
2008
IEEE
148views Hardware» more  ISCA 2008»
15 years 10 months ago
Atomic Vector Operations on Chip Multiprocessors
The current trend is for processors to deliver dramatic improvements in parallel performance while only modestly improving serial performance. Parallel performance is harvested th...
Sanjeev Kumar, Daehyun Kim, Mikhail Smelyanskiy, Y...
106
Voted
ICPR
2008
IEEE
15 years 10 months ago
Tree-structure lossless coding using intervector angle
In this paper, we propose tree structure lossless coding, by which compression data are arranged in a tree structure. Current compression methods show improved performance by prod...
Jun Rokui
133
Voted
SC
2004
ACM
15 years 9 months ago
The Potential of Computation Regrouping for Improving Locality
Improving program locality has become increasingly important on modern computer systems. An effective strategy is to group computations on the same data so that once the data are ...
Chen Ding, Maksim Orlovich
120
Voted
SC
1995
ACM
15 years 7 months ago
Index Array Flattening Through Program Transformation
This paper presents techniques for compiling loops with complex, indirect array accesses into loops whose array references have at most one level of indirection. The transformatio...
Raja Das, Paul Havlak, Joel H. Saltz, Ken Kennedy