Sciweavers

20 search results - page 2 / 4
» MPC-MPI: An MPI Implementation Reducing the Overall Memory C...
Sort
View
HPCA
2012
IEEE
12 years 27 days ago
Balancing DRAM locality and parallelism in shared memory CMP systems
Modern memory systems rely on spatial locality to provide high bandwidth while minimizing memory device power and cost. The trend of increasing the number of cores that share memo...
Min Kyu Jeong, Doe Hyun Yoon, Dam Sunwoo, Mike Sul...
ISCAS
2003
IEEE
126views Hardware» more  ISCAS 2003»
13 years 10 months ago
Low power block based FIR filtering cores
— The authors present a number of complete cores which are specially tailored for the low power implementation of FIR filters executed using block processing. The paper reveals t...
Ahmet T. Erdogan, Tughrul Arslan
CONCURRENCY
2010
110views more  CONCURRENCY 2010»
13 years 5 months ago
Redesigning the message logging model for high performance
Over the past decade the number of processors in the high performance facilities went up to hundreds of thousands. As a direct consequence, while the computational power follow th...
Aurelien Bouteiller, George Bosilca, Jack Dongarra
DATE
2008
IEEE
129views Hardware» more  DATE 2008»
13 years 11 months ago
Memory Technology for Extended Large-Scale Integration in Future Electronics Applications
Extending 2-D planar topologies in integrated circuits (ICs) to a 3-D implementation has the obvious benefits of reducing the overall footprint and average interconnection length,...
Dinesh Pamunuwa
SC
2005
ACM
13 years 11 months ago
Performance Modeling and Tuning Strategies of Mixed Mode Collective Communications
On SMP clusters, mixed mode collective MPI communications, which use shared memory communications within SMP nodes and point-to-point communications between SMP nodes, are more eï...
Meng-Shiou Wu, Ricky A. Kendall, Kyle Wright, Zhao...