Sciweavers

ACMMSP
2004
ACM
Programmer specified pointer independence
David Koes, Mihai Budiu, Girish Venkataramani
ACMMSP
2004
ACM
An empirical performance analysis of commodity memories in commodity servers
This work details a performance study of six different types of commodity memories in two commodity server nodes. A number of micro-benchmarks are used that measure low-level perf...
Darren J. Kerbyson, Michael Lang, Gene Patino...
ACMMSP
2004
ACM
Automatic blocking of QR and LU factorizations for locality
Qing Yi, Ken Kennedy, Haihang You, Keith Seymour, ...
ACMMSP
2004
ACM
Metrics and models for reordering transformations
Irregular applications frequently exhibit poor performance on contemporary computer architectures, in large part because of their inefficient use of the memory hierarchy. Runtime ...
Michelle Mills Strout, Paul D. Hovland
ACMMSP
2004
ACM
Improving trace cache hit rates using the sliding window fill mechanism and fill select table
As superscalar processors become increasingly wide, it is inevitable that the large set of instructions to be fetched every cycle will span multiple noncontiguous basic blocks. Th...
Muhammad Shaaban, Edward Mulrane
ACMMSP
2004
ACM
Instruction combining for coalescing memory accesses using global code motion
Instruction combining is an optimization that replaces a sequence of instructions with a more efficient instruction yielding the same result in fewer machine cycles. When we use it...
Motohiro Kawahito, Hideaki Komatsu, Toshio Nakatan...
ACMMSP
2004
ACM
Reuse-distance-based miss-rate prediction on a per instruction basis
Feedback-directed optimization has become an increasingly important tool in designing and building optimizing compilers. Recently, reuse-distance analysis has shown much promise i...
Changpeng Fang, Steve Carr, Soner Önder, Zhen...