Sciweavers

1213 search results - page 145 / 243
» Power scalable processing using distributed arithmetic
Sort
View
IPPS
2010
IEEE
14 years 7 months ago
Overlapping computation and communication: Barrier algorithms and ConnectX-2 CORE-Direct capabilities
Abstract--This paper explores the computation and communication overlap capabilities enabled by the new CORE-Direct hardware capabilities introduced in the InfiniBand (IB) Host Cha...
Richard L. Graham, Stephen W. Poole, Pavel Shamis,...
FMCO
2008
Springer
109views Formal Methods» more  FMCO 2008»
14 years 11 months ago
Session-Based Compilation Framework for Multicore Programming
This paper outlines a general picture of our ongoing work under EU Mobius and Sensoria projects on a type-based compilation and execution framework for a class of multicore CPUs. O...
Nobuko Yoshida, Vasco Thudichum Vasconcelos, Herv&...
IPPS
2007
IEEE
15 years 4 months ago
A Flexible Resource Management Architecture for the Blue Gene/P Supercomputer
Blue Gene R /P is a massively parallel supercomputer intended as the successor to Blue Gene/L. It leverages much of the existing architecture of its predecessor to provide scalabi...
Sam Miller, Mark Megerian, Paul Allen, Tom Budnik
GECCO
2005
Springer
155views Optimization» more  GECCO 2005»
15 years 3 months ago
A developmental genetics-inspired approach to robot control
The need to build modular, scalable, and complex technology capable of adaptation, self-assembly, and self-repair has fuelled renewed interest in using approaches inspired by deve...
Sanjeev Kumar
DAC
2009
ACM
15 years 1 months ago
NUDA: a non-uniform debugging architecture and non-intrusive race detection for many-core
Traditional debug methodologies are limited in their ability to provide debugging support for many-core parallel programming. Synchronization problems or bugs due to race conditio...
Chi-Neng Wen, Shu-Hsuan Chou, Tien-Fu Chen, Alan P...