Sciweavers

1213 search results - page 145 / 243
» Power scalable processing using distributed arithmetic
Sort
View
IPPS
2010
IEEE
14 years 11 months ago
Overlapping computation and communication: Barrier algorithms and ConnectX-2 CORE-Direct capabilities
Abstract--This paper explores the computation and communication overlap capabilities enabled by the new CORE-Direct hardware capabilities introduced in the InfiniBand (IB) Host Cha...
Richard L. Graham, Stephen W. Poole, Pavel Shamis,...
FMCO
2008
Springer
109views Formal Methods» more  FMCO 2008»
15 years 3 months ago
Session-Based Compilation Framework for Multicore Programming
This paper outlines a general picture of our ongoing work under EU Mobius and Sensoria projects on a type-based compilation and execution framework for a class of multicore CPUs. O...
Nobuko Yoshida, Vasco Thudichum Vasconcelos, Herv&...
IPPS
2007
IEEE
15 years 8 months ago
A Flexible Resource Management Architecture for the Blue Gene/P Supercomputer
Blue Gene R /P is a massively parallel supercomputer intended as the successor to Blue Gene/L. It leverages much of the existing architecture of its predecessor to provide scalabi...
Sam Miller, Mark Megerian, Paul Allen, Tom Budnik
GECCO
2005
Springer
155views Optimization» more  GECCO 2005»
15 years 7 months ago
A developmental genetics-inspired approach to robot control
The need to build modular, scalable, and complex technology capable of adaptation, self-assembly, and self-repair has fuelled renewed interest in using approaches inspired by deve...
Sanjeev Kumar
DAC
2009
ACM
15 years 5 months ago
NUDA: a non-uniform debugging architecture and non-intrusive race detection for many-core
Traditional debug methodologies are limited in their ability to provide debugging support for many-core parallel programming. Synchronization problems or bugs due to race conditio...
Chi-Neng Wen, Shu-Hsuan Chou, Tien-Fu Chen, Alan P...