Sciweavers

ICPP
2009
IEEE

Improving Resource Availability by Relaxing Network Allocation Constraints on Blue Gene/P

13 years 11 months ago
Improving Resource Availability by Relaxing Network Allocation Constraints on Blue Gene/P
— High-end computing (HEC) systems have passed the petaflop barrier and continue to move toward the next frontier of exascale computing. As companies and research institutes continue to work toward architecting these enormous systems, it is becoming increasingly clear that these systems will utilize a significant amount of shared hardware between processing units, including shared caches, memory management engines, and network infrastructure. While these systems are optimized to use all of the hardware available in a dedicated manner to achieve the best performance, in practice, the shared nature of this hardware makes scheduling applications on it difficult and wasteful. For example, while the IBM Blue Gene/P system has been designed to use a torus network for efficient communication, some of the torus links (especially those connecting different racks) are shared between multiple racks. Thus, a job running on one rack, might preclude another job from running on a second rack in...
Narayan Desai, Darius Buntinas, Daniel Buettner, P
Added 23 May 2010
Updated 23 May 2010
Type Conference
Year 2009
Where ICPP
Authors Narayan Desai, Darius Buntinas, Daniel Buettner, Pavan Balaji, Anthony Chan
Comments (0)