A scalable, robust network for parallel computing

13 years 4 months ago
A scalable, robust network for parallel computing
CX, a network-based computational exchange, is presented. The system’s design integrates variations of ideas from other researchers, such as work stealing, non-blocking tasks, eager scheduling, and space-based coordination. The objectoriented API is simple, compact, and cleanly separates application logic from the logic that supports interprocess communication and fault tolerance. Computations, of course, run to completion in the presence of computational hosts that join and leave the ongoing computation. Such hosts, or producers, use task caching and prefetching to overlap computation with interprocessor communication. To break a potential task server bottleneck, a network of task servers is presented. Even though task servers are envisioned as reliable, the self-organizing, scalable network of n servers, described as a sibling-connected fat tree, tolerates a sequence of n − 1 server failures. Tasks are distributed throughout the server network via a simple “diffusion” proce...
Peter R. Cappello, Dimitros Mourloukos
Added 30 Jul 2010
Updated 30 Jul 2010
Type Conference
Year 2001
Where JAVA
Authors Peter R. Cappello, Dimitros Mourloukos
Comments (0)