Because of increasing hardware and software complexity, the running time of many computational science applications is now more than the mean-time-to-failure of high-performance com...
Greg Bronevetsky, Daniel Marques, Keshav Pingali, ...
Cluster-based servers can substantially increase performance when nodes cooperate to globally manage resources. However, in this paper we show that cooperation results in a substa...
Volumetric energy backprojection captures the effects of myriad physical processes including global illumination and reconstruction. We present a method to perform efficient volu...
All systems, regardless of how carefully they have been constructed, suffer failures. This paper focuses on developing a formal understanding of failure with respect to system imp...
We have developed a middleware framework for workgroup environments that can support distributed software development and a variety of other application domains requiring document...