FAST
2007

//TRACE: Parallel Trace Replay with Approximate Causal Events

//TRACE is a new approach for extracting and replaying traces of parallel applications to recreate their I/O behavior. Its tracing engine automatically discovers inter-node data dependencies and inter-I/O compute times for each node (process) in an application. This information is reflected in per-node annotated I/O traces. Such annotation allows a parallel replayer to closely mimic the behavior of a traced application across a variety of storage systems. When compared to other replay mechanisms, //TRACE offers significant gains in replay accuracy. Overall, the average replay error for the parallel applications evaluated in this paper is below 6%.
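The idea of replaying per-node annotated traces can be illustrated with a small simulation. The sketch below is not the //TRACE implementation. It is a minimal model under stated assumptions: each trace record carries a compute time that precedes the I/O and a list of I/Os on other nodes that must complete first, and an I/O is issued as soon as both conditions hold. The names TraceEvent, replay, and service_time are hypothetical and chosen only for this illustration.

```python
from dataclasses import dataclass, field


@dataclass
class TraceEvent:
    """One annotated I/O record (hypothetical format, not the paper's)."""
    io_id: str
    compute_s: float  # assumed: compute time between the previous I/O and this one
    depends_on: list = field(default_factory=list)  # io_ids that must finish first


def replay(traces, service_time):
    """Replay annotated traces in virtual time.

    traces: dict mapping node name -> ordered list of TraceEvent
    service_time: callable(io_id) -> seconds the target storage system
                  takes to serve that I/O (stands in for a real device)
    Returns a dict mapping io_id -> (issue_time, finish_time).
    """
    finish = {}                               # io_id -> finish time
    schedule = {}                             # io_id -> (issue, finish)
    cursor = {node: 0 for node in traces}     # next event index per node
    clock = {node: 0.0 for node in traces}    # finish time of last I/O per node
    remaining = sum(len(events) for events in traces.values())

    while remaining:
        progressed = False
        for node, events in traces.items():
            while cursor[node] < len(events):
                ev = events[cursor[node]]
                # A node stalls until every I/O it depends on has completed.
                if any(dep not in finish for dep in ev.depends_on):
                    break
                ready = clock[node] + ev.compute_s
                issue = max([ready] + [finish[dep] for dep in ev.depends_on])
                done = issue + service_time(ev.io_id)
                schedule[ev.io_id] = (issue, done)
                finish[ev.io_id] = done
                clock[node] = done
                cursor[node] += 1
                remaining -= 1
                progressed = True
        if not progressed:
            raise ValueError("cyclic or unsatisfiable dependencies in traces")
    return schedule
```

Replaying the same traces with a different service_time function models a different storage system. In this model the compute times and dependencies stay fixed while the issue times shift, which is the property that lets an annotated trace be replayed against storage systems other than the one it was captured on.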
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2007
Where FAST
Authors Michael P. Mesnier, Matthew Wachs, Raja R. Sambasivan, Julio López, James Hendricks, Gregory R. Ganger, David R. O'Hallaron