Sciweavers

CLUSTER
2003
IEEE

A Performance Monitor Based on Virtual Global Time for Clusters of PCs

13 years 10 months ago
A Performance Monitor Based on Virtual Global Time for Clusters of PCs
Debugging the performance of parallel and distributed systems remains a difficult task despite the widespread use of middleware packages for automatic distribution, communication and tasking in clusters. In this paper we present a performance monitoring tool for clusters of PCs that is based on the simple concept of accounting for resource usage and on the simple idea of mapping all performance related state of hardware performance counters and operating system variables backwards to the application level. In this way a monitoring tool can explain the most relevant performance metrics at a higher level that is easily understood by the application developer. The most important metric for distributed high performance applications remains the total execution time vs. the number of compute nodes involved, since it translates into the scalability of an application. As a detailed contribution of this paper, we closely look into what is needed to reverse map the low level performance counte...
Michela Taufer, Thomas Stricker
Added 04 Jul 2010
Updated 04 Jul 2010
Type Conference
Year 2003
Where CLUSTER
Authors Michela Taufer, Thomas Stricker
Comments (0)