Sciweavers

IPPS
2007
IEEE

Towards A Better Understanding of Workload Dynamics on Data-Intensive Clusters and Grids

13 years 10 months ago
Towards A Better Understanding of Workload Dynamics on Data-Intensive Clusters and Grids
This paper presents a comprehensive statistical analysis of workloads collected on data-intensive clusters and Grids. The analysis is conducted at different levels, including Virtual Organization (VO) and user behavior. The aggregation procedure and scaling analysis are applied to job arrival processes, leading to the identification of several basic patterns, namely, pseudo-periodicity, long range dependence (LRD), and (multi)fractals. It is shown that statistical measures based on interarrivals are of limited usefulness and count based measures should be trusted instead when it comes to correlations. We also study workload characteristics like job run time, memory consumption, and cross correlations between these characteristics. A “bag-of-tasks” behavior is empirically proved, strongly indicating temporal locality. We argue that pseudo-periodicity, LRD, and “bag-of-tasks” behavior are important workload properties on data-intensive clusters and Grids, which are not present ...
Hui Li, Lex Wolters
Added 03 Jun 2010
Updated 03 Jun 2010
Type Conference
Year 2007
Where IPPS
Authors Hui Li, Lex Wolters
Comments (0)