Key issues to address in autonomic job recovery for cluster computing are recognizing job failure; understanding the failure sufficiently to know if and how to restart the job; an...
Charles Earl, Emilio Remolina, Jim Ong, John Brown
—The scale, heterogeneity and dynamism of Grid applications and environments require Grid applications to be self-managing or autonomic. This paper presents the Accord autonomic ...
Hua Liu, Viraj Bhat, Manish Parashar, Scott Klasky
Autonomic distributed management enables for deploying self-directed monitoring and control tasks that track dynamic network problems such as performance degradation and security t...
This paper introduces the Patia Autonomic webserver, which has been designed to be self-monitoring and adaptive to not only improve webserver performance but robustness in terms o...