Sciweavers

90 search results - page 2 / 18
» System log pre-processing to improve failure prediction
Sort
View
SRDS
2006
IEEE
13 years 10 months ago
Call Availability Prediction in a Telecommunication System: A Data Driven Empirical Approach
Availability prediction in a telecommunication system plays a crucial role in its management, either by alerting the operator to potential failures or by proactively initiating pr...
Günther A. Hoffmann, Miroslaw Malek
IPPS
2006
IEEE
13 years 11 months ago
Evaluating cooperative checkpointing for supercomputing systems
Cooperative checkpointing, in which the system dynamically skips checkpoints requested by applications at runtime, can exploit system-level information to improve performance and ...
Adam J. Oliner, Ramendra K. Sahoo
ICPP
2007
IEEE
13 years 11 months ago
Fault-Driven Re-Scheduling For Improving System-level Fault Resilience
The productivity of HPC system is determined not only by their performance, but also by their reliability. The conventional method to limit the impact of failures is checkpointing...
Yawei Li, Prashasta Gujrati, Zhiling Lan, Xian-He ...
IPPS
2006
IEEE
13 years 11 months ago
Predicting failures of computer systems: a case study for a telecommunication system
The goal of online failure prediction is to forecast imminent failures while the system is running. This paper compares Similar Events Prediction (SEP) with two other well-known t...
Felix Salfner, M. Schieschke, Miroslaw Malek
IPPS
2005
IEEE
13 years 10 months ago
Proactive Fault Handling for System Availability Enhancement
Proactive fault handling combines prevention and repair actions with failure prediction techniques. We extend the standard availability formula by five key measures: (1) precisio...
Felix Salfner, Miroslaw Malek