ECRTS
2007
IEEE

Thermal Faults Modeling Using a RC Model with an Application to Web Farms

Today’s CPUs consume a significant amount of power and generate a large amount of heat, requiring an active cooling system to support reliable operation. When the cooling system fails, these CPUs can reduce their clock speed to prevent damage from overheating. Unfortunately, when these CPUs are used in a real-time system, clock control based on frequency throttling can cause missed deadlines. In this paper, we first develop a system-wide thermal model that can account for various thermal fault types, such as failure of a CPU fan, faults in the case fan, and air-conditioning malfunctions. We then validate the thermal model through experimentation and measurements on AMD Linux boxes. Our soft real-time, power-aware load-distribution algorithm for data centers incorporates the thermal model to minimize the number of missed deadlines caused by thermal faults. We implemented the algorithm in a webserver farm simulator to test the efficacy of thermal-aware load-b...
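
To make the idea of an RC thermal model concrete, here is a minimal sketch of a generic single-node lumped model, in which a thermal resistance R (K/W) and a thermal capacitance C (J/K) relate dissipated power to temperature via C·dT/dt = P − (T − T_ambient)/R. This is a textbook formulation, not the paper's system-wide model. The function names, the choice to represent a fan failure as a larger R, and all numeric values are assumptions made for illustration only.

```python
import math


def rc_temperature(t, t_start, t_ambient, power, r_thermal, c_thermal):
    """Temperature (deg C) of a single-node RC thermal model after t seconds.

    Closed-form solution of C*dT/dt = P - (T - T_ambient)/R for constant P.
    """
    t_steady = t_ambient + power * r_thermal  # steady-state temperature
    tau = r_thermal * c_thermal               # thermal time constant (s)
    return t_steady + (t_start - t_steady) * math.exp(-t / tau)


def time_to_threshold(t_start, t_ambient, power, r_thermal, c_thermal, t_limit):
    """Seconds until the node reaches t_limit, or None if it never does."""
    t_steady = t_ambient + power * r_thermal
    if t_start >= t_limit:
        return 0.0
    if t_steady <= t_limit:
        return None  # the temperature settles below the limit
    tau = r_thermal * c_thermal
    return tau * math.log((t_steady - t_start) / (t_steady - t_limit))


# Illustrative (assumed) numbers: a fan failure is represented here by the
# thermal resistance rising from 0.5 K/W to 1.0 K/W at constant 60 W load.
healthy = time_to_threshold(55.0, 25.0, 60.0, 0.5, 100.0, 70.0)
faulty = time_to_threshold(55.0, 25.0, 60.0, 1.0, 100.0, 70.0)
```

Under these assumed values the healthy configuration never reaches the 70 °C limit, while the faulty one does after a finite delay. A thermal-aware load distributor could use that kind of delay as the window in which to shift load before throttling sets in.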
Added 02 Jun 2010
Updated 02 Jun 2010
Type Conference
Year 2007
Where ECRTS
Authors Alexandre P. Ferreira, Daniel Mossé, Jae C. Oh