Sciweavers

TDSC
2010

Using Underutilized CPU Resources to Enhance Its Reliability

13 years 2 months ago
Using Underutilized CPU Resources to Enhance Its Reliability
—Soft errors (or Transient faults) are temporary faults that arise in a circuit due to a variety of internal noise and external sources such as cosmic particle hits. Though soft errors still occur infrequently, they are rapidly becoming a major impediment to processor reliability. This is due primarily to processor scaling characteristics. In the past, systems designed to tolerate such faults utilized costly customized solutions, entailing the use of replicated hardware components to detect and recover from microprocessor faults. As the feature size keeps shrinking and with the proliferation of multiprocessor on die in all segments of computer-based systems, the capability to detect and recover from faults is also desired for commodity hardware. For such systems, however, performance and power constitute the main drivers, so the traditional solutions prove inadequate and new approaches are required. We introduce two independent and complementary microarchitecture-level techniques: Do...
Avi Timor, Avi Mendelson, Yitzhak Birk, Neeraj Sur
Added 31 Jan 2011
Updated 31 Jan 2011
Type Journal
Year 2010
Where TDSC
Authors Avi Timor, Avi Mendelson, Yitzhak Birk, Neeraj Suri
Comments (0)