Sciweavers

81 search results - page 1 / 17
» Challenging the Mean Time to Failure: Measuring Dependabilit...
Sort
View
TKDE
1998
116views more  TKDE 1998»
14 years 10 months ago
Dependability and Performance Measures for the Database Practitioner
-- We estimate the availability, reliability, and mean transaction time (response time) for repairable database configurations, centralized or distributed, in which each service co...
Toby J. Teorey, Wee Teck Ng
FAST
2007
15 years 2 hour ago
Disk Failures in the Real World: What Does an MTTF of 1, 000, 000 Hours Mean to You?
Component failure in large-scale IT installations is becoming an ever larger problem as the number of components in a single cluster approaches a million. In this paper, we presen...
Bianca Schroeder, Garth A. Gibson
SIGMETRICS
1998
ACM
14 years 10 months ago
Internet service performance failure detection
The increasing complexity of computer networks and our increasing dependence on them means enforcing reliability requirements is both more challenging and more critical. The expan...
Amy R. Ward, Peter W. Glynn, Kathy J. Richardson
DSN
2006
IEEE
15 years 4 months ago
A large-scale study of failures in high-performance computing systems
Designing highly dependable systems requires a good understanding of failure characteristics. Unfortunately, little raw data on failures in large IT installations is publicly avai...
Bianca Schroeder, Garth A. Gibson