Sciweavers

» Challenging the Mean Time to Failure: Measuring Dependabilit...
TKDE
1998
Dependability and Performance Measures for the Database Practitioner
We estimate the availability, reliability, and mean transaction time (response time) for repairable database configurations, centralized or distributed, in which each service co...
Toby J. Teorey, Wee Teck Ng
FAST
2007
Disk Failures in the Real World: What Does an MTTF of 1,000,000 Hours Mean to You?
Component failure in large-scale IT installations is becoming an ever larger problem as the number of components in a single cluster approaches a million. In this paper, we presen...
Bianca Schroeder, Garth A. Gibson
SIGMETRICS
1998
ACM
Internet service performance failure detection
The increasing complexity of computer networks and our increasing dependence on them means enforcing reliability requirements is both more challenging and more critical. The expan...
Amy R. Ward, Peter W. Glynn, Kathy J. Richardson
DSN
2006
IEEE
A large-scale study of failures in high-performance computing systems
Designing highly dependable systems requires a good understanding of failure characteristics. Unfortunately, little raw data on failures in large IT installations is publicly avai...
Bianca Schroeder, Garth A. Gibson