Testing methods are compared in a model where program failures are detected and the software changed to eliminate them. The question considered is whether it is better to use tests that seek out failures ("debug testing") or to simulate usage and find failures along the way ("operational testing"). "Better" is measured by the delivered reliability obtained after all test failures have been eliminated. This comparison extends previous work, where the measure was the probability of detecting a failure. The theoretical treatment of the paper is probabilistic and analytical. Revealing special cases are exhibited in which each kind of testing is superior.

Keywords: Reliability, debugging, statistical testing theory
						
							
					 															
					Phyllis G. Frankl, Richard G. Hamlet, Bev Littlewo