Sciweavers

ALGORITHMICA
2002

Testing and Spot-Checking of Data Streams

13 years 4 months ago
Testing and Spot-Checking of Data Streams
Abstract. We consider the tasks of testing and spot-checking for data streams. These testers and spotcheckers are potentially useful in real-time or near real-time applications that process huge data sets. Crucial aspects of the computational model include the space complexity of the testers and spot-checkers (ideally much lower than the size of the input stream) and the number of passes that the tester or spot-checker must make over the input stream (ideally one, because the original stream may be too large to store for a second pass). A sampling-tester [GGR] for a property P samples some (but usually not all) of its input and, with high probability, outputs PASS if the input has property P and FAIL if the input is far from having P, for an appropriate sense of "far." A streaming-tester for a property P of one or more input streams takes as input one or more data streams and, with high probability, outputs PASS if the streams have property P and FAIL if the streams are far f...
Joan Feigenbaum, Sampath Kannan, Martin Strauss, M
Added 16 Dec 2010
Updated 16 Dec 2010
Type Journal
Year 2002
Where ALGORITHMICA
Authors Joan Feigenbaum, Sampath Kannan, Martin Strauss, Mahesh Viswanathan
Comments (0)