Sciweavers

QSIC
2007
IEEE

A Scriptable, Statistical Oracle for a Metadata Extraction System

13 years 10 months ago
A Scriptable, Statistical Oracle for a Metadata Extraction System
An oracle is described for dynamic validation of an application (metadata extraction from scanned documents) where a moderate failure rate is acceptable provided that instances of failures during operation can be identified. The oracle combines a variety of deterministic tests and statistical tests based upon characteristics of the document collection on which the system operates. Because this system must adapt to a variety of document collections with different characteristics, a scripting language is developed that binds combinations of tests to the metadata fields expected in a given document collection. The suitability of the oracle is demonstrated by an experiment measuring its ability to mimic human judgments as to which of several alternate outputs for the same document would be preferred.
Kurt Maly, Steven J. Zeil, Mohammad Zubair, Ashraf
Added 04 Jun 2010
Updated 04 Jun 2010
Type Conference
Year 2007
Where QSIC
Authors Kurt Maly, Steven J. Zeil, Mohammad Zubair, Ashraf Amrou, Ali Aazhar, Naveen Ratkal
Comments (0)