Information extraction (IE) systems are trained to extract specific relations from text databases. Real-world applications often require that the output of multiple IE systems...
Alpa Jain, Panagiotis G. Ipeirotis, AnHai Doan, Lu...
Abstract. In focussed XML retrieval, a retrieval unit is an XML element that not only contains information relevant to a user query, but also is specific to the query. INEX defin...
There is a plethora of established and proposed document representation formats, but none that can adequately support individual stages within an entire sequence of document image ...
Large-scale web and text retrieval systems deal with amounts of data that greatly exceed the capacity of any single machine. To handle the necessary data volumes and query through...
Here we present PaperSpace, a computer-vision-based document management system that allows users to combine paper and digital documents. Using PaperSpace, users can locate paper cop...
Jeff Smith, Jeremy Long, Tanya Lung, Mohd M. Anwar...