Sciweavers

2151 search results - page 158 / 431
» Using Document Dimensions for Enhanced Information Retrieval
Sort
View
ICDE
2009
IEEE
155views Database» more  ICDE 2009»
15 years 11 months ago
Join Optimization of Information Extraction Output: Quality Matters!
— Information extraction (IE) systems are trained to extract specific relations from text databases. Real-world applications often require that the output of multiple IE systems...
Alpa Jain, Panagiotis G. Ipeirotis, AnHai Doan, Lu...
ECIR
2007
Springer
15 years 5 months ago
Using Topic Shifts for Focussed Access to XML Repositories
Abstract. In focussed XML retrieval, a retrieval unit is an XML element that not only contains information relevant to a user query, but also is specific to the query. INEX defin...
Elham Ashoori, Mounia Lalmas
ICPR
2010
IEEE
15 years 2 months ago
The PAGE (Page Analysis and Ground-Truth Elements) Format Framework
There is a plethora of established and proposed document representation formats but none that can adequately support individual stages within an entire sequence of document image ...
Stefan Pletschacher, Apostolos Antonacopoulos
SIGIR
2006
ACM
15 years 10 months ago
Load balancing for term-distributed parallel retrieval
Large-scale web and text retrieval systems deal with amounts of data that greatly exceed the capacity of any single machine. To handle the necessary data volumes and query through...
Alistair Moffat, William Webber, Justin Zobel
CHI
2006
ACM
16 years 4 months ago
PaperSpace: a system for managing digital and paper documents
Here we present PaperSpace a computer vision based document management system that allows users to combine paper and digital documents. Using PaperSpace users can locate paper cop...
Jeff Smith, Jeremy Long, Tanya Lung, Mohd M. Anwar...