Sciweavers

SIGIR
2010
ACM
13 years 5 months ago
Score distribution models: assumptions, intuition, and robustness to score manipulation
Inferring the score distribution of relevant and non-relevant documents is an essential task for many IR applications (e.g. information filtering, recall-oriented IR, meta-search,...
Evangelos Kanoulas, Keshi Dai, Virgiliu Pavlu, Jav...
HM
2010
Springer
161views Optimization» more  HM 2010»
13 years 6 months ago
A Memetic Algorithm for Reconstructing Cross-Cut Shredded Text Documents
The reconstruction of destroyed paper documents became of more interest during the last years. On the one hand it (often) occurs that documents are destroyed by mistake while on th...
Christian Schauer, Matthias Prandtstetter, Gü...
ESA
2010
Springer
161views Algorithms» more  ESA 2010»
13 years 6 months ago
Top-k Ranked Document Search in General Text Databases
Text search engines return a set of k documents ranked by similarity to a query. Typically, documents and queries are drawn from natural language text, which can readily be partiti...
J. Shane Culpepper, Gonzalo Navarro, Simon J. Pugl...
DRR
2010
13 years 6 months ago
Detecting modifications in paper documents: a coding approach
This paper presents an algorithm called CIPDEC (Content Integrity of Printed Documents using Error Correction), which identifies any modifications made to a printed document. CIPD...
Yogesh Sankarasubramaniam, Badri Narayanan, Kapali...
CLEF
2010
Springer
13 years 6 months ago
Multilingual Expert Search using Linked Open Data as Interlingual Representation
Abstract. Most Information Retrieval models take documents as Bagof-Words and are thereby bound to the language of the documents. In this paper, we present an approach using Linked...
Daniel Herzig, Hristina Taneva
CLEF
2010
Springer
13 years 6 months ago
External and Intrinsic Plagiarism Detection Using a Cross-Lingual Retrieval and Segmentation System - Lab Report for PAN at CLEF
We present our hybrid system for the PAN challenge at CLEF 2010. Our system performs plagiarism detection for translated and non-translated externally as well as intrinsically plag...
Markus Muhr, Roman Kern, Mario Zechner, Michael Gr...
MVA
1990
13 years 6 months ago
Recognition of Document Structure on the Basis of Spatial and Geometric Relationships between Document Items
This paper introduces a new method to extract and classify the meaningful information from documents automatically. The basic idea in our method is to utilize the spatial and geom...
Qin Luo, Toyohide Watanabe, Yuuji Yoshida, Yasuyos...
NAACL
1994
13 years 6 months ago
Learning from Relevant Documents in Large Scale Routing Retrieval
The normal practice of selecting relevant documents for training routing queries is to either use all relevants or the 'best n' of them after a (retrieval) ranking opera...
K. L. Kwok, Laszlo Grunfeld
TREC
2000
13 years 6 months ago
The PISAB Question Answering System
The PISAB Question Answering system is based on a combination of Information Extraction and Information Retrieval techniques. Knowledge extracted from documents is modeled as a se...
Giuseppe Attardi, Cristian Burrini
WEBNET
1998
13 years 6 months ago
Categorisation by Context
Assistance in retrieving of documents on the World Wide Web is provided either by search engines, through keyword based queries, or by catalogues, which organise documents into hi...
Giuseppe Attardi, Sergio Di Marco, Davide Salvi