Sciweavers

109
Voted
CLEF
2011
Springer
14 years 7 days ago
Overview of the 3rd International Competition on Plagiarism Detection
Abstract This paper overviews eleven plagiarism detectors that have been de
Martin Potthast, Andreas Eiselt, Alberto Barr&oacu...
121
Voted
CLEF
2011
Springer
14 years 7 days ago
Intrinsic Plagiarism Detection Using Character Trigram Distance Scores - Notebook for PAN at CLEF 2011
Abstract In this paper, we describe a novel approach to intrinsic plagiarism detection. Each suspicious document is divided into a series of consecutive, potentially overlapping ā€...
Mike Kestemont, Kim Luyckx, Walter Daelemans
127
Voted
CLEF
2011
Springer
14 years 12 days ago
Simulation of Within-Session Query Variations Using a Text Segmentation Approach
Abstract. We propose a generative model for automatic query reformulations from an initial query using the underlying subtopic structure of top ranked retrieved documents. We addre...
Debasis Ganguly, Johannes Leveling, Gareth J. F. J...
CLEF
2011
Springer
14 years 12 days ago
External Plagiarism Detection using Information Retrieval and Sequence Alignment - Notebook for PAN at CLEF 2011
Abstract This paper describes the University of Sheffield entry for the 3rd International Competition on Plagiarism Detection which attempted the monolingual external plagiarism d...
Rao Muhammad Adeel Nawab, Mark Stevenson, Paul D. ...
CLEF
2011
Springer
14 years 12 days ago
Overview of the 2nd International Competition on Wikipedia Vandalism Detection
Abstract The paper overviews the vandalism detection task of the PAN’11 competition. A new corpus is introduced which comprises about 30 000 Wikipedia edits in the languages Engl...
Martin Potthast, Teresa Holfeld
113
Voted
CIKM
2011
Springer
14 years 12 days ago
Improved answer ranking in social question-answering portals
Community QA portals provide an important resource for non-factoid question-answering. The inherent noisiness of user-generated data makes the identification of high-quality cont...
Felix Hieber, Stefan Riezler
CIKM
2011
Springer
14 years 12 days ago
Detecting anomalies in graphs with numeric labels
This paper presents Yagada, an algorithm to search labelled graphs for anomalies using both structural data and numeric attributes. Yagada is explained using several security-rela...
Michael Davis, Weiru Liu, Paul Miller, George Redp...
CIKM
2011
Springer
14 years 12 days ago
Estimating selectivity for joined RDF triple patterns
A fundamental problem related to RDF query processing is selectivity estimation, which is crucial to query optimization for determining a join order of RDF triple patterns. In thi...
Hai Huang 0003, Chengfei Liu
CIKM
2011
Springer
14 years 12 days ago
Simultaneous joint and conditional modeling of documents tagged from two perspectives
This paper explores correspondence and mixture topic modeling of documents tagged from two different perspectives. There has been ongoing work in topic modeling of documents with...
Pradipto Das, Rohini K. Srihari, Yun Fu
114
Voted
CIKM
2011
Springer
14 years 12 days ago
Lower-bounding term frequency normalization
In this paper, we reveal a common deficiency of the current retrieval models: the component of term frequency (TF) normalization by document length is not lower-bounded properly;...
Yuanhua Lv, ChengXiang Zhai