Sciweavers

CLEF
2011
Springer
12 years 4 months ago
Author Identification Using Semi-supervised Learning - Notebook for PAN at CLEF 2011
Author identification models fall into two major categories according to the way they handle the training texts: profile-based models produce one representation per author while in...
Ioannis Kourtis, Efstathios Stamatatos
CLEF
2011
Springer
12 years 4 months ago
Using Clustering to Identify Outlier Chunks of Text - Notebook for PAN at CLEF 2011
Intrinsic plagiarism detection is a sub-task of authorship identification in which outlier chunks must be detected solely on the basis of stylistic differences from the main body o...
Navot Akiva
CLEF
2011
Springer
12 years 4 months ago
Approaches for Intrinsic and External Plagiarism Detection - Notebook for PAN at CLEF 2011
Plagiarism detection has been considered as a classification problem which can be approximated with intrinsic strategies, considering self-based information from a given document,...
Gabriel Oberreuter, Gaston L'Huillier, Sebasti&aac...
CLEF
2011
Springer
12 years 4 months ago
Search Snippet Evaluation at Yandex: Lessons Learned and Future Directions
This papers surveys different approaches to evaluation of web search summaries and describes experiments conducted at Yandex. We hypothesize that the complex task of snippet evalua...
Denis Savenkov, Pavel Braslavski, Mikhail Lebedev
CLEF
2011
Springer
12 years 4 months ago
Evaluating Some Contextual Factors for Image Retrieval - ReDCAD Participation at ImageCLEF Wikipedia 2011
Our participation in the ImageCLEF Wikipedia retrieval task aims to study the efficiency of using two contextual factors in image retrieval: metadata which contains specific infor...
Hatem Awadi, Mouna Torjmen Khemakhem, Maher Ben Je...
CLEF
2011
Springer
12 years 4 months ago
Overview of the 3rd International Competition on Plagiarism Detection
Abstract This paper overviews eleven plagiarism detectors that have been de
Martin Potthast, Andreas Eiselt, Alberto Barr&oacu...
CLEF
2011
Springer
12 years 4 months ago
Intrinsic Plagiarism Detection Using Character Trigram Distance Scores - Notebook for PAN at CLEF 2011
Abstract In this paper, we describe a novel approach to intrinsic plagiarism detection. Each suspicious document is divided into a series of consecutive, potentially overlapping ...
Mike Kestemont, Kim Luyckx, Walter Daelemans
CLEF
2011
Springer
12 years 4 months ago
Simulation of Within-Session Query Variations Using a Text Segmentation Approach
Abstract. We propose a generative model for automatic query reformulations from an initial query using the underlying subtopic structure of top ranked retrieved documents. We addre...
Debasis Ganguly, Johannes Leveling, Gareth J. F. J...
CLEF
2011
Springer
12 years 4 months ago
External Plagiarism Detection using Information Retrieval and Sequence Alignment - Notebook for PAN at CLEF 2011
Abstract This paper describes the University of Sheffield entry for the 3rd International Competition on Plagiarism Detection which attempted the monolingual external plagiarism d...
Rao Muhammad Adeel Nawab, Mark Stevenson, Paul D. ...
CLEF
2011
Springer
12 years 4 months ago
Overview of the 2nd International Competition on Wikipedia Vandalism Detection
Abstract The paper overviews the vandalism detection task of the PAN’11 competition. A new corpus is introduced which comprises about 30 000 Wikipedia edits in the languages Engl...
Martin Potthast, Teresa Holfeld