Sciweavers

» Knowledge-based Document Analysis
LREC
2010
A Corpus for Evaluating Semantic Multilingual Web Retrieval Systems: The Sense Folder Corpus
In this paper, we present the multilingual Sense Folder Corpus. After the analysis of different corpora, we describe the requirements that have to be satisfied for evaluating sema...
Ernesto William De Luca
LREC
2008
Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian
We aim to characterize the comparability of corpora; we address this issue in the trilingual context through the distinction between expert and non-expert documents. We work separately...
Lorraine Goeuriot, Natalia Grabar, Béatrice...
ECIR
2006
Springer
Intrinsic Plagiarism Detection
Current research in the field of automatic plagiarism detection for text documents focuses on algorithms that compare plagiarized documents against potential original documents. Th...
Sven Meyer zu Eissen, Benno Stein
SIGIR
2008
ACM
Latent Dirichlet allocation based multi-document summarization
Extraction based Multi-Document Summarization Algorithms consist of choosing sentences from the documents using some weighting mechanism and combining them into a summary. In this...
Rachit Arora, Balaraman Ravindran
CIKM
2007
Springer
Effective top-k computation in retrieving structured documents with term-proximity support
Modern web search engines are expected to return top-k results efficiently given a query. Although many dynamic index pruning strategies have been proposed for efficient top-k com...
Mingjie Zhu, Shuming Shi, Mingjing Li, Ji-Rong Wen