Sciweavers

861 search results - page 116 / 173
» Information landscapes and the analysis of search algorithms
Sort
View
SIGIR
2008
ACM
14 years 10 months ago
Local text reuse detection
Text reuse occurs in many different types of documents and for many different reasons. One form of reuse, duplicate or near-duplicate documents, has been a focus of researchers be...
Jangwon Seo, W. Bruce Croft
AIRWEB
2005
Springer
15 years 4 months ago
Blocking Blog Spam with Language Model Disagreement
We present an approach for detecting link spam common in blog comments by comparing the language models used in the blog post, the comment, and pages linked by the comments. In co...
Gilad Mishne, David Carmel, Ronny Lempel
91
Voted
CIKM
2007
Springer
15 years 5 months ago
The role of documents vs. queries in extracting class attributes from text
Challenging the implicit reliance on document collections, this paper discusses the pros and cons of using query logs rather than document collections, as self-contained sources o...
Marius Pasca, Benjamin Van Durme, Nikesh Garera
SIGCSE
2006
ACM
163views Education» more  SIGCSE 2006»
15 years 4 months ago
TextMOLE: text mining operations library and environment
The paper describes the first version of the TextMOLE (Text Mining Operations Library and Environment) system for textual data mining. Currently TextMOLE acts as an advanced inde...
Daniel B. Waegel, April Kontostathis
CLEF
2005
Springer
15 years 4 months ago
The XLDB Group at GeoCLEF 2005
This paper describes our participation at the GeoCLEF 2005 task. We detail the main software components of our Geo-IR system, its adaptation for the participation at GeoCLEF and d...
Nuno Cardoso, Bruno Martins, Marcirio Silveira Cha...