Sciweavers

208 search results - page 22 / 42
» On evaluating web search with very few relevant documents
Sort
View
WEBI
2005
Springer
15 years 3 months ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini
ERCIMDL
1999
Springer
154views Education» more  ERCIMDL 1999»
15 years 1 months ago
Effectiveness of Keyword-Based Display and Selection of Retrieval Results for Interactive Searches
Abstract. We present an approach to increasing the effectiveness of rankedoutput retrieval systems that relies on graphical display and user manipulation of “views” of retrieva...
Ezio Berenci, Claudio Carpineto, Vittorio Giannini...
ICADL
2010
Springer
160views Education» more  ICADL 2010»
15 years 2 months ago
Thesaurus Extension Using Web Search Engines
Maintaining and extending large thesauri is an important challenge facing digital libraries and IT businesses alike. In this paper we describe a method building on and extending ex...
Robert Meusel, Mathias Niepert, Kai Eckert, Heiner...
89
Voted
WWW
2007
ACM
15 years 10 months ago
Answering bounded continuous search queries in the world wide web
Search queries applied to extract relevant information from the World Wide Web over a period of time may be denoted as continuous search queries. The improvement of continuous sea...
Dirk Kukulenz, Alexandros Ntoulas
LAWEB
2003
IEEE
15 years 3 months ago
On the Evolution of Clusters of Near-Duplicate Web Pages
This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...
Dennis Fetterly, Mark Manasse, Marc Najork