Sciweavers

3251 search results - page 534 / 651
» Challenges in Web Information Retrieval
Sort
View
SIGIR
2010
ACM
15 years 3 months ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
AIRS
2004
Springer
15 years 2 months ago
Effective Topic Distillation with Key Resource Pre-selection
Topic distillation aims at finding key resources which are high-quality pages for certain topics. With analysis in non-content features of key resources, a pre-selection method is ...
Yiqun Liu, Min Zhang, Shaoping Ma
CIKM
2006
Springer
15 years 2 months ago
A system for query-specific document summarization
There has been a great amount of work on query-independent summarization of documents. However, due to the success of Web search engines query-specific document summarization (que...
Ramakrishna Varadarajan, Vagelis Hristidis
PRICAI
2000
Springer
15 years 2 months ago
Towards a Next-Generation Search Engine
As more information becomes available on the World Wide Web, it has become an acute problem to provide effective search tools for information access. Previous generations of search...
Qiang Yang, Hai-Feng Wang, Ji-Rong Wen, Gao Zhang,...
CIKM
2008
Springer
15 years 1 months ago
Vanity fair: privacy in querylog bundles
A recently proposed approach to address privacy concerns in storing web search querylogs is bundling logs of multiple users together. In this work we investigate privacy leaks tha...
Rosie Jones, Ravi Kumar, Bo Pang, Andrew Tomkins