Sciweavers

27 search results - page 4 / 6
» Web Search Results Clustering in Polish: Experimental Evalua...
Sort
View
EXPDB
2006
ACM
13 years 11 months ago
A Reproducible Benchmark for P2P Retrieval
With the growing popularity of information retrieval (IR) in distributed systems and in particular P2P Web search, a huge number of protocols and prototypes have been introduced i...
Thomas Neumann, Matthias Bender, Sebastian Michel,...
WWW
2010
ACM
14 years 23 days ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
WWW
2008
ACM
14 years 6 months ago
Automatic online news issue construction in web environment
In many cases, rather than a keyword search, people intend to see what is going on through the Internet. Then the integrated comprehensive information on news topics is necessary,...
Canhui Wang, Min Zhang, Shaoping Ma, Liyun Ru
WWW
2008
ACM
14 years 6 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
MM
2006
ACM
167views Multimedia» more  MM 2006»
13 years 11 months ago
Image annotation by large-scale content-based image retrieval
Image annotation has been an active research topic in recent years due to its potentially large impact on both image understanding and Web image search. In this paper, we target a...
Xirong Li, Le Chen, Lei Zhang, Fuzong Lin, Wei-Yin...