Sciweavers

1562 search results - page 197 / 313
» Indexing Multimedia for the Internet
Sort
View
88
Voted
WWW
2007
ACM
16 years 1 months ago
Efficient search engine measurements
We address the problem of measuring global quality metrics of search engines, like corpus size, index freshness, and density of duplicates in the corpus. The recently proposed est...
Ziv Bar-Yossef, Maxim Gurevich
100
Voted
WWW
2007
ACM
16 years 1 months ago
A new suffix tree similarity measure for document clustering
In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...
Hung Chim, Xiaotie Deng
122
Voted
WWW
2007
ACM
16 years 1 months ago
P-TAG: large scale automatic generation of personalized annotation tags for the web
The success of the Semantic Web depends on the availability of Web pages annotated with metadata. Free form metadata or tags, as used in social bookmarking and folksonomies, have ...
Paul-Alexandru Chirita, Stefania Costache, Wolfgan...
93
Voted
WWW
2007
ACM
16 years 1 months ago
Deriving knowledge from figures for digital libraries
Figures in digital documents contain important information. Current digital libraries do not summarize and index information available within figures for document retrieval. We pr...
Xiaonan Lu, James Ze Wang, Prasenjit Mitra, C. Lee...
117
Voted
WWW
2007
ACM
16 years 1 months ago
Organizing and searching the world wide web of facts -- step two: harnessing the wisdom of the crowds
As part of a large effort to acquire large repositories of facts from unstructured text on the Web, a seed-based framework for textual information extraction allows for weakly sup...
Marius Pasca