160
WIDM
15 years 8 months ago
2004 ACM
A focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
141
WIDM
15 years 8 months ago
2004 ACM
In this paper, we propose a set of similarity metrics for manipulating collections of values occurring in XML documents. Following the data model presented in TAX algebra, we treat...
139
WIDM
15 years 8 months ago
2004 ACM
We address the problem of querying XML data over a P2P network. In P2P networks, the allowed kinds of queries are usually exact-match queries over file names. We discuss the exte...
130
WIDM
15 years 8 months ago
2004 ACM
We present the user evaluation of two recommendation server methodologies implemented for the NASA Technical Report Server (NTRS). One methodology for generating recommendations u...
130
WIDM
15 years 8 months ago
2004 ACM
In this paper, we propose a new approach to automatically clustering e-commerce search engines (ESEs) on the Web such that ESEs in the same cluster sell similar products. This all...