WIDM
15 years 10 months ago
2004 ACM
A focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
WIDM
15 years 10 months ago
2004 ACM
In this paper, we propose a set of similarity metrics for manipulating collections of values occurring in XML documents. Following the data model presented in TAX algebra, we treat...
WIDM
15 years 10 months ago
2004 ACM
We address the problem of querying XML data over a P2P network. In P2P networks, the allowed kinds of queries are usually exact-match queries over file names. We discuss the exte...
WIDM
15 years 10 months ago
2004 ACM
Peer-to-Peer networking has become a major research topic over the last few years. Sharing of structured data in such decentralized environments is a challenging proble...
WIDM
15 years 10 months ago
2004 ACM
In this paper, we propose a new approach to automatically clustering e-commerce search engines (ESEs) on the Web such that ESEs in the same cluster sell similar products. This all...