Sciweavers

121 search results - page 13 / 25
» Pagerank based clustering of hypertext document collections
DOCENG
2007
ACM
15 years 1 month ago
Elimination of junk document surrogate candidates through pattern recognition
A surrogate is an object that stands for a document and enables navigation to that document. Hypermedia is often represented with textual surrogates, even though studies have show...
Eunyee Koh, Daniel Caruso, Andruid Kerne, Ricardo ...
DBISP2P
2003
Springer
15 years 2 months ago
Content-Based Overlay Networks for XML Peers Based on Multi-level Bloom Filters
Peer-to-peer systems are gaining popularity as a means to effectively share huge, massively distributed data collections. In this paper, we consider XML peers, that is, peers that ...
Georgia Koloniari, Yannis Petrakis, Evaggelia Pito...
ICML
2005
IEEE
15 years 10 months ago
Multi-way distributional clustering via pairwise interactions
We present a novel unsupervised learning scheme that simultaneously clusters variables of several types (e.g., documents, words and authors) based on pairwise interactions between...
Ron Bekkerman, Ran El-Yaniv, Andrew McCallum
IRCDL
2007
14 years 11 months ago
An Hybrid Approach for Improving Word Sense Disambiguation and Text Clustering
In this paper we suggest a new approach to represent text document collections, integrating background knowledge to improve clustering effectiveness. Background knowled...
Paolo Casoto, Carlo Tasso
SIGIR
2005
ACM
15 years 3 months ago
Exploiting the hierarchical structure for link analysis
Link analysis algorithms have been extensively used in Web information retrieval. However, current link analysis algorithms generally work on a flat link graph, ignoring the hiera...
Gui-Rong Xue, Qiang Yang, Hua-Jun Zeng, Yong Yu, Z...