Sciweavers

10 search results - page 1 / 2
» Query-driven document partitioning and collection selection
Sort
View
INFOSCALE
2006
ACM
13 years 11 months ago
Query-driven document partitioning and collection selection
Diego Puppin, Fabrizio Silvestri, Domenico Laforen...
KDD
2010
ACM
326views Data Mining» more  KDD 2010»
13 years 3 months ago
Document clustering via dirichlet process mixture model with feature selection
One essential issue of document clustering is to estimate the appropriate number of clusters for a document collection to which documents should be partitioned. In this paper, we ...
Guan Yu, Ruizhang Huang, Zhaojun Wang
WWW
2007
ACM
14 years 6 months ago
Query-driven indexing for peer-to-peer text retrieval
We describe a query-driven indexing framework for scalable text retrieval over structured P2P networks. To cope with the bandwidth consumption problem that has been identified as ...
Gleb Skobeltsyn, Toan Luu, Karl Aberer, Martin Raj...
INFOSCALE
2007
ACM
13 years 6 months ago
Load-balancing and caching for collection selection architectures
— To address the rapid growth of the Internet, modern Web search engines have to adopt distributed organizations, where the collection of indexed documents is partitioned among s...
Diego Puppin, Fabrizio Silvestri, Raffaele Perego,...
WEBI
2005
Springer
13 years 10 months ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini