Sciweavers

280 search results - page 22 / 56
» A Semi-Supervised Document Clustering Algorithm Based on EM
Sort
View
ICCS
2009
Springer
15 years 8 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov
AI
2007
Springer
15 years 7 months ago
Fuzzy Clustering for Topic Analysis and Summarization of Document Collections
Abstract. Large document collections, such as those delivered by Internet search engines, are difficult and time-consuming for users to read and analyse. The detection of common an...
René Witte, Sabine Bergler
CIKM
2008
Springer
15 years 3 months ago
A language for manipulating clustered web documents results
We propose a novel conception language for exploring the results retrieved by several internet search services (like search engines) that cluster retrieved documents. The goal is ...
Gloria Bordogna, Alessandro Campi, Giuseppe Psaila...
ICASSP
2011
IEEE
14 years 5 months ago
A hierarchical generative model for Generic Audio Document Categorization
In this paper, we call the pattern classification problem that consists in assigning a category label to a long audio signal based on its semantic content as Generic Audio Documen...
Zhi Zeng, Shuwu Zhang
ICML
2004
IEEE
16 years 2 months ago
Boosting margin based distance functions for clustering
The performance of graph based clustering methods critically depends on the quality of the distance function, used to compute similarities between pairs of neighboring nodes. In t...
Tomer Hertz, Aharon Bar-Hillel, Daphna Weinshall