Sciweavers

55 search results - page 3 / 11
» A hybrid unsupervised approach for document clustering
Sort
View
SDM
2007
SIAM
187views Data Mining» more  SDM 2007»
13 years 6 months ago
Topic Models over Text Streams: A Study of Batch and Online Unsupervised Learning
Topic modeling techniques have widespread use in text data mining applications. Some applications use batch models, which perform clustering on the document collection in aggregat...
Arindam Banerjee, Sugato Basu
IJCNN
2006
IEEE
13 years 11 months ago
A Self-Organising Map Approach for Clustering of XML Documents
— The number of XML documents produced and available on the Internet is steadily increasing. It is thus important to devise automatic procedures to extract useful information fro...
Francesca Trentini, Markus Hagenbuchner, Alessandr...
IRCDL
2007
13 years 6 months ago
An Hybrid Approach for Improving Word Sense Disambiguation and Text Clustering
Abstract— In this paper we suggest a new approach to represent text document collections, integrating background knowledge to improve clustering effectiveness. Background knowled...
Paolo Casoto, Carlo Tasso
ICDM
2008
IEEE
147views Data Mining» more  ICDM 2008»
13 years 11 months ago
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
WWW
2006
ACM
13 years 11 months ago
Using proportional transportation similarity with learned element semantics for XML document clustering
This paper proposes a novel approach to measuring XML document similarity by taking into account the semantics between XML elements. The motivation of the proposed approach is to ...
Xiaojun Wan, Jianwu Yang