Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

174

WEBI
2005
Springer

216views Internet Technology» more WEBI 2005»

A Semi-Supervised Document Clustering Algorithm Based on EM

15 years 9 months ago

A Semi-Supervised Document Clustering Algorithm Based on EM

Download www.dii.unisi.it

Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the category structure. This task can be difﬁcult also for humans because many different but valid partitions may exist for the same collection. Moreover, the lack of information about categories makes it difﬁcult to apply effective feature selection techniques to reduce the noise in the representation of texts. Despite these intrinsic difﬁculties, text clustering is an important task for Web search applications in which huge collections or quite long query result lists must be automatically organized. Semi-supervised clustering lies in between automatic categorization and auto-organization. It is assumed that the supervisor is not required to specify a set of classes, but only to provide a set of texts grouped by the criteria to be used to organize the collection. In this paper we present a novel algorithm fo...

Leonardo Rigutini, Marco Maggini

Real-time Traffic

Automatic Text Processing | Document Clustering | Feature Selection Technique | Internet Technology | WEBI 2005 |

claim paper

Related Content

» A SemiSupervised Document Clustering Technique for Information Organization

» SISC A Text Classification Approach Using Semi Supervised Subspace Clustering

» SemiSupervised Learning via Regularized Boosting Working on Multiple SemiSupervised Assump...

» Multilabel ASRS Dataset Classification Using Semi Supervised Subspace Clustering

» A Framework Based on SemiSupervised Clustering for Discovering Unique Writing Styles

» A Framework for SemiSupervised Learning Based on Subjective and Objective Clustering Crite...

» ClusteringBased Stratified Seed Sampling for SemiSupervised Relation Classification

» On SemiSupervised Classification

» Semi Supervised Spectral Clustering for Regulatory Module Discovery

Post Info
More Details (n/a)

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	WEBI
Authors	Leonardo Rigutini, Marco Maggini

Comments (0)