Sciweavers

832 search results - page 37 / 167
» Document clustering with committees
Sort
View
EEE
2005
IEEE
15 years 7 months ago
Learning the Kernel Matrix for XML Document Clustering
The rapid growth of XML adoption has urged for the need of a proper representation for semi-structured documents, where the document structural information has to be taken into ac...
Jianwu Yang, William Kwok-Wai Cheung, Xiaoou Chen
ICPR
2004
IEEE
16 years 2 months ago
Coordinate Systems Reconstruction for Graphical Documents by Hough-feature Clustering and Geometric Analysis
Two-dimensional and three-dimensional coordinate systems are the basic graphics symbols in many graphical documents. A robust coordinate system detection scheme is needed in order...
Chew Lim Tan, Yan Ping Zhou
IJCAI
2001
15 years 2 months ago
Combining Statistics and Semantics for Word and Document Clustering
A new approach for constructing pseudo-keywords, referred to as Sense Units, is proposed. Sense Units are obtained by a word clustering process, where the underlying similarity re...
Alexandre Termier, Michèle Sebag, Marie-Chr...
EMNLP
2009
14 years 11 months ago
Unsupervised morphological segmentation and clustering with document boundaries
Many approaches to unsupervised morphology acquisition incorporate the frequency of character sequences with respect to each other to identify word stems and affixes. This typical...
Taesun Moon, Katrin Erk, Jason Baldridge
CIKM
2004
Springer
15 years 7 months ago
Stemming and lemmatization in the clustering of finnish text documents
Under construction… Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval – clustering. General Terms Algorithms, Expe...
Tuomo Korenius, Jorma Laurikkala, Kalervo Jär...