Sciweavers

NIPS
2001
13 years 6 months ago
An Efficient Clustering Algorithm Using Stochastic Association Model and Its Implementation Using Nanostructures
This paper describes a clustering algorithm for vector quantizers using a "stochastic association model". It offers a new simple and powerful softmax adaptation rule. Th...
Takashi Morie, Tomohiro Matsuura, Makoto Nagata, A...
LWA
2004
13 years 6 months ago
Experiments in Term Weighting and Keyword Extraction in Document Clustering
We study methods to initialize or bias different clustering methods using prior information about the "importance" of a keyword w.r.t. the whole document collection or a...
Christian Borgelt, Andreas Nürnberger
EMNLP
2007
13 years 6 months ago
Extending a Thesaurus in the Pan-Chinese Context
In this paper, we address a unique problem in Chinese language processing and report on our study on extending a Chinese thesaurus with region-specific words, mostly from the fina...
Oi Yee Kwong, Benjamin Ka-Yin T'sou
DAS
2008
Springer
13 years 6 months ago
A Comparison of Clustering Methods for Word Image Indexing
In this paper we explore the effectiveness of three clustering methods used to perform word image indexing. The three methods are: the Self-Organazing Map (SOM), the Growing Hiera...
Simone Marinai, Emanuele Marino, Giovanni Soda
CLEF
2008
Springer
13 years 6 months ago
Clustering for Photo Retrieval at Image CLEF 2008
This paper presents the first participation of the University of Ottawa group in the Photo Retrieval task at Image CLEF 2008. Our system uses Lucene for text indexing and LIRE for ...
Diana Inkpen, Marc Stogaitis, François DeGu...
VLDB
1994
ACM
140views Database» more  VLDB 1994»
13 years 8 months ago
Efficient and Effective Clustering Methods for Spatial Data Mining
Spatial data mining is the discovery of interesting relationships and characteristics that may exist implicitly in spatial databases. In this paper, we explore whether clustering ...
Raymond T. Ng, Jiawei Han
ISPA
2005
Springer
13 years 10 months ago
COMPACT: A Comparative Package for Clustering Assessment
Abstract. There exist numerous algorithms that cluster data-points from largescale genomic experiments such as sequencing, gene-expression and proteomics. Such algorithms may emplo...
Roy Varshavsky, Michal Linial, David Horn
ISBRA
2007
Springer
13 years 10 months ago
GFBA: A Biclustering Algorithm for Discovering Value-Coherent Biclusters
Clustering has been one of the most popular approaches used in gene expression data analysis. A clustering method is typically used to partition genes according to their similarity...
Xubo Fei, Shiyong Lu, Horia F. Pop, Lily R. Liang
AUSDM
2007
Springer
173views Data Mining» more  AUSDM 2007»
13 years 10 months ago
The Use of Various Data Mining and Feature Selection Methods in the Analysis of a Population Survey Dataset
This paper reports the results of feature reduction in the analysis of a population based dataset for which there were no specific target variables. All attributes were assessed a...
Ellen Pitt, Richi Nayak
ICDM
2008
IEEE
118views Data Mining» more  ICDM 2008»
13 years 11 months ago
Extension of Partitional Clustering Methods for Handling Mixed Data
Clustering is an active research topic in data mining and different methods have been proposed in the literature. Most of these methods are based on the use of a distance measure ...
Yosr Naïja, Salem Chakhar, Kaouthar Blibech, ...