Sciweavers

1314 search results - page 108 / 263
» Approximate data mining in very large relational data
ICDM
2003
IEEE
71 views, Data Mining
15 years 7 months ago
Tree-structured Partitioning Based on Splitting Histograms of Distances
We propose a novel clustering algorithm that is similar in spirit to classification trees. The data is recursively split using a criterion that applies a discrete curve evolution...
Longin Jan Latecki, Rajagopal Venugopal, Marc Sobe...
BMCBI
2005
163 views
15 years 1 months ago
CoaSim: A flexible environment for simulating genetic data under coalescent models
Background: Coalescent simulations are playing a large role in interpreting large-scale intraspecific sequence or polymorphism surveys and for planning and evaluating association ...
Thomas Mailund, Mikkel H. Schierup, Christian N. S...
KDD
2001
ACM
216 views, Data Mining
16 years 2 months ago
The distributed boosting algorithm
In this paper, we propose a general framework for distributed boosting intended for efficiently integrating specialized classifiers learned over very large and distributed homogeneo...
Aleksandar Lazarevic, Zoran Obradovic
IDEAL
2000
Springer
15 years 5 months ago
Applying Independent Component Analysis to Factor Model in Finance
The factor model is a very useful and popular model in finance. In this paper, we show the relation between the factor model and blind source separation, and we propose to use Independent ...
Siu-Ming Cha, Lai-Wan Chan
KDD
2002
ACM
166 views, Data Mining
16 years 2 months ago
Frequent term-based text clustering
Text clustering methods can be used to structure large sets of text or hypertext documents. The well-known methods of text clustering, however, do not really address the special p...
Florian Beil, Martin Ester, Xiaowei Xu