Sciweavers

IDA
2007
Springer
13 years 4 months ago
In search of deterministic methods for initializing K-means and Gaussian mixture clustering
The performance of K-means and Gaussian mixture model (GMM) clustering depends on the initial guess of partitions. Typically, clus∗ corresponding author 1
Ting Su, Jennifer G. Dy
IDA
2007
Springer
13 years 10 months ago
Relational Algebra for Ranked Tables with Similarities: Properties and Implementation
The paper presents new developments in an extension of Codd’s relational model of data. The extension consists in equipping domains of attribute values with a similarity relation...
Radim Belohlávek, Stanislav Opichal, Vil&ea...
IDA
2007
Springer
13 years 10 months ago
Two Bagging Algorithms with Coupled Learners to Encourage Diversity
In this paper, we present two ensemble learning algorithms which make use of boostrapping and out-of-bag estimation in an attempt to inherit the robustness of bagging to overfitti...
Carlos Valle, Ricardo Ñanculef, Héct...
IDA
2007
Springer
13 years 10 months ago
Landscape Multidimensional Scaling
Katharina Tschumitschew, Frank Klawonn, Frank H&ou...
IDA
2007
Springer
13 years 10 months ago
Visualising the Cluster Structure of Data Streams
The increasing availability of streaming data is a consequence of the continuing advancement of data acquisition technology. Such data provides new challenges to the various data a...
Dimitris K. Tasoulis, Gordon J. Ross, Niall M. Ada...
IDA
2007
Springer
13 years 10 months ago
Learning to Align: A Statistical Approach
We present a new machine learning approach to the inverse parametric sequence alignment problem: given as training examples a set of correct pairwise global alignments, find the p...
Elisa Ricci, Tijl De Bie, Nello Cristianini
IDA
2007
Springer
13 years 10 months ago
Combining Bagging and Random Subspaces to Create Better Ensembles
Random forests are one of the best performing methods for constructing ensembles. They derive their strength from two aspects: using random subsamples of the training data (as in b...
Pance Panov, Saso Dzeroski
IDA
2007
Springer
13 years 10 months ago
Compact and Understandable Descriptions of Mixtures of Bernoulli Distributions
Abstract. Finite mixture models can be used in estimating complex, unknown probability distributions and also in clustering data. The parameters of the models form a complex repres...
Jaakko Hollmén, Jarkko Tikka
IDA
2007
Springer
13 years 10 months ago
DENCLUE 2.0: Fast Clustering Based on Kernel Density Estimation
The Denclue algorithm employs a cluster model based on kernel density estimation. A cluster is defined by a local maximum of the estimated density function. Data points are assign...
Alexander Hinneburg, Hans-Henning Gabriel