Nearest neighbours in least-squares data imputation algorithms with different missing patterns

15 years 5 months ago

Download www.dcs.bbk.ac.uk

Methods for imputation of missing data in the so-called least-squares approximation approach, a non-parametric computationally efficient multidimensional technique, are experimentally compared. Contributions are made to each of the three components of the experiment setting: (a) algorithms to be compared, (b) data generation, and (c) patterns of missing data. Specifically, "global" methods for least-squares data imputation are reviewed and extensions to them are proposed based on the nearest neighbours (NN) approach. A conventional generator of mixtures of Gaussian distributions is theoretically analysed and, then, modified to scale clusters differently. Patterns of missing data are defined in terms of rows and columns according to three different mechanisms that are referred to as Random missings, Restricted random missings, and Merged database. It appears that NN-based versions almost always outperform their global counterparts. With the Random missings pattern, the winner...

Ito Wasito, Boris Mirkin

Real-time Traffic

CSDA 2006 | Efficient Multidimensional Technique | Random Missings | So-called Least-squares Approximation |

claim paper

» Evaluating music sequence models through missing data

» CFGeNe Fuzzy Framework for Robust Gene Regulatory Network Inference

» Filling in the Blanks Krimp Minimisation for Missing Data

» ARTMAPIC and medical diagnosis Instance counting and inconsistent cases

Post Info
More Details (n/a)

Added	11 Dec 2010
Updated	11 Dec 2010
Type	Journal
Year	2006
Where	CSDA
Authors	Ito Wasito, Boris Mirkin

Comments (0)

Sciweavers

Nearest neighbours in least-squares data imputation algorithms with different missing patterns

CSDA 2006 | Efficient Multidimensional Technique | Random Missings | So-called Least-squares Approximation |

Explore & Download

Productivity Tools

Sciweavers