Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

137

Voted

ICONIP
2007

139views Information Technology» more ICONIP 2007»

Principal Component Analysis for Sparse High-Dimensional Data

15 years 5 months ago

Principal Component Analysis for Sparse High-Dimensional Data

Download www.cis.hut.fi

Abstract. Principal component analysis (PCA) is a widely used technique for data analysis and dimensionality reduction. Eigenvalue decomposition is the standard algorithm for solving PCA, but a number of other algorithms have been proposed. For instance, the EM algorithm is much more eﬃcient in case of high dimensionality and a small number of principal components. We study a case where the data are high-dimensional and a majority of the values are missing. In this case, both of these algorithms prove inadequate. We propose using a gradient descent algorithm inspired by Oja’s rule, and speeding it up by an approximate Newton’s method. The computational complexity of the proposed method is linear to the number of observed values in the data and to the number of principal components. The experiments with Netﬂix data conﬁrm that the proposed algorithm is about ten times faster than any of the four comparison methods.

Tapani Raiko, Alexander Ilin, Juha Karhunen

Real-time Traffic

Algorithm | ICONIP 2007 | Information Technology | Principal Component Analysis | Principal Components |

claim paper

Related Content

» The Effect of Principal Component Analysis on Machine Learning Accuracy with High Dimensio...

» Principal Component Analysis with Contaminated Data The High Dimensional Case

» Practical Approaches to Principal Component Analysis in the Presence of Missing Values

» Principal Component Analysis for Large Scale Problems with Lots of Missing Values

» Cluster Analysis of HighDimensional Data A Case Study

» Gaussian Process Latent Variable Models for Visualisation of High Dimensional Data

» Generalized Projected Clustering in HighDimensional Data Streams

» Algorithms for BoundedError Correlation of High Dimensional Data in Microarray Experiments

» Clustering and Feature Selection using Sparse Principal Component Analysis

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	ICONIP
Authors	Tapani Raiko, Alexander Ilin, Juha Karhunen

Comments (0)