Sciweavers

ANOR
2010

A framework of irregularity enlightenment for data pre-processing in data mining

13 years 4 months ago
A framework of irregularity enlightenment for data pre-processing in data mining
Abstract Irregularities are widespread in large databases and often lead to erroneous conclusions with respect to data mining and statistical analysis. For example, considerable bias is often resulted from many parameter estimation procedures without properly handling significant irregularities. Most data cleaning tools assume one known type of irregularity. This paper proposes a generic Irregularity Enlightenment (IE) framework for dealing with the situation when multiple irregularities are hidden in large volumes of data in general and cross sectional time series in particular. It develops an automatic data mining platform to capture key irregularities and classify them based on their importance in a database. By decomposing time series data into basic components, we propose to optimize a penalized least square loss function to aid the selection of key irregularities in consecutive steps and cluster time series into different groups until an acceptable level of variation reduction is...
Siu-Tong Au, Rong Duan, Siamak G. Hesar, Wei Jiang
Added 08 Dec 2010
Updated 08 Dec 2010
Type Journal
Year 2010
Where ANOR
Authors Siu-Tong Au, Rong Duan, Siamak G. Hesar, Wei Jiang
Comments (0)