A framework of irregularity enlightenment for data pre-processing in data mining

15 years 6 months ago

Download www.research.att.com

Abstract Irregularities are widespread in large databases and often lead to erroneous conclusions with respect to data mining and statistical analysis. For example, considerable bias is often resulted from many parameter estimation procedures without properly handling significant irregularities. Most data cleaning tools assume one known type of irregularity. This paper proposes a generic Irregularity Enlightenment (IE) framework for dealing with the situation when multiple irregularities are hidden in large volumes of data in general and cross sectional time series in particular. It develops an automatic data mining platform to capture key irregularities and classify them based on their importance in a database. By decomposing time series data into basic components, we propose to optimize a penalized least square loss function to aid the selection of key irregularities in consecutive steps and cluster time series into different groups until an acceptable level of variation reduction is...

Siu-Tong Au, Rong Duan, Siamak G. Hesar, Wei Jiang

Real-time Traffic

Abstract Irregularities | ANOR 2010 | Key Irregularities | Time Series |

claim paper

» High Performance Subgraph Mining in Molecular Compounds

» Validating and Refining Clusters via Visual Rendering

Post Info
More Details (n/a)

Added	08 Dec 2010
Updated	08 Dec 2010
Type	Journal
Year	2010
Where	ANOR
Authors	Siu-Tong Au, Rong Duan, Siamak G. Hesar, Wei Jiang

Comments (0)

Sciweavers

A framework of irregularity enlightenment for data pre-processing in data mining

Abstract Irregularities | ANOR 2010 | Key Irregularities | Time Series |

Explore & Download

Productivity Tools

Sciweavers