Inlier-Based Outlier Detection via Direct Density Ratio Estimation

We propose a new statistical approach to the problem of inlier-based outlier detection, i.e., finding outliers in the test set based on the training set consisting only of inliers. Our key idea is to use the ratio of training and test data densities as an outlier score; we estimate the ratio directly in a semi-parametric fashion without going through density estimation. Thus our approach is expected to have better performance in high-dimensional problems. Furthermore, the applied algorithm for density ratio estimation is equipped with a natural cross-validation procedure, allowing us to objectively optimize the value of tuning parameters such as the regularization parameter and the kernel width. The algorithm offers a closed-form solution as well as a closed-form formula for the leave-one-out error. Thanks to this, the proposed outlier detection method is computationally very efficient and is scalable to massive datasets. Simulations with benchmark and real-world datasets illustrate ...
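
To make the idea in the abstract concrete, here is a minimal sketch of scoring test points by a directly estimated density ratio. It is not the authors' implementation: the abstract does not name the estimator, so the sketch assumes a Gaussian-kernel model fitted by regularized least squares, which is one way to obtain a closed-form solution. The function names (gaussian_kernel, fit_density_ratio, inlier_score) and the default values of sigma, lam, and n_centers are hypothetical, and the leave-one-out cross-validation mentioned in the abstract is left out, so the kernel width and the regularization parameter are simply fixed by hand.

```python
import numpy as np


def gaussian_kernel(x, centers, sigma):
    """Gaussian kernel values between each row of x and each center."""
    sq_dist = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq_dist / (2.0 * sigma ** 2))


def fit_density_ratio(x_train, x_test, sigma=1.0, lam=0.1, n_centers=100, seed=0):
    """Fit w(x) ~ p_train(x) / p_test(x) as a kernel expansion.

    Assumption: kernel centers are a random subset of the training (inlier)
    samples, and the coefficients come from a ridge-regularized
    least-squares fit solved as a single linear system.
    """
    rng = np.random.default_rng(seed)
    n_centers = min(n_centers, len(x_train))
    centers = x_train[rng.choice(len(x_train), size=n_centers, replace=False)]

    phi_train = gaussian_kernel(x_train, centers, sigma)
    phi_test = gaussian_kernel(x_test, centers, sigma)

    # Second moment of the basis functions under the test (denominator) data.
    H = phi_test.T @ phi_test / len(x_test)
    # Mean of the basis functions under the training (numerator) data.
    h = phi_train.mean(axis=0)

    # Closed-form solution of the regularized least-squares problem.
    alpha = np.linalg.solve(H + lam * np.eye(n_centers), h)
    # A density ratio is non-negative, so negative coefficients are clipped.
    alpha = np.maximum(alpha, 0.0)
    return centers, alpha


def inlier_score(x, centers, alpha, sigma=1.0):
    """Estimated ratio at x; values near zero suggest an outlier."""
    return gaussian_kernel(x, centers, sigma) @ alpha


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Training set: inliers only.
    x_train = rng.normal(size=(200, 2))
    # Test set: 100 inliers followed by 5 points far from the inlier cloud.
    x_test = np.vstack([
        rng.normal(size=(100, 2)),
        rng.normal(loc=8.0, scale=0.1, size=(5, 2)),
    ])
    centers, alpha = fit_density_ratio(x_train, x_test)
    scores = inlier_score(x_test, centers, alpha)
    print("median score, test inliers:", np.median(scores[:100]))
    print("max score, injected outliers:", scores[100:].max())
```

In this sketch a test point gets a low score when the training density is small relative to the test density at that point, so ranking test points by ascending score puts the most outlier-like points first. Fixing sigma and lam by hand is the main simplification; a fuller version would choose them by the cross-validation procedure the abstract describes.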

Shohei Hido, Yuta Tsuboi, Hisashi Kashima, Masashi Sugiyama, Takafumi Kanamori

Data Mining | ICDM 2008 | Inlier-based Outlier Detection | Outlier | Outlier Detection

» Direct Density Ratio Estimation with Dimensionality Reduction

» Dimensionality reduction for density ratio estimation in high-dimensional spaces

» Direct importance estimation with probabilistic principal component analyzers

» Statistical analysis of kernel-based least-squares density-ratio estimation

Post Info

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICDM
Authors	Shohei Hido, Yuta Tsuboi, Hisashi Kashima, Masashi Sugiyama, Takafumi Kanamori
