In this paper, we propose a novel formulation for distance-based outliers that is based on the distance of a point from its kth nearest neighbor. We rank each point on the basis o...
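As an illustration of the idea in the abstract above, here is a minimal sketch that scores each point by the distance to its kth nearest neighbor and ranks points by that score. The abstract is truncated, so the exact ranking rule is an assumption here, as is the brute-force search. The function names `kth_nn_distance` and `top_outliers`, the parameter `n`, and the sample data are hypothetical and do not come from the paper.

```python
import math

def kth_nn_distance(points, k):
    """Return, for each point, the Euclidean distance to its k-th nearest neighbor.

    Brute-force O(N^2 log N) sketch; not the paper's algorithm.
    """
    if not 1 <= k < len(points):
        raise ValueError("k must satisfy 1 <= k < number of points")
    scores = []
    for i, p in enumerate(points):
        # Distances from p to every other point, smallest first.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(dists[k - 1])
    return scores

def top_outliers(points, k, n):
    """Return indices of the n points with the largest k-th neighbor distance.

    Assumption: a larger k-th neighbor distance means a stronger outlier.
    """
    scores = kth_nn_distance(points, k)
    ranked = sorted(range(len(points)), key=lambda i: scores[i], reverse=True)
    return ranked[:n]

# Hypothetical data: four clustered points and one isolated point.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0)]
print(top_outliers(points, k=2, n=1))
```

With this sample data the isolated point at index 4 is far from every other point, so it receives the largest score and ranks first.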
Covariance and correlation estimates have important applications in data mining. In the presence of outliers, classical estimates of covariance and correlation matrices are not re...
Fatemah A. Alqallaf, Kjell P. Konis, R. Douglas Ma...
This paper describes a study performed in an industrial setting that attempts to build predictive models to identify parts of a Java system with a high probability of fault. The s...
Many problems in computer vision involving recognition and/or classification can be posed in the general framework of supervised learning. There is, however, one aspect of image dat...
Arunava Banerjee, Santhosh Kodipaka, Baba C. Vemur...
When using a Genetic Algorithm (GA) to optimize the feature space of pattern classification problems, the performance improvement is not only determined by the data set used, but a...
Zhijian Huang, Min Pei, Erik D. Goodman, Yong Huan...