Sciweavers

17688 search results - page 103 / 3538
» Data Set Balancing
Sort
View
SIGMOD
2000
ACM
173views Database» more  SIGMOD 2000»
15 years 7 months ago
Efficient Algorithms for Mining Outliers from Large Data Sets
In this paper, we propose a novel formulation for distance-based outliers that is based on the distance of a point from its kth nearest neighbor. We rank each point on the basis o...
Sridhar Ramaswamy, Rajeev Rastogi, Kyuseok Shim
LISA
2007
15 years 6 months ago
Policy-Driven Management of Data Sets
Contemporary storage systems separate the management of data from the management of the underlying physical storage media used to store that data. This separation is artificial an...
Jim Holl, Kostadis Roussos, Jim Voll
HIS
2008
15 years 5 months ago
Genetic-Based Synthetic Data Sets for the Analysis of Classifiers Behavior
In this paper, we highlight the use of synthetic data sets to analyze learners behavior under bounded complexity. We propose a method to generate synthetic data sets with a specif...
Núria Macià, Albert Orriols-Puig, Es...
PAMI
2008
162views more  PAMI 2008»
15 years 3 months ago
Dimensionality Reduction of Clustered Data Sets
We present a novel probabilistic latent variable model to perform linear dimensionality reduction on data sets which contain clusters. We prove that the maximum likelihood solution...
Guido Sanguinetti
DAGM
2006
Springer
15 years 8 months ago
A Modification of the Level Set Speed Function to Bridge Gaps in Data
Abstract. Level set methods have become very popular means for image segmentation in recent years. But due to the data-driven nature of this methods it is difficult to segment obje...
Karsten Rink, Klaus D. Tönnies