Sciweavers

582 search results - page 47 / 117
» Distance Measures for Well-Distributed Sets
KDD
2004
ACM
14 years 9 months ago
A probabilistic framework for semi-supervised clustering
Unsupervised clustering can be significantly improved using supervision in the form of pairwise constraints, i.e., pairs of instances labeled as belonging to same or different clu...
Sugato Basu, Mikhail Bilenko, Raymond J. Mooney
EDBT
2004
ACM
14 years 8 months ago
LIMBO: Scalable Clustering of Categorical Data
Clustering is a problem of great practical importance in numerous applications. The problem of clustering becomes more challenging when the data is categorical, that is, ...
Periklis Andritsos, Panayiotis Tsaparas, René...
IJDLS
2010
13 years 6 months ago
Annotating Historical Archives of Images
Recent initiatives like the Million Book Project and Google Print Library Project have already archived several million books in digital format, and within a few years a significa...
Xiaoyue Wang, Lexiang Ye, Eamonn J. Keogh, Christi...
CVPR
2001
IEEE
14 years 10 months ago
Learning Models for Object Recognition
We consider learning models for object recognition from examples. Our method is motivated by systems that use the Hausdorff distance as a shape comparison measure. Typically an ob...
Pedro F. Felzenszwalb
ICDE
1999
IEEE
14 years 10 months ago
ROCK: A Robust Clustering Algorithm for Categorical Attributes
Clustering, in data mining, is useful to discover distribution patterns in the underlying data. Clustering algorithms usually employ a distance metric based (e.g., Euclidean) simi...
Sudipto Guha, Rajeev Rastogi, Kyuseok Shim