The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Pairwise constraints specify whether or not two samples should be in one cluster. Although it has been successful to incorporate them into traditional clustering methods, such as ...
Image auto-annotation is an important open problem in
computer vision. For this task we propose TagProp, a discriminatively
trained nearest neighbor model. Tags of test
images a...
Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek...
Conventional image categorization techniques primarily rely on low-level visual cues. In this paper, we describe a multimodal fusion scheme which improves the image classification...
Abstract. Clustering high dimensional data with sparse features is challenging because pairwise distances between data items are not informative in high dimensional space. To addre...