Images are being produced and made available in ever increasing numbers; but how can we find images "like this one" that are of interest to us? Many different systems hav...
Overlapping classes and outliers can significantly decrease a classifier performance. We adress here the problem of giving a classifier the ability to reject some patterns eith...
Protein sequence analysis is an important tool to decode the logic of life. One of the most important similarity measures in this area is the edit distance between amino acids of ...
It is often useful to get high-level views of datasets in order to identify areas of interest worthy of further exploration. In relational databases, the high-level view can be de...
We introduce a cost model for the M-tree access method [Ciaccia et al., 1997] which provides estimates of CPU (distance computations) and I/O costs for the execution of similarity ...