Measuring quality of similarity functions in approximate data matching

This paper presents a method for assessing the quality of similarity functions. The scenario taken into account is that of approximate data matching, in which it is necessary to determine whether two data instances represent the same real-world object. Our method is based on the semi-automatic estimation of optimal threshold values. We propose two methods for performing such estimation. The first method is an algorithm based on a reward function, and the second is a statistical method. Experiments were carried out to validate the techniques proposed. The results show that both methods for threshold estimation produce similar results. The output of such methods was used to design a grading function for similarity functions. This grading function, called discernability, was used to compare a number of similarity functions applied to an experimental data set. © 2006 Elsevier Ltd. All rights reserved.
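
The abstract does not spell out the reward function, so the following is only a minimal sketch of what reward-based threshold estimation could look like, not the authors' algorithm. It assumes a hypothetical reward of +1 for every labeled pair classified correctly and -1 for every pair classified incorrectly, where a pair is declared a match when its similarity score is at or above the threshold. The function name `estimate_threshold` and the sample scores are illustrative assumptions.

```python
from typing import List, Tuple


def estimate_threshold(scored_pairs: List[Tuple[float, bool]]) -> Tuple[float, int]:
    """Pick the candidate threshold with the highest total reward.

    scored_pairs holds (similarity score, is_true_match) for labeled pairs.
    Assumed reward (not taken from the paper): +1 per correct decision,
    -1 per wrong decision, with score >= threshold meaning "match".
    """
    if not scored_pairs:
        raise ValueError("at least one labeled pair is required")

    # Only observed scores can change the classification, so they are
    # the only candidate thresholds worth trying.
    candidates = sorted({score for score, _ in scored_pairs})

    best_threshold = candidates[0]
    best_reward = None
    for threshold in candidates:
        reward = 0
        for score, is_match in scored_pairs:
            predicted_match = score >= threshold
            reward += 1 if predicted_match == is_match else -1
        # Strict comparison keeps the lowest threshold among ties.
        if best_reward is None or reward > best_reward:
            best_threshold, best_reward = threshold, reward
    return best_threshold, best_reward


# Hypothetical labeled scores produced by some similarity function.
pairs = [
    (0.95, True), (0.90, True), (0.85, True), (0.80, True),
    (0.75, False), (0.60, False), (0.30, False),
]
threshold, reward = estimate_threshold(pairs)
print(threshold, reward)
```

With a labeled sample like this, the chosen threshold is the score that best separates matching from non-matching pairs. How such an output feeds into the discernability grade is defined in the paper itself and is not reproduced here.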

Roberto da Silva, Raquel Kolitski Stasiu, Viviane Moreira Orengo, Carlos A. Heuser

Estimation | JOI 2007 | Optimal Threshold Values | Similarity Functions |

» A comparative study of similarity measures for content-based multimedia retrieval

» Fast approximate hierarchical clustering using similarity heuristics

» Image registration convex weighting functions for histogram-based similarity measures

» Dimensionality Reduction and Similarity Computation by Inner Product Approximations

» Toward an adaptive String Similarity Measure for Matching Product Offers

» Learning Term-weighting Functions for Similarity Measures

» Measuring the Quality of Approximated Clusterings

» Multiple Sequence Alignment using Fuzzy Logic

Post Info

Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2007
Where	JOI
Authors	Roberto da Silva, Raquel Kolitski Stasiu, Viviane Moreira Orengo, Carlos A. Heuser