Sciweavers

DASFAA
2006
IEEE

Probabilistic Similarity Join on Uncertain Data

14 years 6 months ago
Probabilistic Similarity Join on Uncertain Data
An important database primitive for commonly used feature databases is the similarity join. It combines two datasets based on some similarity predicate into one set such that the new set contains pairs of objects of the two original sets. In many different application areas, e.g. sensor databases, location based services or face recognition systems, distances between objects have to be computed based on vague and uncertain data. In this paper, we propose to express the similarity between two uncertain objects by probability density functions which assign a probability value to each possible distance value. By integrating these probabilistic distance functions directly into the join algorithms the full information provided by these functions is exploited. The resulting probabilistic similarity join assigns to each object pair a probability value indicating the likelihood that the object pair belongs to the result set. As the computation of these probability values is very expensive, we ...
Hans-Peter Kriegel, Peter Kunath, Martin Pfeifle,
Added 10 Jun 2010
Updated 10 Jun 2010
Type Conference
Year 2006
Where DASFAA
Authors Hans-Peter Kriegel, Peter Kunath, Martin Pfeifle, Matthias Renz
Comments (0)