Surrogate Ranking for Very Expensive Similarity Queries

16 years 6 months ago

Download www.cise.ufl.edu

1 We consider the problem of similarity search in applications where the cost of computing the similarity between two records is very expensive, and the similarity measure is not a metric. In such applications, comparing even a tiny fraction of the database records to a single query record can be orders of magnitude slower than reading the entire database from disk, and indexing is often not possible. We develop a general-purpose, statistical framework for answering top-k queries in such databases, when the database administrator is able to supply an inexpensive surrogate ranking function that substitutes for the actual similarity measure. We develop a robust method that learns the relationship between the surrogate function and the similarity measure. Given a query, we use Bayesian statistics to update the model by taking into account the observed partial results. Using the updated model, we construct bounds on the accuracy of the result set obtained via the surrogate ranking. Our exp...

Chris Jermaine, Fei Xu, Mingxi Wu, Ravi Jampani, T

Real-time Traffic

Actual Similarity Measure | Database | ICDE 2010 | Similarity Measure | Surrogate Ranking Function |

claim paper

» BoostMap A Method for Efficient Approximate Similarity Rankings

» Ranked Queries over Sources with Boolean Query Interfaces without Ranking Support

» Hashed samples selectivity estimators for set similarity selection queries

» Performance of query processing implementations in rankingbased text retrieval systems usi...

» An Ontology for Domainoriented Semantic Similarity Search on XML Data

» A Novel Method for Detecting Similar Documents

» Learning ContextSensitive Shape Similarity by Graph Transduction

» A multistep strategy for approximate similarity search in image databases

Post Info
More Details (n/a)

Added	20 Dec 2009
Updated	03 Jan 2010
Type	Conference
Year	2010
Where	ICDE
Authors	Chris Jermaine, Fei Xu, Mingxi Wu, Ravi Jampani, Tamer Kahveci

Comments (0)

Sciweavers

Surrogate Ranking for Very Expensive Similarity Queries

Actual Similarity Measure | Database | ICDE 2010 | Similarity Measure | Surrogate Ranking Function |

Explore & Download

Productivity Tools

Sciweavers