Cost-Aware Processing of Similarity Queries in Structured Overlays

16 years 22 days ago

Download www.manfredhauswirth.org

Large-scale distributed data management with P2P systems requires the existence of similarity operators for queries as we cannot assume that all users will agree on exactly the same schema and value representations and data quality problems due to spelling errors and typos. In this paper, we present an approach for efﬁcient processing of similarity selections and joins in a structured overlay. We show that there are several possible strategies exploiting DHT features to a different extent (i.e., key organization, routing, multicasting) and thus the choice of the best operator implementation in a given situation (selectivity, data distribution, load) should be based on cost information allowing the system to estimate the computation and communication costs of query execution plans. Hence, we present a cost model for similarity operations on structured data in a DHT and demonstrate the efﬁciency of our proposal by experimental results from a large-scale PlanetLab deployment. 1 Motiv...

Marcel Karnstedt, Kai-Uwe Sattler, Manfred Hauswir

Real-time Traffic

Data Management | Large-scale Distributed Data | P2P 2006 | Peer-to-Peer Computing | Public Data Management |

claim paper

» The Challenges of Merging Two Similar Structured Overlays A Tale of Two Networks

» PublishSubscribe with RDF Data over Large Structured Overlay Networks

» Query workloadaware overlay construction using histograms

» Distributed Evaluation of Continuous Equijoin Queries over Large Structured Overlay Networ...

» Range Queries in TrieStructured Overlays

» SCAN a smallworld structured p2p overlay for multidimensional queries

» Completeness Estimation of Range Queries in Structured Overlays

Post Info
More Details (n/a)

Added	12 Jun 2010
Updated	12 Jun 2010
Type	Conference
Year	2006
Where	P2P
Authors	Marcel Karnstedt, Kai-Uwe Sattler, Manfred Hauswirth, Roman Schmidt

Comments (0)

Sciweavers

Cost-Aware Processing of Similarity Queries in Structured Overlays

Data Management | Large-scale Distributed Data | P2P 2006 | Peer-to-Peer Computing | Public Data Management |

Explore & Download

Productivity Tools

Sciweavers