Sciweavers

274 search results - page 4 / 55
» Approximating Edit Distance Efficiently
PVLDB
2010
13 years 4 months ago
Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints
A string similarity join finds similar pairs between two collections of strings. It is an essential operation in many applications, such as data integration and cleaning, and has ...
Jiannan Wang, Guoliang Li, Jianhua Feng
BTW
2005
Springer
Database
14 years 3 months ago
Measuring the Quality of Approximated Clusterings
Clustering has become an increasingly important task in modern application domains. In many areas, e.g. when clustering complex objects, in distributed clustering, or whe...
Hans-Peter Kriegel, Martin Pfeifle
ICDE
2008
IEEE
Database
14 years 11 months ago
Approximate Joins for Data-Centric XML
In data integration applications, a join matches elements that are common to two data sources. Often, however, elements are represented slightly differently in each source, so an app...
Nikolaus Augsten, Michael H. Böhlen, Curtis E...
JCB
2002
13 years 9 months ago
A General Edit Distance between RNA Structures
Arc-annotated sequences are useful in representing the structural information of RNA sequences. In general, RNA secondary and tertiary structures can be represented as a set of ne...
Tao Jiang, Guohui Lin, Bin Ma, Kaizhong Zhang
SODA
2010
ACM
Algorithms
13 years 8 months ago
Near-Optimal Sublinear Time Algorithms for Ulam Distance
We give near-tight bounds for estimating the edit distance between two non-repetitive strings (Ulam distance) with constant approximation, in sub-linear time. For two strings of l...
Alexandr Andoni, Huy L. Nguyen