Optimal Spaced Seeds for Faster Approximate String Matching

16 years 2 days ago

Download www.cs.bgu.ac.il

Filtering is a standard technique for fast approximate string matching in practice. In ﬁltering, a quick ﬁrst step is used to rule out almost all positions of a text as possible starting positions for a pattern. Typically this step consists of ﬁnding the exact matches of small parts of the pattern. In the followup step, a slow method is used to verify or eliminate each remaining position. The running time of such a method depends largely on the quality of the ﬁltering step, as measured by its false positives rate. The quality of such a method depends on the number of true matches that it misses, that is, on its false negative rate. A spaced seed is a recently introduced type of ﬁlter pattern that allows gaps (i.e. don’t cares) in the small sub-pattern to be searched for. Spaced seeds promise to yield a much lower false positives rate, and thus have been extensively studied, though heretofore only heuristically or statistically. In this paper, we show how to design almost o...

Martin Farach-Colton, Gad M. Landau, Süleyman

Real-time Traffic

False Negatives | False Positives Rate | ICALP 2005 | Spaced Seeds | Theoretical Computer Science |

claim paper

» Efficient Sampling of Disparity Space for Fast And Accurate Matching

» Accelerating String Set Matching in FPGA Hardware for Bioinformatics Research

» Statistical Shape Models Using ElasticString Representations

» Using convex hulls to represent classifier conditions

» Selectivity Estimation for Boolean Queries

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ICALP
Authors	Martin Farach-Colton, Gad M. Landau, Süleyman Cenk Sahinalp, Dekel Tsur

Comments (0)

Sciweavers

Optimal Spaced Seeds for Faster Approximate String Matching

False Negatives | False Positives Rate | ICALP 2005 | Spaced Seeds | Theoretical Computer Science |

Explore & Download

Productivity Tools

Sciweavers