A New Family of String Classifiers Based on Local Relatedness

15 years 10 months ago

Download www.ssp.isee.kyushu-u.ac.jp

This paper introduces a new family of string classifiers based on local relatedness. We use three types of local relatedness measurements, namely, longest common substrings (LCStr's), longest common subsequences (LCSeq's), and window-accumulated longest common subsequences (wLCSeq's). We show that finding the optimal classier for given two sets of strings (the positive set and the negative set), is NP-hard for all of the above measurements. In order to achieve practically efficient algorithms for finding the best classifier, we investigate pruning heuristics and fast string matching techniques based on the properties of the local relatedness measurements.

Yasuto Higa, Shunsuke Inenaga, Hideo Bannai, Masay

Real-time Traffic

DIS 2006 | Local Relatedness | Local Relatedness Measurements | Longest Common Subsequences | Theoretical Computer Science |

claim paper

Post Info
More Details (n/a)

Added	22 Aug 2010
Updated	22 Aug 2010
Type	Conference
Year	2006
Where	DIS
Authors	Yasuto Higa, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda

Comments (0)

Sciweavers

A New Family of String Classifiers Based on Local Relatedness

DIS 2006 | Local Relatedness | Local Relatedness Measurements | Longest Common Subsequences | Theoretical Computer Science |

Explore & Download

Productivity Tools

Sciweavers