Sciweavers

BIRD
2007
Springer

Efficient and Scalable Indexing Techniques for Biological Sequence Data

We investigate indexing techniques for sequence data, which are crucial in a wide variety of applications that require efficient, scalable, and versatile search algorithms. Recent research has focused on suffix trees (ST) and suffix arrays (SA) as desirable index representations. Existing solutions for very long sequences, however, provide either efficient index construction or efficient search, but not both. We propose a new ST representation, STTD64, which has reasonable construction time and storage requirements and is efficient in search. We have implemented the construction and search algorithms for the proposed technique and conducted numerous experiments to evaluate its performance on various types of real sequence data. Our results show that while the construction time for STTD64 is comparable with that of current ST-based techniques, it outperforms them in search. Compared to ESA, the best known SA technique, STTD64 has a slower construction time, but has similar space requirement and c...
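For readers unfamiliar with suffix-based indexes, the general idea of a suffix array can be sketched as follows. This is a minimal illustration only, not STTD64 or ESA: the function names `build_suffix_array` and `find_occurrences` are hypothetical, and the naive sort-based construction shown here is far too slow and memory-hungry for the very long sequences the abstract refers to.

```python
def build_suffix_array(text):
    """Naive construction: sort suffix start positions by the suffix they begin.

    Illustrative only; practical tools use much faster construction algorithms.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, sa, pattern):
    """Return the sorted start positions of pattern in text.

    All suffixes that start with pattern form one contiguous block of the
    suffix array, so two binary searches locate the block boundaries.
    """
    m = len(pattern)

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(sa[start:lo])


if __name__ == "__main__":
    dna = "ACGTACGA"  # toy sequence, chosen for illustration
    index = build_suffix_array(dna)
    print(find_occurrences(dna, index, "ACG"))  # prints [0, 4]
```

Once the index is built, each query costs a logarithmic number of string comparisons in the sequence length, which is why the cost of building and storing the index is the main trade-off for this family of techniques.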
Mihail Halachev, Nematollaah Shiri, Anand Thamildurai
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where BIRD
Authors Mihail Halachev, Nematollaah Shiri, Anand Thamildurai