Variable-length Intervals in Homology Search

15 years 8 months ago

Download crpit.com

Fast, accurate, and scalable search techniques for homology searching of large genomic collections are becoming an increasingly important requirement as genomic sequence collections continue to double in size almost yearly. Almost all homology search techniques rely on extracting fixed-length overlapping sequences from queries and database sequences, and comparing these as the first step in query evaluation; this is a feature of well-known tools such as fasta, blast, and our own cafe technique. In this paper we discuss a novel, variable-length approach to extracting subsequences that is based on homology scoring matrices. Our motivation is to achieve a balance between the speed and accuracy of fixed-length choices, that is, to encapsulate the speed of longer subsequence lengths and the accuracy of shorter ones. We show that incorporating this approach into our cafe technique leads to a good compromise between accuracy and retrieval efficiency when searching with blosum matrices sensit...

Abhijit Chattaraj, Hugh E. Williams

Real-time Traffic

APBC 2004 | APBC 2007 | Homology Scoring Matrices | Homology Search Techniques | Homology Searching |

claim paper

» Homology Search with Fragmented Nucleic Acid Sequence Patterns

» Regular Expression Matching with MultiStrings

» An Extension of Geiringers Theorem for a Wide Class of Evolutionary Search Algorithms

» Fuzzy Clustering for Categorical Spaces

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	APBC
Authors	Abhijit Chattaraj, Hugh E. Williams

Comments (0)

Sciweavers

Variable-length Intervals in Homology Search

APBC 2004 | APBC 2007 | Homology Scoring Matrices | Homology Search Techniques | Homology Searching |

Explore & Download

Productivity Tools

Sciweavers