Sciweavers

136 search results - page 1 / 28
» The ed-tree: An Index for Large DNA Sequence Databases
Sort
View
VLDB
2002
ACM
184views Database» more  VLDB 2002»
14 years 11 months ago
Database indexing for large DNA and protein sequence collections
Our aim is to develop new database technologies for the approximate matching of unstructured string data using indexes. We explore the potential of the suffix tree data structure i...
Ela Hunt, Malcolm P. Atkinson, Robert W. Irving
SSDBM
2003
IEEE
141views Database» more  SSDBM 2003»
14 years 4 months ago
The ed-tree: An Index for Large DNA Sequence Databases
The growing interest in genomic research has caused an explosive growth in the size of DNA databases making it increasely challenging to perform searches on them. In this paper, w...
Zhenqiang Tan, Xia Cao, Beng Chin Ooi, Anthony K. ...
DASFAA
2005
IEEE
136views Database» more  DASFAA 2005»
14 years 4 months ago
Indexing DNA Sequences Using q-Grams
We have observed in recent years a growing interest in similarity search on large collections of biological sequences. Contributing to the interest, this paper presents a method fo...
Xia Cao, Shuai Cheng Li, Anthony K. H. Tung
BMCBI
2006
116views more  BMCBI 2006»
13 years 11 months ago
MICA: desktop software for comprehensive searching of DNA databases
Background: Molecular biologists work with DNA databases that often include entire genomes. A common requirement is to search a DNA database to find exact matches for a nondegener...
William A. Stokes, Benjamin S. Glick
ICDE
2003
IEEE
149views Database» more  ICDE 2003»
15 years 16 days ago
Indexing Weighted-Sequences in Large Databases
We present an index structure for managing weightedsequences in large databases. A weighted-sequence is defined as a two-dimensional structure where each element in the sequence i...
Haixun Wang, Chang-Shing Perng, Wei Fan, Sanghyun ...