Anchoring millions of distinct reads on the human genome within seconds


With the advent of next-generation DNA sequencing machines, there is a growing need for computational tools that can accurately and quickly anchor the millions of short DNA sequences (or reads) that these machines generate onto the genomes of target organisms. In this work, we describe ‘Q-Pick’, a new and efficient method for solving this problem. Q-Pick rapidly identifies and anchors such reads, which may contain wildcards, in large genomic databases, while guaranteeing completeness of results and efficiency of operation. Q-Pick requires very modest memory and computational resources and is trivially amenable to SIMD implementation; it can also be easily extended to handle longer reads, e.g. 75-mers or longer. Our experiments indicate that Q-Pick can anchor millions of distinct short reads against both strands of a mammalian genome in seconds, using a single processor core.

Tien Huynh, Michail Vlachos, Isidore Rigoutsos


Database | Distinct Short Reads | DNA Sequencing Machines | EDBT 2010 | Generated Short DNA


Post Info

Added	18 May 2010
Updated	18 May 2010
Type	Conference
Year	2010
Where	EDBT
Authors	Tien Huynh, Michail Vlachos, Isidore Rigoutsos
