pFANGS: Parallel high speed sequence mapping for Next Generation 454-roche Sequencing reads

14 years 5 months ago

Download www.hicomb.org

Millions of DNA sequences (reads) are generated by Next Generation Sequencing machines everyday. There is a need for high performance algorithms to map these sequences to the reference genome to identify single nucleotide polymorphisms or rare transcripts to fulfill the dream of personalized medicine. In this paper, we present a high-throughput parallel sequence mapping program pFANGS. pFANGS is designed to find all the matches of a query sequence in the reference genome tolerating a large number of mismatches or insertions/deletions. pFANGS partitions the computational workload and data among all the processes and employs loadbalancing mechanisms to ensure better process efficiency. Our experiments show that, with 512 processors, we are able to map approximately 31 million 454/Roche queries of length 500 each to a reference human genome per hour allowing 5 mismatches or insertion/deletions at full sensitivity. We also report and compare the performance results of two alternative paral...

Sanchit Misra, Ramanathan Narayanan, Wei-keng Liao

Real-time Traffic

Alternative Parallel Implementations | Distributed And Parallel Computing | Genome | IPPS 2010 | Reference Human Genome |

claim paper

Post Info
More Details (n/a)

Added	13 Feb 2011
Updated	13 Feb 2011
Type	Journal
Year	2010
Where	IPPS
Authors	Sanchit Misra, Ramanathan Narayanan, Wei-keng Liao, Alok N. Choudhary, Simon Lin

Comments (0)

Sciweavers

pFANGS: Parallel high speed sequence mapping for Next Generation 454-roche Sequencing reads

Alternative Parallel Implementations | Distributed And Parallel Computing | Genome | IPPS 2010 | Reference Human Genome |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers