Highly scalable genome assembly on campus grids

Bioinformatics researchers need efficient means to process large collections of sequence data. One application of interest, genome assembly, has great potential for parallelization; however, most previous attempts at parallelization require uncommon high-end hardware. This paper introduces a scalable modular genome assembler that can achieve significant speedup using large numbers of conventional desktop machines, such as those found in a campus computing grid. The system is based on the Celera open-source assembly toolkit and replaces two independent sequential modules with scalable replacements: a scalable candidate selector exploits the distributed memory capacity of a campus grid, while the scalable aligner exploits the distributed computing capacity. For large problems, these modules provide robust task and data management while also achieving speedup with high efficiency on several scales of resources. We show results for several datasets ranging from 738 thousand to over 121...

Christopher Moretti, Michael Olson, Scott J. Emrich, Douglas Thain

Applied Computing | Campus Grid | SC 2009 | Scalable | Scalable Modular Genome |

» Information Services for Dynamically Assembled Semantic Grids

» SeqHive A Reconfigurable Computer Cluster for Genome Resequencing

» Fault tolerant high performance Information Services for dynamic collections of Grid and W...

» ClustalXeed a GUIbased grid computation version for high performance and terabyte size mul...

Post Info

Added	19 May 2010
Updated	19 May 2010
Type	Conference
Year	2009
Where	SC
Authors	Christopher Moretti, Michael Olson, Scott J. Emrich, Douglas Thain
