Cloud technologies for bioinformatics applications

15 years 11 months ago

Download grids.ucs.indiana.edu

Executing large number of independent tasks or tasks that perform minimal inter-task communication in parallel is a common requirement in many domains. In this paper, we present our experience in applying two new Microsoft technologies Dryad and Azure to three bioinformatics applications. We also compare with traditional MPI and Apache Hadoop MapReduce implementation in one example. The applications are an EST (Expressed Sequence Tag) sequence assembly program, PhyloD statistical package to identify HLA-associated viral evolution, and a pairwise Alu gene alignment application. We give detailed performance discussion on a 768 core Windows HPC Server cluster and an Azure cloud. All the applications start with a “doubly data parallel step” involving independent data chosen from two similar (EST, Alu) or two different databases (PhyloD). There are different structures for final stages in each application. Categories and Subject Descriptors

Xiaohong Qiu, Jaliya Ekanayake, Scott Beason, Thil

Real-time Traffic

Expressed Sequence Tag | Microsoft Technologies Dryad | Pairwise Alu Gene | SC 2009 |

claim paper

» Cloud Computing The Next Revolution in Information Technology

» SciCloud Scientific Computing on the Cloud

» Opportunities and Challenges in Running Scientific Workflows on the Cloud

» Providing Scalable Database Services on the Cloud

» PerfCloud GRID Services for PerformanceOriented Development of Cloud Computing Application...

» New Instruments and Technologies for Cultural Heritage Survey Full Integration between Poi...

» Personalization as a service the architecture and a case study

» CloudCmp comparing public cloud providers

Post Info
More Details (n/a)

Added	23 Jul 2010
Updated	23 Jul 2010
Type	Conference
Year	2009
Where	SC
Authors	Xiaohong Qiu, Jaliya Ekanayake, Scott Beason, Thilina Gunarathne, Geoffrey Fox, Roger S. Barga, Dennis Gannon

Comments (0)

Sciweavers

Cloud technologies for bioinformatics applications

Expressed Sequence Tag | Microsoft Technologies Dryad | Pairwise Alu Gene | SC 2009 |

Explore & Download

Productivity Tools

Sciweavers