Parallel EST Clustering

Download www.hicomb.org

Expressed sequence tags, abbreviated ESTs, are DNA fragments experimentally derived from expressed portions of genes. Clustering of ESTs is essential for gene recognition and for understanding important genetic variations such as those resulting in diseases. In this paper, we present the design and development of a parallel software system for EST clustering. The novel features of our approach include 1) space-efficient algorithms that keep the space requirement linear in the size of the input data set, 2) a combination of algorithmic techniques that reduce the total work without sacrificing the quality of EST clustering, and 3) the use of parallel processing to reduce the run time and facilitate the clustering of large data sets. Using a combination of these techniques, we report the clustering of 50,000 maize ESTs in 16 minutes on a 32-processor IBM SP. To our knowledge, this is the first effort in building a parallel software system for EST clustering.
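To make the clustering task in the abstract concrete, here is a minimal serial sketch in Python. It is not the paper's algorithm: as a stated assumption, it treats two ESTs as belonging to the same cluster whenever they share an exact substring of length k, and merges them with a union-find structure. The names `DisjointSet` and `cluster_ests` and the parameter `k` are hypothetical, and the sketch ignores reverse complements, sequencing errors, alignment scoring, and parallelism.

```python
from collections import defaultdict


class DisjointSet:
    """Union-find over EST indices 0..n-1 (illustrative helper)."""

    def __init__(self, n):
        self.parent = list(range(n))

    def find(self, x):
        # Path halving keeps the trees shallow.
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            self.parent[rb] = ra


def cluster_ests(ests, k=8):
    """Group ESTs that share at least one exact k-mer (assumed criterion)."""
    ds = DisjointSet(len(ests))
    first_seen = {}  # k-mer -> index of the first EST that contained it
    for i, seq in enumerate(ests):
        for j in range(len(seq) - k + 1):
            kmer = seq[j:j + k]
            owner = first_seen.setdefault(kmer, i)
            if owner != i:
                ds.union(owner, i)
    clusters = defaultdict(list)
    for i in range(len(ests)):
        clusters[ds.find(i)].append(i)
    return sorted(clusters.values())


if __name__ == "__main__":
    # Made-up toy sequences; the first two overlap, the third does not.
    toy = ["ACGTACGTGGA", "GTACGTGGATTC", "TTTTCCCCAAAA"]
    print(cluster_ests(toy, k=6))
```

A real system would replace the exact k-mer test with an overlap or alignment check on promising pairs only; the union-find bookkeeping for merging clusters would stay much the same.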

Anantharaman Kalyanaraman, Srinivas Aluru, Suresh C. Kothari

Distributed And Parallel Computing | EST Clustering | Expressed Sequence Tags | IPPS 2002 | Parallel Software |

» Making sense of EST sequences by CLOBBing them

» A Hybrid Recursive Algorithm for Clustering Expressed Sequence Tags in Chlamydomonas reinh...

» EST clustering error evaluation and correction

» CLU A new algorithm for EST clustering

» Gene capture prediction and overlap estimation in EST sequencing from one or multiple libr...

» Cloud technologies for bioinformatics applications

» ESTIMA a tool for EST management in a multiproject environment

» In silico prediction of UTR repeats using clustered EST data

Post Info

Added	15 Jul 2010
Updated	15 Jul 2010
Type	Conference
Year	2002
Where	IPPS
Authors	Anantharaman Kalyanaraman, Srinivas Aluru, Suresh C. Kothari
