Ying Xu, Richard J. Mural†, and Edward C. Uberbacher
Computer Science and Mathematics Division and †Life Sciences Division

Computational methods for gene identification in genomic sequences typically have two phases: coding region prediction and gene parsing. While there are many effective methods for predicting coding regions (exons), parsing the predicted exons into proper gene structures remains, to a large extent, an unsolved problem. This paper presents an algorithm for inferring gene structures from predicted exon candidates, based on Expressed Sequence Tags (ESTs) and biological intuition/rules. The algorithm first finds all the related ESTs in the EST database (dbEST) for each predicted exon, and infers the boundaries of one or a series of genes based on the available EST information and biological rules. It then constructs, within each pair of gene boundaries, the gene models that are most consistent with the EST information. By exploiting EST information and biological rule...
Ying Xu, Richard J. Mural, Edward C. Uberbacher
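
The boundary-inference step described in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: it assumes the dbEST search has already been run, so each predicted exon carries a set of matching EST identifiers. It applies one simplified rule, that exons linked by a shared EST belong to the same gene, and it leaves out the other biological rules the abstract mentions. The names `Exon`, `est_ids`, and `group_exons_by_est` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Exon:
    start: int
    end: int
    # Hypothetical field: IDs of ESTs that matched this predicted exon
    # (assumed to come from a prior database search, not computed here).
    est_ids: set = field(default_factory=set)


def group_exons_by_est(exons):
    """Split exons into contiguous groups; exons sharing an EST stay together.

    Simplifying assumption: an EST that matches two exons implies that every
    exon lying between them (in genomic order) is part of the same gene.
    """
    exons = sorted(exons, key=lambda e: e.start)

    # Record the last exon index at which each EST is seen.
    last_seen = {}
    for i, exon in enumerate(exons):
        for est in exon.est_ids:
            last_seen[est] = i

    groups = []
    current = []
    reach = -1  # furthest exon index the current group is known to extend to
    for i, exon in enumerate(exons):
        if current and i > reach:
            # No EST links the current group to this exon: close the group.
            groups.append(current)
            current = []
        current.append(exon)
        for est in exon.est_ids:
            reach = max(reach, last_seen[est])
    if current:
        groups.append(current)
    return groups


if __name__ == "__main__":
    predicted = [
        Exon(100, 200, {"est1"}),
        Exon(300, 400, {"est1", "est2"}),
        Exon(500, 600, {"est2"}),
        Exon(900, 1000, {"est3"}),
        Exon(1100, 1200),  # no EST support
    ]
    for group in group_exons_by_est(predicted):
        print(group[0].start, group[-1].end, len(group))
```

Each returned group gives one candidate pair of gene boundaries (the start of its first exon and the end of its last). Exons without EST support end up as singleton groups, which a fuller method would have to resolve with additional rules.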