Sciweavers

BMCBI
2007

How to decide which are the most pertinent overly-represented features during gene set enrichment analysis

13 years 11 months ago
How to decide which are the most pertinent overly-represented features during gene set enrichment analysis
Background: The search for enriched features has become widely used to characterize a set of genes or proteins. A key aspect of this technique is its ability to identify correlations amongst heterogeneous data such as Gene Ontology annotations, gene expression data and genome location of genes. Despite the rapid growth of available data, very little has been proposed in terms of formalization and optimization. Additionally, current methods mainly ignore the structure of the data which causes results redundancy. For example, when searching for enrichment in GO terms, genes can be annotated with multiple GO terms and should be propagated to the more general terms in the Gene Ontology. Consequently, the gene sets often overlap partially or totally, and this causes the reported enriched GO terms to be both numerous and redundant, hence, overwhelming the researcher with non-pertinent information. This situation is not unique, it arises whenever some hierarchical clustering is performed (e....
Roland Barriot, David J. Sherman, Isabelle Dutour
Added 12 Dec 2010
Updated 12 Dec 2010
Type Journal
Year 2007
Where BMCBI
Authors Roland Barriot, David J. Sherman, Isabelle Dutour
Comments (0)