Assessment of discretization techniques for relevant pattern discovery from gene expression data

16 years 2 months ago

Download www.cs.rpi.edu

In the domain of gene expression data analysis, various researchers have recently emphasized the promising application of pattern discovery techniques like association rule mining or formal concept extraction from boolean matrices that encode gene properties. To take the most from these approaches, a needed step concerns gene property encoding (e.g., over-expression) and its need for the discretization of raw gene expression data. The impact of this preprocessing step on both the quantity and the relevancy of the extracted patterns is crucial. In this paper, we study the impact of discretization parameters by a sound comparison between the dendrograms, i.e., trees that are generated by a hierarchical clustering algorithm, computed from raw expression data and from the various derived boolean matrices. Thanks to a new similarity measure and practical validation over several gene expression data sets, we propose a method that supports the choice of a discretization technique and its par...

Ruggero G. Pensa, Claire Leschi, Jéré

Real-time Traffic

Data Mining | Gene Expression Data | KDD 2004 | Raw Expression Data | Raw Gene Expression |

claim paper

» hProfile plots for the discovery and exploration of patterns in gene expression data with ...

» Using Classification and Visualization on Pattern Databases for Gene Expression Data Analy...

» Integrative Biomarker Discovery for Breast Cancer Metastasis from Gene Expression and Prot...

» Chromosomal patterns of gene expression from microarray data methodology validation and cl...

» A biordering approach to linking gene expression with clinical annotations in gastric canc...

» Using transposition for pattern discovery from microarray data

» Incremental wrapperbased gene selection from microarray data for cancer classification

» Fractal Clustering for Microarray Data Analysis

Post Info
More Details (n/a)

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2004
Where	KDD
Authors	Ruggero G. Pensa, Claire Leschi, Jérémy Besson, Jean-François Boulicaut

Comments (0)

Sciweavers

Assessment of discretization techniques for relevant pattern discovery from gene expression data

Data Mining | Gene Expression Data | KDD 2004 | Raw Expression Data | Raw Gene Expression |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers