Unsupervised Stability-Based Ensembles to Discover Reliable Structures in Complex Bio-molecular Data

15 years 9 months ago

Download eprints.pascal-network.org

The assessment of the reliability of clusters discovered in bio-molecular data is a central issue in several bioinformatics problems. Several methods based on the concept of stability have been proposed to estimate the reliability of each individual cluster as well as the "optimal" number of clusters. In this conceptual framework a clustering ensemble is obtained through bootstrapping techniques, noise injection into the data or random projections into lower dimensional subspaces. A measure of the reliability of a given clustering is obtained through specific stability/reliability scores based on the similarity of the clusterings composing the ensemble. Classical stability-based methods do not provide an assessment of the statistical significance of the clustering solutions and are not able to directly detect multiple structures (e.g. hierarchical structures) simultaneously present in the data. Statistical approaches based on the chi-square distribution and on the Bernstein i...

Alberto Bertoni, Giorgio Valentini

Real-time Traffic

Artificial Intelligence | Bio-molecular Data | CIBB 2008 | Multiple Structures | Stability-based Methods |

claim paper

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	CIBB
Authors	Alberto Bertoni, Giorgio Valentini

Sciweavers

Unsupervised Stability-Based Ensembles to Discover Reliable Structures in Complex Bio-molecular Data

Artificial Intelligence | Bio-molecular Data | CIBB 2008 | Multiple Structures | Stability-based Methods |

Explore & Download

Productivity Tools

Sciweavers