Sciweavers

CIBB
2008

Unsupervised Stability-Based Ensembles to Discover Reliable Structures in Complex Bio-molecular Data

14 years 2 months ago
Unsupervised Stability-Based Ensembles to Discover Reliable Structures in Complex Bio-molecular Data
The assessment of the reliability of clusters discovered in bio-molecular data is a central issue in several bioinformatics problems. Several methods based on the concept of stability have been proposed to estimate the reliability of each individual cluster as well as the "optimal" number of clusters. In this conceptual framework a clustering ensemble is obtained through bootstrapping techniques, noise injection into the data or random projections into lower dimensional subspaces. A measure of the reliability of a given clustering is obtained through specific stability/reliability scores based on the similarity of the clusterings composing the ensemble. Classical stability-based methods do not provide an assessment of the statistical significance of the clustering solutions and are not able to directly detect multiple structures (e.g. hierarchical structures) simultaneously present in the data. Statistical approaches based on the chi-square distribution and on the Bernstein i...
Alberto Bertoni, Giorgio Valentini
Added 12 Oct 2010
Updated 12 Oct 2010
Type Conference
Year 2008
Where CIBB
Authors Alberto Bertoni, Giorgio Valentini
Comments (0)