Variable Selection in Model-Based Clustering: To Do or To Facilitate

15 years 7 months ago

Download www.cse.ust.hk

Variable selection for cluster analysis is a difficult problem. The difficulty originates not only from the lack of class information but also the fact that high-dimensional data are often multifaceted and can be meaningfully clustered in multiple ways. In such a case the effort to find one subset of attributes that presumably gives the "best" clustering may be misguided. It makes more sense to facilitate variable selection by domain experts, that is, to systematically identify various facets of a data set (each being based on a subset of attributes), cluster the data along each one, and present the results to the domain experts for appraisal and selection. In this paper, we propose a generalization of the Gaussian mixture model, show its ability to cluster data along multiple facets, and demonstrate it is often more reasonable to facilitate variable selection than to perform it.

Leonard K. M. Poon, Nevin Lianwen Zhang, Tao Chen,

Real-time Traffic

Cluster Analysis | Domain Experts | ICML 2010 | Machine Learning | Variable Selection |

claim paper

» The Use of Various Data Mining and Feature Selection Methods in the Analysis of a Populati...

» Contextdependent clustering for dynamic cellular state modeling of microarray gene express...

» Robust Calibration for Localization in Clustered Wireless Sensor Networks

» GOurmet A tool for quantitative comparison and visualization of gene expression profiles b...

» Interactive exploration of very large relational datasets through 3D dynamic projections

» Achieving Sharp Deliveries in Supply Chains through Variance Pool Allocation

» Can molecular dynamics simulations help in discriminating correct from erroneous protein 3...

» A novel series of compositionally biased substitution matrices for comparing Plasmodium pr...

Post Info
More Details (n/a)

Added	09 Nov 2010
Updated	09 Nov 2010
Type	Conference
Year	2010
Where	ICML
Authors	Leonard K. M. Poon, Nevin Lianwen Zhang, Tao Chen, Yi Wang

Comments (0)

Sciweavers

Variable Selection in Model-Based Clustering: To Do or To Facilitate

Cluster Analysis | Domain Experts | ICML 2010 | Machine Learning | Variable Selection |

Explore & Download

Productivity Tools

Sciweavers