An Experiment with Distance Measures for Clustering

15 years 8 months ago

Download www.cse.iitb.ac.in

Distance measure plays an important role in clustering data points. Choosing the right distance measure for a given dataset is a non-trivial problem. In this paper, we study various distance measures and their effect on different clustering techniques. In addition to the standard Euclidean distance, we use Bit-Vector based, Comparative Clustering based, Huffman code based and Dominance based distance measures. We cluster both synthetic datasets and one real life dataset using the above distance measures by employing k-means, matrix partitioning and dominance based clustering algorithms. We analyse the results of our study using a real life dataset of cricket and compare the accuracy of various techniques using synthetic datasets.

Ankita Vimal, Satyanarayana R. Valluri, Kamalakar

Real-time Traffic

COMAD 2008 | Distance Measures | Knowledge Management | Real Life Dataset | Right Distance Measure |

claim paper

» A new Mallows distance based metric for comparing clusterings

» Evaluation of geneexpression clustering via mutual information distance measure

» Adaptive Hausdorff distances and dynamic clustering of symbolic interval data

» Phonetic subspace mixture model for speaker diarization

» Clustering NarrowDomain Short Texts by Using the KullbackLeibler Distance

» Distance Based Subspace Clustering with Flexible Dimension Partitioning

» Fuzzy Clustering of Short TimeSeries and Unevenly Distributed Sampling Points

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	COMAD
Authors	Ankita Vimal, Satyanarayana R. Valluri, Kamalakar Karlapalem

Comments (0)

Sciweavers

An Experiment with Distance Measures for Clustering

COMAD 2008 | Distance Measures | Knowledge Management | Real Life Dataset | Right Distance Measure |

Explore & Download

Productivity Tools

Sciweavers