Optimal Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering

15 years 11 months ago

Download fusion.cs.uni-magdeburg.de

Many applications require the clustering of large amounts of high-dimensional data. Most clustering algorithms, however, do not work e ectively and e ciently in highdimensional space, which is due to the so-called "curse of dimensionality". In addition, the high-dimensional data often contains a signi cant amount of noise which causes additional e ectiveness problems. In this paper, we review and compare the existing algorithms for clustering highdimensional data and show the impact of the curse of dimensionality on their e ectiveness and e ciency. The comparison reveals that condensation-based approaches such as BIRCH or STING are the most promising candidates for achieving the necessary e ciency, but it also shows that basically all condensation-based approaches have severe weaknesses with respect to their e ectiveness in highdimensional space. To overcome these problems, we develop a new clustering technique called OptiGrid which is based on constructing an optimal grid...

Alexander Hinneburg, Daniel A. Keim

Real-time Traffic

Data Sets | Database | Most Clustering Algorithms | Optimal Grid-partitioning | VLDB 1999 |

claim paper

Post Info
More Details (n/a)

Added	05 Aug 2010
Updated	05 Aug 2010
Type	Conference
Year	1999
Where	VLDB
Authors	Alexander Hinneburg, Daniel A. Keim

Comments (0)

Sciweavers

Optimal Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering

Data Sets | Database | Most Clustering Algorithms | Optimal Grid-partitioning | VLDB 1999 |

Explore & Download

Productivity Tools

Sciweavers