Sciweavers

ADC
2006
Springer

Approximate data mining in very large relational data

14 years 6 months ago
Approximate data mining in very large relational data
In this paper we discuss eNERF, an extended version of non-Euclidean relational fuzzy c-means (NERFCM) for approximate clustering in very large (unloadable) relational data. The eNERF procedure consists of four parts: (i) selection of distinguished features by algorithm DF to be monitored during progressive sampling; (ii) progressively sampling a square N × N relation matrix RN by algorithm PS until an n × n sample relation Rn passes a goodness of fit test; (iii) Clustering Rn using algorithm LNERF; and (iv), extension of the LNERF results to RN-Rn by algorithm xNERF, which uses an iterative procedure based on LNERF to compute fuzzy membership values for all of the objects remaining after LNERF clustering of the accepted sample. Three of the four algorithms are new - only LNERF (called NERFCM in the original literature) precedes this article.
James C. Bezdek, Richard J. Hathaway, Christopher
Added 13 Jun 2010
Updated 13 Jun 2010
Type Conference
Year 2006
Where ADC
Authors James C. Bezdek, Richard J. Hathaway, Christopher Leckie, Kotagiri Ramamohanarao
Comments (0)