Sciweavers

5880 search results - page 162 / 1176
» Data Clustering: A Review
Sort
View
OSDI
2004
ACM
14 years 10 months ago
MapReduce: Simplified Data Processing on Large Clusters
MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to ge...
Jeffrey Dean, Sanjay Ghemawat
IDEAL
2004
Springer
14 years 3 months ago
Visualisation of Distributions and Clusters Using ViSOMs on Gene Expression Data
Microarray datasets are often too large to visualise due to the high dimensionality. The self-organising map has been found useful to analyse massive complex datasets. It can be us...
Swapna Sarvesvaran, Hujun Yin
KES
2005
Springer
14 years 3 months ago
OntoExtractor: A Fuzzy-Based Approach in Clustering Semi-structured Data Sources and Metadata Generation
This paper describes a theoretical approach on data mining, information classifying and a global overview of our OntoExtractor application, concerning the analysis of incoming data...
Zhan Cui, Ernesto Damiani, Marcello Leida, Marco V...
ICDT
2009
ACM
148views Database» more  ICDT 2009»
14 years 11 months ago
Tight results for clustering and summarizing data streams
In this paper we investigate algorithms and lower bounds for summarization problems over a single pass data stream. In particular we focus on histogram construction and K-center c...
Sudipto Guha
ISVC
2009
Springer
14 years 4 months ago
Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster
In this paper, we propose an inherent parallel scheme for 3D image segmentation of large volume data on a GPU cluster. This method originates from an extended Lattice Boltzmann Mod...
Aaron Hagan, Ye Zhao