Sciweavers

» Mining functional dependencies from data
SDM
2004
SIAM
Subspace Clustering of High Dimensional Data
Clustering suffers from the curse of dimensionality, and similarity functions that use all input features with equal relevance may not be effective. We introduce an algorithm that...
Carlotta Domeniconi, Dimitris Papadopoulos, Dimitr...
ICDM
2008
IEEE
Exploiting Data Semantics to Discover, Extract, and Model Web Sources
We describe DEIMOS, a system that automatically discovers and models new sources of information. The system exploits four core technologies developed by our group that make an en...
José Luis Ambite, Craig A. Knoblock, Kristi...
PAKDD
2005
ACM
Approximated Clustering of Distributed High-Dimensional Data
In many modern application ranges high-dimensional feature vectors are used to model complex real-world objects. Often these objects reside on different local sites. In this paper,...
Hans-Peter Kriegel, Peter Kunath, Martin Pfeifle, ...
ICDM
2002
IEEE
Iterative Clustering of High Dimensional Text Data Augmented by Local Search
The k-means algorithm with cosine similarity, also known as the spherical k-means algorithm, is a popular method for clustering document collections. However, spherical k-means ca...
Inderjit S. Dhillon, Yuqiang Guan, J. Kogan
GFKL
2007
Springer
Mixture Model Based Group Inference in Fused Genotype and Phenotype Data
The analysis of genetic diseases has classically been directed towards establishing direct links between cause, a genetic variation, and effect, the observable deviation of phenot...
Benjamin Georgi, M. Anne Spence, Pamela Flodman, A...