Sciweavers

220 search results - page 12 / 44
» Peer-to-Peer Usage Analysis: a Distributed Mining Approach
Sort
View
SDM
2012
SIAM
245views Data Mining» more  SDM 2012»
11 years 9 months ago
Deterministic CUR for Improved Large-Scale Data Analysis: An Empirical Study
Low-rank approximations which are computed from selected rows and columns of a given data matrix have attracted considerable attention lately. They have been proposed as an altern...
Christian Thurau, Kristian Kersting, Christian Bau...
KDD
2004
ACM
158views Data Mining» more  KDD 2004»
14 years 7 months ago
A generalized maximum entropy approach to bregman co-clustering and matrix approximation
Co-clustering is a powerful data mining technique with varied applications such as text clustering, microarray analysis and recommender systems. Recently, an informationtheoretic ...
Arindam Banerjee, Inderjit S. Dhillon, Joydeep Gho...
KDD
2006
ACM
150views Data Mining» more  KDD 2006»
14 years 7 months ago
Maximally informative k-itemsets and their efficient discovery
In this paper we present a new approach to mining binary data. We treat each binary feature (item) as a means of distinguishing two sets of examples. Our interest is in selecting ...
Arno J. Knobbe, Eric K. Y. Ho
ICDM
2009
IEEE
139views Data Mining» more  ICDM 2009»
14 years 2 months ago
A Bootstrap Approach to Eigenvalue Correction
—Eigenvalue analysis is an important aspect in many data modeling methods. Unfortunately, the eigenvalues of the sample covariance matrix (sample eigenvalues) are biased estimate...
Anne Hendrikse, Luuk J. Spreeuwers, Raymond N. J. ...
GFKL
2007
Springer
139views Data Mining» more  GFKL 2007»
14 years 1 months ago
The Noise Component in Model-based Cluster Analysis
The so-called noise-component has been introduced by Banfield and Raftery (1993) to improve the robustness of cluster analysis based on the normal mixture model. The idea is to ad...
Christian Hennig, Pietro Coretto