Sciweavers

883 search results - page 89 / 177
» Applying Grid Technologies to Distributed Data Mining
Sort
View
PODS
2009
ACM
134views Database» more  PODS 2009»
14 years 8 months ago
An efficient rigorous approach for identifying statistically significant frequent itemsets
As advances in technology allow for the collection, storage, and analysis of vast amounts of data, the task of screening and assessing the significance of discovered patterns is b...
Adam Kirsch, Michael Mitzenmacher, Andrea Pietraca...
SC
2009
ACM
14 years 2 months ago
Lessons learned from a year's worth of benchmarks of large data clouds
In this paper, we discuss some of the lessons that we have learned working with the Hadoop and Sector/Sphere systems. Both of these systems are cloud-based systems designed to sup...
Yunhong Gu, Robert L. Grossman
ICDM
2006
IEEE
131views Data Mining» more  ICDM 2006»
14 years 2 months ago
A Maximum Likelihood Approach to Noise Estimation for Intensity Measurements in Biology
Often, measurement of biological components generates results, that are corrupted by noise. Noise can be caused by various factors like the detectors themselves, sample properties...
Frank Klawonn, Claudia Hundertmark, Lothar Jä...
BMCBI
2006
126views more  BMCBI 2006»
13 years 8 months ago
Effect of data normalization on fuzzy clustering of DNA microarray data
Background: Microarray technology has made it possible to simultaneously measure the expression levels of large numbers of genes in a short time. Gene expression data is informati...
Seo Young Kim, Jae Won Lee, Jong Sung Bae
TDP
2010
166views more  TDP 2010»
13 years 2 months ago
Communication-Efficient Privacy-Preserving Clustering
The ability to store vast quantities of data and the emergence of high speed networking have led to intense interest in distributed data mining. However, privacy concerns, as well ...
Geetha Jagannathan, Krishnan Pillaipakkamnatt, Reb...