Sciweavers

446 search results - page 66 / 90
» Randomization in Privacy-Preserving Data Mining
Sort
View
KDD
2009
ACM
227views Data Mining» more  KDD 2009»
14 years 10 months ago
Efficiently learning the accuracy of labeling sources for selective sampling
Many scalable data mining tasks rely on active learning to provide the most useful accurately labeled instances. However, what if there are multiple labeling sources (`oracles...
Pinar Donmez, Jaime G. Carbonell, Jeff Schneider
DKE
2008
109views more  DKE 2008»
13 years 10 months ago
Deterministic algorithms for sampling count data
Processing and extracting meaningful knowledge from count data is an important problem in data mining. The volume of data is increasing dramatically as the data is generated by da...
Hüseyin Akcan, Alex Astashyn, Hervé Br...
ICDM
2009
IEEE
205views Data Mining» more  ICDM 2009»
14 years 4 months ago
Active Selection of Sensor Sites in Remote Sensing Applications
— In a data-mining approach, a model for estimation of Aerosol Optical Depth (AOD) from satellite observations is learned using collocated satellite and groundbased observations....
Debasish Das, Zoran Obradovic, Slobodan Vucetic
ICDM
2003
IEEE
158views Data Mining» more  ICDM 2003»
14 years 3 months ago
Combining Multiple Weak Clusterings
A data set can be clustered in many ways depending on the clustering algorithm employed, parameter settings used and other factors. Can multiple clusterings be combined so that th...
Alexander P. Topchy, Anil K. Jain, William F. Punc...
KDD
1998
ACM
120views Data Mining» more  KDD 1998»
14 years 2 months ago
Large Datasets Lead to Overly Complex Models: An Explanation and a Solution
This paper explores unexpected results that lie at the intersection of two common themes in the KDD community: large datasets and the goal of building compact models. Experiments ...
Tim Oates, David Jensen