Sciweavers

385 search results - page 4 / 77
» Improving data mining utility with projective sampling
Sort
View
PAKDD
2009
ACM
87views Data Mining» more  PAKDD 2009»
14 years 3 months ago
Application-Independent Feature Construction from Noisy Samples
When training classifiers, presence of noise can severely harm their performance. In this paper, we focus on “non-class” attribute noise and we consider how a frequent fault-t...
Dominique Gay, Nazha Selmaoui, Jean-Françoi...
PAKDD
2005
ACM
94views Data Mining» more  PAKDD 2005»
14 years 2 months ago
Progressive Sampling for Association Rules Based on Sampling Error Estimation
We explore in this paper a progressive sampling algorithm, called Sampling Error Estimation (SEE), which aims to identify an appropriate sample size for mining association rules. S...
Kun-Ta Chuang, Ming-Syan Chen, Wen-Chieh Yang
ACMDIS
2010
ACM
13 years 8 months ago
The CLOTHO project: predicting application utility
When using the computer, each user has some notion that "these applications are important" at a given point in time. We term this subset of applications that the user va...
Joshua M. Hailpern, Nicholas Jitkoff, Joseph Subid...
ICDM
2003
IEEE
220views Data Mining» more  ICDM 2003»
14 years 1 months ago
Exploiting Unlabeled Data for Improving Accuracy of Predictive Data Mining
Predictive data mining typically relies on labeled data without exploiting a much larger amount of available unlabeled data. The goal of this paper is to show that using unlabeled...
Kang Peng, Slobodan Vucetic, Bo Han, Hongbo Xie, Z...
IRAL
2003
ACM
14 years 1 months ago
Improving document clustering by utilizing meta-data
In this paper, we examine how to improve the precision and recall of document clustering by utilizing meta-data. We use meta-data through NewsML tags to assist clustering and show...
Kam-Fai Wong, Nam-Kiu Chan, Kam-Lai Wong