Sciweavers

» Learning Functions from Imperfect Positive Data
KDD
2002
ACM
Interactive deduplication using active learning
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can resolve when a pair of records refer t...
Sunita Sarawagi, Anuradha Bhamidipaty
PKDD
2001
Springer
Knowledge Discovery in Multi-label Phenotype Data
The biological sciences are undergoing an explosion in the amount of available data. New data analysis methods are needed to deal with the data. We present work using KDD to analys...
Amanda Clare, Ross D. King
CVPR
2010
IEEE
P-N learning: Bootstrapping binary classifiers by structural constraints
This paper shows that the performance of a binary classifier can be significantly improved by the processing of structured unlabeled data, i.e. data are structured if knowing the ...
Zdenek Kalal, Jiri Matas, Krystian Mikolajczyk
KDD
2004
ACM
Regularized multi-task learning
Past empirical work has shown that learning multiple related tasks from data simultaneously can be advantageous in terms of predictive performance relative to learning these tasks...
Theodoros Evgeniou, Massimiliano Pontil
KDD
2002
ACM
Incremental Machine Learning to Reduce Biochemistry Lab Costs in the Search for Drug Discovery
This paper promotes the use of supervised machine learning in laboratory settings where chemists have a large number of samples to test for some property, and are interested in id...
George Forman