Sciweavers

CORR
2011
Springer
183views Education» more  CORR 2011»
13 years 3 months ago
Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction
For large, real-world inductive learning problems, the number of training examples often must be limited due to the costs associated with procuring, preparing, and storing the tra...
Foster J. Provost, Gary M. Weiss
JAIR
2002
95views more  JAIR 2002»
13 years 11 months ago
SMOTE: Synthetic Minority Over-sampling Technique
An approach to the construction of classifiers from imbalanced datasets is described. A dataset is imbalanced if the classification categories are not approximately equally repres...
Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hal...
CORR
2000
Springer
84views Education» more  CORR 2000»
13 years 11 months ago
Robust Classification for Imprecise Environments
In real-world environments it usually is difficult to specify target operating conditions precisely, for example, target misclassification costs. This uncertainty makes building ro...
Foster J. Provost, Tom Fawcett
ICML
2000
IEEE
15 years 9 days ago
A Boosting Approach to Topic Spotting on Subdialogues
We report the results of a study on topic spotting in conversational speech. Using a machine learning approach, we build classifiers that accept an audio file of conversational hu...
Kary Myers, Michael J. Kearns, Satinder P. Singh, ...