This paper addresses cost-sensitive classification in the setting where there are costs for measuring each attribute as well as costs for misclassification errors. We show how to formulate this as a Markov Decision Process in which the transition model is learned from the training data. Specifically, we assume a set of training examples in which all attributes (and the true class) have been measured. We describe a learning algorithm based on the AO heuristic search procedure that searches for the classification policy with minimum expected cost. We provide an admissible heuristic for AO that substantially reduces the number of nodes that need to be expanded, particularly when attribute measurement costs are high. To further prune the search space, we introduce a statistical pruning heuristic based on the principle that if the values of two policies are statistically indistinguishable (on the training data), then we can prune one of the policies from the AO search space. Experiments wi...
Valentina Bayer Zubek, Thomas G. Dietterich