Performance thresholding in practical text classification

14 years 4 months ago

Download www.ims.uni-stuttgart.de

In practical classification, there is often a mix of learnable and unlearnable classes and only a classifier above a minimum performance threshold can be deployed. This problem is exacerbated if the training set is created by active learning. The bias of actively learned training sets makes it hard to determine whether a class has been learned. We give evidence that there is no general and efficient method for reducing the bias and correctly identifying classes that have been learned. However, we characterize a number of scenarios where active learning can succeed despite these difficulties. Categories and Subject Descriptors I.2.6 [Artificial Intelligence]: Learning; H.3.1 [Information Storage and Retrieval]: Content Analysis and Indexing General Terms Algorithms, Measurement, Performance, Experimentation Keywords practical text classification, active learning, accuracy estimation, learnability

Hinrich Schütze, Emre Velipasaoglu, Jan O. Pe

Real-time Traffic

Active Learning | CIKM 2006 | Information Management | Minimum Performance Threshold | Training Sets |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2010
Updated	20 Aug 2010
Type	Conference
Year	2006
Where	CIKM
Authors	Hinrich Schütze, Emre Velipasaoglu, Jan O. Pedersen

Comments (0)

Sciweavers

Performance thresholding in practical text classification

Active Learning | CIKM 2006 | Information Management | Minimum Performance Threshold | Training Sets |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers