Sciweavers

AUSAI
2001
Springer

Fast Text Classification Using Sequential Sampling Processes

14 years 3 months ago
Fast Text Classification Using Sequential Sampling Processes
A central problem in information retrieval is the automated classification of text documents. While many existing methods achieve good levels of performance, they generally require levels of computation that prevent them from making sufficiently fast decisions in some applied setting. Using insights gained from examining the way humans make fast decisions when classifying text documents, two new text classification algorithms are developed based on sequential sampling processes. These algorithms make extremely fast decisions, because they need to examine only a small number of words in each text document. Evaluation against the Reuters-21578 collection shows both techniques have levels of performance that approach benchmark methods, and the ability of one of the classifiers to produce realistic measures of confidence in its decisions is shown to be useful for prioritizing relevant documents.
Michael D. Lee
Added 23 Aug 2010
Updated 23 Aug 2010
Type Conference
Year 2001
Where AUSAI
Authors Michael D. Lee
Comments (0)