Boosting and Rocchio Applied to Text Filtering

15 years 6 months ago

Download singhal.info

We discuss two learning algorithms for text ﬁltering: modiﬁed Rocchio and a boosting algorithm called AdaBoost. We show how both algorithms can be adapted to maximize any general utility matrix that associates cost (or gain) for each pair of machine prediction and correct label. We ﬁrst show that AdaBoost signiﬁcantly outperforms another highly effective text ﬁltering algorithm. We then compare AdaBoost and Rocchio over three large text ﬁltering tasks. Overall both algorithms are comparable and are quite effective. AdaBoost produces better classiﬁers than Rocchio when the training collection contains a very large number of relevant documents. However, on these tasks, Rocchio runs much faster than AdaBoost.

Robert E. Schapire, Yoram Singer, Amit Singhal

Real-time Traffic

AdaBoost | Algorithm Called Adaboost | Algorithms | Information Management | SIGIR 1998 |

claim paper

Post Info
More Details (n/a)

Added	05 Aug 2010
Updated	05 Aug 2010
Type	Conference
Year	1998
Where	SIGIR
Authors	Robert E. Schapire, Yoram Singer, Amit Singhal

Comments (0)

Sciweavers

Boosting and Rocchio Applied to Text Filtering

AdaBoost | Algorithm Called Adaboost | Algorithms | Information Management | SIGIR 1998 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers