Sciweavers

ICML
2003
IEEE

Tackling the Poor Assumptions of Naive Bayes Text Classifiers

15 years 1 months ago
Tackling the Poor Assumptions of Naive Bayes Text Classifiers
Naive Bayes is often used as a baseline in text classification because it is fast and easy to implement. Its severe assumptions make such efficiency possible but also adversely affect the quality of its results. In this paper we propose simple, heuristic solutions to some of the problems with Naive Bayes classifiers, addressing both systemic issues as well as problems that arise because text is not actually generated according to a multinomial model. We find that our simple corrections result in a fast algorithm that is competitive with stateof-the-art text classification algorithms such as the Support Vector Machine.
Jason D. Rennie, Lawrence Shih, Jaime Teevan, Davi
Added 17 Nov 2009
Updated 17 Nov 2009
Type Conference
Year 2003
Where ICML
Authors Jason D. Rennie, Lawrence Shih, Jaime Teevan, David R. Karger
Comments (0)