Using Text Classifiers for Numerical Classification

15 years 8 months ago

Download www.cise.ufl.edu

Consider a supervised learning problem in which examples contain both numerical- and text-valued features. To use traditional featurevector-based learning methods, one could treat the presence or absence of a word as a Boolean feature and use these binary-valued features together with the numerical features. However, the use of a text-classification system on this is a bit more problematic -- in the most straight-forward approach each number would be considered a distinct token and treated as a word. This paper presents an alternative approach for the use of text classification methods for supervised learning problems with numerical-valued features in which the numerical features are converted into bag-of-words features, thereby making them directly usable by text classification methods. We show that even on purely numerical-valued data the results of textclassification on the derived text-like representation outperforms the more naive numbers-as-tokensrepresentation and,more importan...

Sofus A. Macskassy, Haym Hirsh, Arunava Banerjee,

Real-time Traffic

Classification Methods | IJCAI 2001 | IJCAI 2007 | Numerical Features | Supervised Learning Problems |

claim paper

» Bayesian online classifiers for text classification and filtering

» Classification of ProteinProtein Interaction FullText Documents Using Text and Citation Ne...

» Text Classification using the Concept of Association Rule of Data Mining

» Multivariate Stream Data Classification Using Simple Text Classifiers

» Automatic Quality Assessment of SRS Text by Means of a DecisionTreeBased Text Classifier

» An Iterative Improvement Approach for the Discretization of Numeric Attributes in Bayesian...

» WEB Image Classification Based on the Fusion of Image and Text Classifiers

» Classifying HighDimensional Text and Web Data Using Very Short Patterns

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2001
Where	IJCAI
Authors	Sofus A. Macskassy, Haym Hirsh, Arunava Banerjee, Aynur A. Dayanik

Comments (0)

Sciweavers

Using Text Classifiers for Numerical Classification

Classification Methods | IJCAI 2001 | IJCAI 2007 | Numerical Features | Supervised Learning Problems |

Explore & Download

Productivity Tools

Sciweavers