Chi-Square Classifier for Document Categorization

15 years 11 months ago

Download www.gelbukh.com

The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply the well-known statistical hypothesis test that considers images of documents and domains as normalized vectors. In comparison with existing methods, such approach allows to take into account a random character of initial data. The classifier is developed in the framework of Document Investigator software package.

Mikhail Alexandrov, Alexander F. Gelbukh, George L

Real-time Traffic

CICLING 2001 | Document | Initial Data | Natural Language Processing | Well-known Statistical Hypothesis |

claim paper

» Classifying Documents According to Locational Relevance

» Toward File Consolidation by Document Categorization

» Feature Reinforcement Approach to Polylingual Text Categorization

» Automatic Learning Features Using Bootstrapping for Text Categorization

» Development of a MultiClassifier Approach for Multilingual Text Categorization

» Exploiting structural information for semistructured document categorization

» Categorical Proportional Difference A Feature Selection Method for Text Categorization

» Systematic Construction of Hierarchical Classifier in SVMBased Text Categorization

Post Info
More Details (n/a)

Added	28 Jul 2010
Updated	28 Jul 2010
Type	Conference
Year	2001
Where	CICLING
Authors	Mikhail Alexandrov, Alexander F. Gelbukh, George Lozovoi

Comments (0)

Sciweavers

Chi-Square Classifier for Document Categorization

CICLING 2001 | Document | Initial Data | Natural Language Processing | Well-known Statistical Hypothesis |

Explore & Download

Productivity Tools

Sciweavers