Sciweavers

22 search results - page 4 / 5
» An Effective Approach to Enhance Centroid Classifier for Tex...
Sort
View
IRI
2007
IEEE
14 years 2 months ago
Enhancing Text Analysis via Dimensionality Reduction
Many applications require analyzing vast amounts of textual data, but the size and inherent noise of such data can make processing very challenging. One approach to these issues i...
David G. Underhill, Luke McDowell, David J. Marche...
SIGIR
2000
ACM
14 years 1 days ago
Hierarchical classification of Web content
This paper explores the use of hierarchical structure for classifying a large, heterogeneous collection of web content. The hierarchical structure is initially used to train diffe...
Susan T. Dumais, Hao Chen
ADCS
2004
13 years 9 months ago
Phrases and Feature Selection in E-Mail Classification
In this paper we study the effectiveness of using a phrase-based representation in e-mail classification, and the affect this approach has on a number of machine learning algorithm...
Elisabeth Crawford, Irena Koprinska, Jon Patrick
ECAI
2006
Springer
13 years 11 months ago
Automatic Term Categorization by Extracting Knowledge from the Web
This paper addresses the problem of categorizing terms or lexical entities into a predefined set of semantic domains exploiting the knowledge available on-line in the Web. The prop...
Leonardo Rigutini, Ernesto Di Iorio, Marco Ernande...
DEXA
2006
Springer
197views Database» more  DEXA 2006»
13 years 9 months ago
Cleaning Web Pages for Effective Web Content Mining
Classifying and mining noise-free web pages will improve on accuracy of search results as well as search speed, and may benefit webpage organization applications (e.g., keyword-bas...
Jing Li, Christie I. Ezeife