Sciweavers

» Document Representation and Dimension Reduction for Text Clu...
ICDM
2009
IEEE
Data Mining
13 years 5 months ago
SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering
Text classification poses some specific challenges. One such challenge is high dimensionality, where each document (data point) contains only a small subset of the dimensions. In this pap...
Mohammad Salim Ahmed, Latifur Khan
ACL
2012
11 years 10 months ago
A Novel Burst-based Text Representation Model for Scalable Event Detection
Mining retrospective events from text streams has been an important research topic. The classic text representation model (i.e., the vector space model) cannot model temporal aspects of d...
Xin Zhao, Rishan Chen, Kai Fan, Hongfei Yan, Xiaom...
IRI
2007
IEEE
14 years 1 month ago
Enhancing Text Analysis via Dimensionality Reduction
Many applications require analyzing vast amounts of textual data, but the size and inherent noise of such data can make processing very challenging. One approach to these issues i...
David G. Underhill, Luke McDowell, David J. Marche...
ICONIP
1998
13 years 9 months ago
Automated Text Categorization Using Support Vector Machine
In this paper, we study the use of support vector machine in text categorization. Unlike other machine learning techniques, it allows easy incorporation of new documents into an e...
James Tin-Yau Kwok
ICPR
2008
IEEE
14 years 2 months ago
A robust technique for text extraction in mixed-type binary documents
A crucial preprocessing stage in applications such as OCR is text extraction from mixed-type documents. The present work, in contrast to most until now, successfully faces the pro...
Charalambos Strouthopoulos, Athanasios Nikolaidis