Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

230

ICDM
2009
IEEE

176views Data Mining» more ICDM 2009»

SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering

15 years 5 months ago

SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering

Download www.utdallas.edu

Text classification poses some specific challenges. One such challenge is its high dimensionality where each document (data point) contains only a small subset of them. In this paper, we propose Semi-supervised Impurity based Subspace Clustering (SISC) in conjunction with -Nearest Neighbor approach, based on semi-supervised subspace clustering that considers the high dimensionality as well as the sparse nature of them in text data. SISC finds clusters in the subspaces of the high dimensional text data where each text document has fuzzy cluster membership. This fuzzy clustering exploits two factors - chi square statistic of the dimensions and the impurity measure within each cluster. Empirical evaluation on real world data sets reveals the effectiveness of our approach as it significantly outperforms other state-of-the-art text classification and subspace clustering algorithms.

Mohammad Salim Ahmed, Latifur Khan

Real-time Traffic

Data Mining | ICDM 2009 | Subspace Clustering | Text Classification | Text Data |

claim paper

Related Content

» Multilabel ASRS Dataset Classification Using Semi Supervised Subspace Clustering

» A Framework for SemiSupervised Learning Based on Subjective and Objective Clustering Crite...

» Enhancing the Performance of SemiSupervised Classification Algorithms with Bridging

» Asymptotic Analysis of Generative SemiSupervised Learning

» On the use of linear programming for unsupervised text classification

» SemiSupervised Learning Using Gaussian Fields and Harmonic Functions

» Regularized MultiClass SemiSupervised Boosting

» Classification and Clustering via Dictionary Learning with Structured Incoherence

» Unsupervised document classification using sequential information maximization

Post Info
More Details (n/a)

Added	18 Feb 2011
Updated	18 Feb 2011
Type	Journal
Year	2009
Where	ICDM
Authors	Mohammad Salim Ahmed, Latifur Khan

Comments (0)