A Vector Space Model for Subjectivity Classification in Urdu aided by Co-Training

The goal of this work is to produce a classifier that can distinguish subjective sentences from objective sentences in Urdu. The labeled data available for training automatic classifiers can be highly imbalanced, especially in the multilingual paradigm, because generating annotations is an expensive task. In this work, we propose a co-training approach for subjectivity analysis in Urdu that augments the positive set (subjective sentences) and generates a negative set (objective sentences) devoid of all samples close to the positive ones. Using the data set thus generated for training, we conduct experiments with SVM and VSM algorithms and show that our modified VSM-based approach works remarkably well as a sentence-level subjectivity classifier.
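To make the two ideas in the abstract concrete, here is a minimal sketch of (1) building a negative set that excludes samples close to the positive ones and (2) classifying a sentence with a plain vector space model. It is not the authors' modified VSM and it does not implement the co-training loop. The term-frequency weighting, the centroid comparison, the `max_sim` threshold, all function names, and the English toy tokens standing in for Urdu sentences are assumptions made purely for illustration.

```python
import math
from collections import Counter


def tf_vector(tokens):
    """Sparse term-frequency vector, scaled to unit length."""
    counts = Counter(tokens)
    norm = math.sqrt(sum(c * c for c in counts.values()))
    return {t: c / norm for t, c in counts.items()} if norm else {}


def cosine(u, v):
    """Cosine similarity of two unit-length sparse vectors."""
    if len(u) > len(v):
        u, v = v, u  # iterate over the shorter vector
    return sum(w * v.get(t, 0.0) for t, w in u.items())


def centroid(vectors):
    """Sum of sparse vectors, rescaled to unit length."""
    total = Counter()
    for vec in vectors:
        for t, w in vec.items():
            total[t] += w
    norm = math.sqrt(sum(w * w for w in total.values()))
    return {t: w / norm for t, w in total.items()} if norm else {}


def build_negative_set(unlabeled, pos_centroid, max_sim=0.2):
    """Keep unlabeled sentences that are far from the positive centroid.

    max_sim is a hypothetical threshold, not a value from the paper.
    """
    return [s for s in unlabeled
            if cosine(tf_vector(s), pos_centroid) <= max_sim]


def classify(tokens, pos_centroid, neg_centroid):
    """Assign the label of the closer centroid (ties go to objective)."""
    vec = tf_vector(tokens)
    if cosine(vec, pos_centroid) > cosine(vec, neg_centroid):
        return "subjective"
    return "objective"


# Toy data (assumed): tokenised sentences, English stand-ins for Urdu.
positive = [["i", "love", "this", "film"],
            ["what", "a", "terrible", "awful", "day"]]
unlabeled = [["the", "train", "leaves", "at", "noon"],
             ["i", "love", "this", "city"],
             ["the", "meeting", "is", "at", "noon"]]

pos_c = centroid([tf_vector(s) for s in positive])
negatives = build_negative_set(unlabeled, pos_c)
neg_c = centroid([tf_vector(s) for s in negatives])
```

With these toy inputs, the unlabeled sentence that shares most of its words with a positive example is excluded from the negative set, and `classify(tokens, pos_c, neg_c)` then labels a new tokenised sentence by whichever centroid it is closer to. A real system would need Urdu tokenisation and a weighting scheme chosen on held-out data.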

Smruthi Mukund, Rohini K. Srihari

COLING 2010 | Computational Linguistics | Level Subjectivity Classifier | Training Automatic Classifiers | Urdu Language |

Post Info

Added	13 May 2011
Updated	13 May 2011
Type	Journal
Year	2010
Where	COLING
Authors	Smruthi Mukund, Rohini K. Srihari

Sciweavers
