Sciweavers

COMPSAC
2005
IEEE

Recovering "Lack of Words" in Text Categorization for Item Banks

14 years 5 months ago
Recovering "Lack of Words" in Text Categorization for Item Banks
PKIP, Patterned Keywords in Phrase, is our feature selection approach to text categorization (TC) for item banks. An item bank is a collection of textual data in which each item consists of short sentences and has only a few relevant words for categorization. Traditional TC techniques cannot provide sufficiently accurate results because of a “lack of words” problem. PKIP improves categorization accuracy and recovers from the “lack of words” problem. Our sample item bank is the collection of Thai primary mathematics problems and we use SVM as our classifier. Classification results show that PKIP produces acceptable classification performance.
Atorn Nuntiyagul, Nick Cercone, Kanlaya Naruedomku
Added 24 Jun 2010
Updated 24 Jun 2010
Type Conference
Year 2005
Where COMPSAC
Authors Atorn Nuntiyagul, Nick Cercone, Kanlaya Naruedomkul
Comments (0)