Sciweavers

NLPRS
2001
Springer

Unknown Word Guessing and Part-of-Speech Tagging Using Support Vector Machines

The accuracy of part-of-speech (POS) tagging for unknown words is substantially lower than that for known words. Considering the high accuracy rate of up-to-date statistical POS taggers, unknown words account for a non-negligible portion of the errors. This paper describes POS prediction for unknown words using Support Vector Machines. We achieve high accuracy in POS tag prediction using substrings and surrounding context as the features. Furthermore, we integrate this method with a practical English POS tagger and achieve an accuracy of 97.1%, higher than that of conventional approaches.
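
The abstract names substrings and surrounding context as the features used for unknown-word POS prediction. As a rough illustration only, and not the authors' implementation, the sketch below shows one way such features might be collected for a single token. Reading "substrings" as prefixes and suffixes is an assumption, and the function name `unknown_word_features`, the parameters `max_affix` and `window`, and the `<PAD>` marker are all hypothetical.

```python
from typing import Dict, List


def unknown_word_features(
    tokens: List[str],
    i: int,
    max_affix: int = 4,   # hypothetical limit on prefix/suffix length
    window: int = 2,      # hypothetical number of context words per side
) -> Dict[str, int]:
    """Return binary features for the token at position i (illustrative only)."""
    word = tokens[i]
    feats: Dict[str, int] = {}

    # Substring features (assumed here to be prefixes and suffixes).
    for n in range(1, min(max_affix, len(word)) + 1):
        feats[f"prefix={word[:n]}"] = 1
        feats[f"suffix={word[-n:]}"] = 1

    # Context features: neighbouring words, padded at the sentence edges.
    for offset in range(-window, window + 1):
        if offset == 0:
            continue
        j = i + offset
        neighbour = tokens[j] if 0 <= j < len(tokens) else "<PAD>"
        feats[f"word[{offset:+d}]={neighbour}"] = 1

    return feats


if __name__ == "__main__":
    sentence = ["The", "glorp", "sings"]
    for name in sorted(unknown_word_features(sentence, 1)):
        print(name)
```

A feature dictionary of this kind could then be turned into a sparse vector and passed to an SVM classifier, with one classifier per POS tag being one possible arrangement; the abstract does not specify these details.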

Tetsuji Nakagawa, Taku Kudo, Yuji Matsumoto

Natural Language Processing | NLPRS 2001 | POS Tagger | Statistical POS Taggers | Unknown Words

Related Content

» Morphological Richness Offsets Resource Demand Experiences in Constructing a POS Tagger f...

» Combining Machine Learning with Linguistic Heuristics for Chinese Word Segmentation

» Target Word Detection and Semantic Role Chunking using Support Vector Machines

» Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

» Identifying Anatomical Phrases in Clinical Reports by Shallow Semantic Parsing Methods

» Learning a two-stage SVM/CRF sequence classifier

» Systematic feature evaluation for gene name recognition

» Translation-invariant classification of non-stationary signals

Post Info

Added	30 Jul 2010
Updated	30 Jul 2010
Type	Conference
Year	2001
Where	NLPRS
Authors	Tetsuji Nakagawa, Taku Kudo, Yuji Matsumoto
