Sciweavers



CORR
1998
Springer



Comparing a statistical and a rule-based tagger for German


Download www.ling.su.se

In this paper we present the results of comparing a statistical tagger for German based on decision trees and a rule-based Brill tagger for German. We used the same training corpus, and therefore the same tag set, to train both taggers. We then applied the taggers to the same test corpus and compared their respective behavior, in particular their error rates. Both taggers perform similarly, with an error rate of around 5%. The detailed error analysis shows that the rule-based tagger has more problems with unknown words than the statistical tagger, but the results are the opposite for tokens that are many-ways ambiguous. If the unknown words are fed into the taggers with the help of an external lexicon such as the Gertwol system, the error rate of the rule-based tagger drops to 4.7% and that of the statistical tagger drops to around 3.7%. Combining the taggers by using the output of one tagger to help the other did not lead to any further improvement. In d...
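
The kind of comparison the abstract describes (two taggers scored against the same test corpus, with errors broken down by whether a token was seen in training) could be sketched roughly as follows. This is only an illustration, not the authors' evaluation code: the function name `error_rates`, the toy sentence, the tags, and the vocabulary are all made up for the example.

```python
from typing import Dict, List, Set, Tuple

# A tagged corpus is a flat list of (token, tag) pairs.
Tagged = List[Tuple[str, str]]


def error_rates(gold: Tagged, predicted: Tagged, training_vocab: Set[str]) -> Dict[str, float]:
    """Return error rates in percent: overall, for known tokens, for unknown tokens.

    A token counts as "unknown" if it does not occur in training_vocab
    (a hypothetical stand-in for the set of word forms in the training corpus).
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same number of tokens")

    # For each group we keep [errors, total].
    counts = {"all": [0, 0], "known": [0, 0], "unknown": [0, 0]}
    for (gold_tok, gold_tag), (pred_tok, pred_tag) in zip(gold, predicted):
        if gold_tok != pred_tok:
            raise ValueError("token mismatch between gold and predicted corpus")
        group = "known" if gold_tok in training_vocab else "unknown"
        wrong = int(gold_tag != pred_tag)
        for key in ("all", group):
            counts[key][0] += wrong
            counts[key][1] += 1

    # Empty groups report 0.0 rather than dividing by zero.
    return {k: (100.0 * e / t if t else 0.0) for k, (e, t) in counts.items()}


# Toy data (invented): one gold sentence and the output of two hypothetical taggers.
gold = [("Das", "ART"), ("Haus", "NN"), ("ist", "VAFIN"), ("gross", "ADJD")]
vocab = {"Das", "Haus", "ist"}  # "gross" is treated as unknown
tagger_a = [("Das", "ART"), ("Haus", "NN"), ("ist", "VAFIN"), ("gross", "NN")]
tagger_b = [("Das", "ART"), ("Haus", "NE"), ("ist", "VAFIN"), ("gross", "ADJD")]

rates_a = error_rates(gold, tagger_a, vocab)
rates_b = error_rates(gold, tagger_b, vocab)
```

In this toy setup both taggers have the same overall error rate, but `rates_a` puts the error on the unknown token while `rates_b` puts it on a known one, which is the sort of difference an overall figure alone would hide.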

Martin Volk, Gerold Schneider


CORR 1998 | Education | Error Rate | Rule-based Tagger | Statistical Taggers |


Related Content

» A Simple Rule-Based Part of Speech Tagger

» Probabilistic and Rule-Based Tagger of an Inflective Language: a Comparison

Post Info

Added	22 Dec 2010
Updated	22 Dec 2010
Type	Journal
Year	1998
Where	CORR
Authors	Martin Volk, Gerold Schneider
