A Memory-Based Shallow Parser for Spoken Dutch

15 years 8 months ago

Download www.cnts.ua.ac.be

We describe the development of a Dutch memory-based shallow parser. The availability of large treebanks for Dutch, such as the one provided by the Spoken Dutch Corpus, allows memory-based learners to be trained on examples of shallow parsing taken from the treebank, and act as a shallow parser after training. An overview is given of a modular memory-based learning approach to shallow parsing, composed of a part-of-speech tagger– chunker and two grammatical relation ﬁnders, which has originally been developed for English. This approach is applied to the syntactically annotated part of the Spoken Dutch Corpus to construct a Dutch shallow parser. From the generalisation scores of the parser we conclude that existing memory-based parsing approaches can be applied to spoken Dutch successfully, but that there is room for improvement in the tagger–chunker.

Sander Canisius, Antal van den Bosch

Real-time Traffic

CLIN 2003 | Computational Linguistics | Dutch Memory-based Shallow | Shallow Parser | Spoken Dutch Corpus |

claim paper

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	CLIN
Authors	Sander Canisius, Antal van den Bosch

Sciweavers

A Memory-Based Shallow Parser for Spoken Dutch

CLIN 2003 | Computational Linguistics | Dutch Memory-based Shallow | Shallow Parser | Spoken Dutch Corpus |

Explore & Download

Productivity Tools

Sciweavers