Sciweavers

CLIN
2001
Accurate Stemming of Dutch for Text Classification
This paper investigates the use of stemming for classification of Dutch (email) texts. We introduce a stemmer, which combines dictionary lookup (implemented efficiently as a finit...
Tanja Gaustad, Gosse Bouma
CLIN
2001
Creating a Dutch Information Retrieval Test Corpus
This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch te...
Djoerd Hiemstra, David van Leeuwen
CLIN
2001
A Named Entity Recognition System for Dutch
We describe a Named Entity Recognition system for Dutch that combines gazetteers, handcrafted rules, and machine learning on the basis of seed material. We used gazetteers and a c...
Fien De Meulder, Walter Daelemans, Véroniqu...
CLIN
2001
The Alpino Dependency Treebank
In this paper we present the Alpino Dependency Treebank and the tools that we have developed to facilitate the annotation process. Annotation typically starts with parsing a sente...
Leonoor van der Beek, Gosse Bouma, Rob Malouf, Ger...
CLIN
2000
Proper Name Extraction from Non-Journalistic Texts
This paper discusses the influence of the corpus on the automatic identification of proper names in texts. Techniques developed for the newswire genre are generally not sufficient...
Thierry Poibeau, Leila Kosseim