Sciweavers

IR
2007
13 years 7 months ago
Searching strategies for the Bulgarian language
This paper reports on the underlying IR problems encountered when indexing and searching with the Bulgarian language. For this language we propose a general light stemmer and demon...
Jacques Savoy
CLIN
2001
13 years 8 months ago
Accurate Stemming of Dutch for Text Classification
This paper investigates the use of stemming for classification of Dutch (email) texts. We introduce a stemmer, which combines dictionary lookup (implemented efficiently as a finit...
Tanja Gaustad, Gosse Bouma
ACL
2003
13 years 8 months ago
Unsupervised Learning of Arabic Stemming Using a Parallel Corpus
This paper presents an unsupervised learning approach to building a non-English (Arabic) stemmer. The stemming model is based on statistical machine translation and it uses an Eng...
Monica Rogati, J. Scott McCarley, Yiming Yang
ITCC
2005
IEEE
14 years 1 months ago
A Stemming Algorithm for the Farsi Language
In this paper, we report on the design and implementation of a stemmer for the Farsi language. The results of our evaluation on a small Farsi document collection shows a signific...
Kazem Taghva, Russell Beckley, Mohammad Sadeh