
SPIRE
2000
Springer

A Word Stemming Algorithm for the Spanish Language

This paper describes a word stemming algorithm for the Spanish language. Document retrieval experiments on English text suggest that word stemming based on morphological analysis does not generally or consistently outperform ad hoc, hand-tuned algorithms such as the one proposed by Porter (Porter M., 1980). It is difficult to produce a Porter-style algorithm for a Romance language such as Spanish, however, because of its greater grammatical complexity and because inflection often changes the root of a word, not just its ending, as is the case in English. In general terms, the difficulty lies in producing an algorithm that can cope with the additional complexity of Spanish morphology while preserving the simplicity of a Porter-style algorithm. One such algorithm is presented in this paper. The algorithm combines dictionary lookups with some 300 stemming and intermediate reduction rules.
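The general idea of pairing a dictionary lookup with suffix rules can be sketched as follows. This is a minimal illustration only, not the authors' algorithm: the dictionary entries, the suffix rules, the minimum stem length, and the names `IRREGULAR_STEMS`, `SUFFIX_RULES`, `MIN_STEM_LENGTH`, and `stem` are all hypothetical and are not taken from the paper, which uses some 300 rules.

```python
# Illustrative sketch only: every entry and rule below is hypothetical
# and is NOT taken from the paper.

# Forms whose root changes under inflection are resolved by lookup,
# since no suffix rule can recover their stem.
IRREGULAR_STEMS = {
    "fui": "ir",
    "tuvo": "ten",
    "hizo": "hac",
}

# (suffix, replacement) pairs; the longest matching suffix is tried first.
SUFFIX_RULES = [
    ("amientos", ""),
    ("aciones", ""),
    ("mente", ""),
    ("iendo", ""),
    ("ando", ""),
    ("ces", "z"),
    ("es", ""),
    ("as", ""),
    ("os", ""),
    ("s", ""),
    ("a", ""),
    ("o", ""),
]

# Assumed guard so that short words are not reduced to almost nothing.
MIN_STEM_LENGTH = 3


def stem(word: str) -> str:
    """Return a stem using dictionary lookup first, then one suffix rule."""
    word = word.lower()

    # Step 1: dictionary lookup for irregular forms.
    if word in IRREGULAR_STEMS:
        return IRREGULAR_STEMS[word]

    # Step 2: apply the first (longest) suffix rule that leaves a stem
    # of acceptable length.
    for suffix, replacement in sorted(SUFFIX_RULES, key=lambda r: -len(r[0])):
        if word.endswith(suffix):
            candidate = word[: len(word) - len(suffix)] + replacement
            if len(candidate) >= MIN_STEM_LENGTH:
                return candidate

    # No rule applied: the word is returned unchanged.
    return word


if __name__ == "__main__":
    for w in ["tuvo", "cantando", "gatos", "voces", "los"]:
        print(w, stem(w))
```

The lookup runs before the rules so that root-changing forms never reach the suffix stripper. The sketch applies a single rule per word; the intermediate reduction rules mentioned in the abstract are not modelled here.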
Added 25 Aug 2010
Updated 25 Aug 2010
Type Conference
Year 2000
Where SPIRE
Authors Asunción Honrado, Ruben Leon, Ruairi O'Donnell, Duncan Sinclair