Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages

14 years 5 months ago

Download www.lrec-conf.org

A considerable amount of work has been put into development of stemmers and morphological analysers. The majority of these approaches use hand-crafted suffix-replacement rules but a few try to discover such rules from corpora. While most of the approaches remove or replace suffixes, there are examples of derivational stemmers which are based on prefixes as well. In this paper we present a rule-based morphological analyser. We propose an approach that takes both prefixes as well as suffixes into account. Given a corpus and a dictionary, our method can be used to obtain a set of suffix-replacement rules for deriving an inflected word's root form. We developed an approach for the Hindi language but show that the approach is portable, at least to related languages, by adapting it to the Gujarati language. Given that the entire process of developing such a ruleset is simple and fast, our approach can be used for rapid development of morphological analysers and yet it can obtain compet...

Niraj Aswani, Robert J. Gaizauskas

Real-time Traffic

Education | LREC 2010 | Morphological Analysers | Rule-based Morphological Analyser | Suffix-replacement Rules |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Niraj Aswani, Robert J. Gaizauskas

Comments (0)

Sciweavers

Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages

Education | LREC 2010 | Morphological Analysers | Rule-based Morphological Analyser | Suffix-replacement Rules |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers