Sciweavers

ECIR
2006
Springer

Generating Search Term Variants for Text Collections with Historic Spellings

14 years 1 months ago
Generating Search Term Variants for Text Collections with Historic Spellings
In this paper, we describe a new approach for retrieval in texts with non-standard spelling, which is important for historic texts in English or German. For this purpose, we present a new algorithm for generating search term variants in ancient orthography. By applying a spell checker on a corpus of historic texts, we generate a list of candidate terms for which the contemporary spellings have to be assigned manually. Then our algorithm produces a set of probabilistic rules. These probabilities can be considered for ranking in the retrieval stage. An experimental comparison shows that our approach outperforms competing methods.
Andrea Ernst-Gerlach, Norbert Fuhr
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2006
Where ECIR
Authors Andrea Ernst-Gerlach, Norbert Fuhr
Comments (0)