Sciweavers

COLING
2008

A Discriminative Alignment Model for Abbreviation Recognition

14 years 1 months ago
A Discriminative Alignment Model for Abbreviation Recognition
This paper presents a discriminative alignment model for extracting abbreviations and their full forms appearing in actual text. The task of abbreviation recognition is formalized as a sequential alignment problem, which finds the optimal alignment (origins of abbreviation letters) between two strings (abbreviation and full form). We design a large amount of finegrained features that directly express the events where letters produce or do not produce abbreviations. We obtain the optimal combination of features on an aligned abbreviation corpus by using the maximum entropy framework. The experimental results show the usefulness of the alignment model and corpus for improving abbreviation recognition.
Naoaki Okazaki, Sophia Ananiadou, Jun-ichi Tsujii
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where COLING
Authors Naoaki Okazaki, Sophia Ananiadou, Jun-ichi Tsujii
Comments (0)