Sciweavers

IJSI
2008

Managing the Acronym/Expansion Identification Process for Text-Mining Applications

13 years 11 months ago
Managing the Acronym/Expansion Identification Process for Text-Mining Applications
This paper deals with an acronym/definition extraction approach from textual data (corpora) and the disambiguation of these definitions (or expansions). Both steps of our global process of acquisition and management of acronyms are precisely described. The first step consists in using markers such as brackets to identify expansion candidates. The alignment of the letters allows to select the acronym/definition couples. The second step is to define the relevant expansion of an acronym in a given context. Our method is based on statistical measurements (Mutual Information, Cubic Mutual Information, Dice Measure) and the results provided by search engines. This paper presents an evaluation of the global process from real data (general and specialized domains). Key words: Web-mining; text-mining; natural language processing; BioNLP; named entities recognition; acronym; quality measures Roche M, Prince V. Managing the acronym/expansion identification process for textmining Applications. Int...
Mathieu Roche, Violaine Prince
Added 12 Dec 2010
Updated 12 Dec 2010
Type Journal
Year 2008
Where IJSI
Authors Mathieu Roche, Violaine Prince
Comments (0)