Improving the Performance of a Tagger Generator in an Information Extraction Application

15 years 6 months ago

Download www.jucs.org

: In this paper we present an experience in the extraction of named entities from Spanish texts using stacking. Named Entity Extraction (NEE) is a subtask of Information Extraction that involves the identiﬁcation of groups of words that make up the name of an entity, and the classiﬁcation of these names into a set of predeﬁned categories. Our approach is corpus-based, we use a re-trainable tagger generator to obtain a named entity extractor from a set of tagged examples. The main contribution of our work is that we obtain the systems needed in a stacking scheme without making use of any additional training material or tagger generators. Instead of it, we have generated the variability needed in stacking by applying corpus transformation to the original training corpus. Once we have several versions of the training corpus we generate several extractors and combine them by means of a machine learning algorithm. Experiments show that the combination of corpus transformation and stac...

José A. Troyano, Fernando Enríquez,

Real-time Traffic

Corpus Transformation | Entity Extraction | JUCS 2007 | Tagger Generator |

claim paper

» BioCreAtIvE Task1A entity identification with a stochastic tagger

» Analysis and Repair of Name Tagger Errors

» Improving the Performance of a Named Entity Extractor by Applying a Stacking Scheme

» A Case Restoration Approach to Named Entity Tagging in Degraded Documents

» Semisupervised learning of semantic classes for query understanding from the web and for t...

» iTag a personalized blog tagger

» TEGa hybrid approach to information extraction

» Mining User Reviews from Specification to Summarization

Post Info
More Details (n/a)

Added	16 Dec 2010
Updated	16 Dec 2010
Type	Journal
Year	2007
Where	JUCS
Authors	José A. Troyano, Fernando Enríquez, Fermín Cruz, José Miguel Cañete Valdeón, F. Javier Ortega

Comments (0)

Sciweavers

Improving the Performance of a Tagger Generator in an Information Extraction Application

Corpus Transformation | Entity Extraction | JUCS 2007 | Tagger Generator |

Explore & Download

Productivity Tools

Sciweavers