Sciweavers

LREC
2010

NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems

13 years 11 months ago
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems
Availability of labeled language resources, such as annotated corpora and domain dependent labeled language resources is crucial for experiments in the field of Natural Language Processing. Most often, due to lack of resources, manual verification and annotation of electronic text material is a prerequisite for the development of NLP tools. In the context of under-resourced language, the lack of copora becomes a crucial problem because most of the research efforts are supported by organizations with limited funds. Using free, multilingual and highly structured corpora like Wikipedia to produce automatically labeled language resources can be an answer to those needs. This paper introduces NLGbAse, a multilingual linguistic resource built from the Wikipedia encyclopedic content. This system produces structured metadata which make possible the automatic annotation of corpora with syntactical and semantical labels. A metadata contains semantical and statistical informations related to a...
Eric Charton, Juan Manuel Torres Moreno
Added 29 Jan 2011
Updated 29 Jan 2011
Type Journal
Year 2010
Where LREC
Authors Eric Charton, Juan Manuel Torres Moreno
Comments (0)