Sciweavers

LREC
2010
125views Education» more  LREC 2010»
13 years 9 months ago
Ontology-based Interoperation of Linguistic Tools for an Improved Lemma Annotation in Spanish
In this paper, we present an ontology-based methodology and architecture for the comparison, assessment, combination (and, to some extent, also contrastive evaluation) of the resu...
Antonio Pareja-Lora, Guadalupe Aguado de Cea
LREC
2010
443views Education» more  LREC 2010»
13 years 9 months ago
Interpreting SentiWordNet for Opinion Classification
We describe a set of tools, resources, and experiments for opinion classification in business-related datasources in two languages. In particular we concentrate on SentiWordNet te...
Horacio Saggion, Adam Funk
LREC
2010
132views Education» more  LREC 2010»
13 years 9 months ago
A Question-answer Distance Measure to Investigate QA System Progress
The performance of question answering system is evaluated through successive evaluations campaigns. A set of questions are given to the participating systems which are to find the...
Guillaume Bernard, Sophie Rosset, Martine Adda-Dec...
LREC
2010
217views Education» more  LREC 2010»
13 years 9 months ago
Building a Web Corpus of Czech
Large corpora are essential to modern methods of computational linguistics and natural language processing. In this paper, we describe an ongoing project whose aim is to build a l...
Drahomíra "johanka" Spoustová, Miros...
LREC
2010
107views Education» more  LREC 2010»
13 years 9 months ago
Identifying Paraphrases between Technical and Lay Corpora
In previous work, we presented a preliminary study to identify paraphrases between technical and lay discourse types from medical corpora dedicated to the French language. In this...
Louise Deléger, Pierre Zweigenbaum
LREC
2010
175views Education» more  LREC 2010»
13 years 9 months ago
Capturing Coercions in Texts: a First Annotation Exercise
In this paper we report the first results of an annotation exercise of argument coercion phenomena performed on Italian texts. Our corpus consists of ca 4000 sentences from the PA...
Elisabetta Jezek, Valeria Quochi
LREC
2010
165views Education» more  LREC 2010»
13 years 9 months ago
Maximum Entropy Classifier Ensembling using Genetic Algorithm for NER in Bengali
In this paper, we propose classifier ensemble selection for Named Entity Recognition (NER) as a single objective optimization problem. Thereafter, we develop a method based on gen...
Asif Ekbal, Sriparna Saha
LREC
2010
168views Education» more  LREC 2010»
13 years 9 months ago
GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains
The development of a multilingual terminology is a very long and costly process. We present the creation of a multilingual terminological database called GRISP covering multiple t...
Patrice Lopez, Laurent Romary
LREC
2010
195views Education» more  LREC 2010»
13 years 9 months ago
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units
In Chinese texts, words composed of single or multiple characters are not separated by spaces, unlike most western languages. Therefore Chinese word segmentation is considered an ...
Yiou Wang, Kiyotaka Uchimoto, Jun'ichi Kazama, Can...
LREC
2010
132views Education» more  LREC 2010»
13 years 9 months ago
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation
Linguistic Data Consortium (LDC) at the University of Pennsylvania has participated as a data provider in a variety of governmentsponsored programs that support development of Hum...
Kazuaki Maeda, Haejoong Lee, Stephen Grimes, Jonat...