LREC
2008
A Framework for Identity Resolution and Merging for Multi-source Information Extraction
In the context of ontology-based information extraction, identity resolution is the process of deciding whether an instance extracted from text refers to a known entity in the tar...
Milena Yankova, Horacio Saggion, Hamish Cunningham
LREC
2008
Local Methods for On-Demand Out-of-Vocabulary Word Retrieval
Most of the Web-based methods for lexicon augmenting consist in capturing global semantic features of the targeted domain in order to collect relevant documents from the Web. We s...
Stanislas Oger, Georges Linares, Frédé...
LREC
2008
Building a Golden Collection of Parallel Multi-Language Word Alignment
This paper reports an experience on producing manual word alignments over six different language pairs (all combinations between Portuguese, English, French and Spanish) (Grac...
Joao Graça, Joana Paulo Pardal, Luís...
LREC
2008
Projecting Propbank Roles onto the CCGbank
This paper describes a method of accurately projecting Propbank roles onto constituents in the CCGbank with near perfect accuracy and automatically annotating verbal categories wi...
Stephen A. Boxwell, Michael White
LREC
2008
Using Reordering in Statistical Machine Translation based on Alignment Block Classification
Statistical Machine Translation (SMT) is based on alignment models which learn from bilingual corpora the word correspondences between source and target language. These models are...
Marta R. Costa-Jussà, José A. R. Fon...
LREC
2008
Building a Corpus of Temporal-Causal Structure
While recent corpus annotation efforts cover a wide variety of semantic structures, work on temporal and causal relations is still in its early stages. Annotation efforts have typ...
Steven Bethard, William Corvey, Sara Klingenstein,...
LREC
2008
System Evaluation on a Named Entity Corpus from Clinical Notes
This paper presents the evaluation of the dictionary look-up component of Mayo Clinic's Information Extraction system. The component was tested on a corpus of 160 free-text c...
Karin Schuler, Vinod Kaggal, James J. Masanz, Phil...
LREC
2008
Pragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic-Spanish-English)
Discourse structure and coherence relations are one of the main inferential challenges addressed by computational pragmatics. The present study focuses on discourse markers as key...
Doaa Samy, Ana González-Ledesma
LREC
2008
Annotating Expressions of Opinion and Emotion in the Italian Content Annotation Bank
In this paper we describe the result of manually annotating I-CAB, the Italian Content Annotation Bank, by expressions of private state (EPSs), i.e., expressions that denote the p...
Andrea Esuli, Fabrizio Sebastiani, Ilaria Urciuoli
LREC
2008
Information Extraction Tools and Methods for Understanding Dialogue in a Companion
This paper discusses how Information Extraction is used to understand and manage Dialogue in the EU-funded Companions project. This will be discussed with respect to the Senior Co...
Roberta Catizone, Alexiei Dingli, Hugo Pinto, Yori...