Sciweavers

LREC
2010
133views Education» more  LREC 2010»
13 years 9 months ago
Towards a Learning Approach for Abbreviation Detection and Resolution
The explosion of biomedical literature and with it the -uncontrolled- creation of abbreviations presents some special challenges for both human readers and computer applications. ...
Klaar Vanopstal, Bart Desmet, Véronique Hos...
LREC
2010
150views Education» more  LREC 2010»
13 years 9 months ago
Achieving Domain Specificity in SMT without Overt Siloing
We examine pooling data as a method for improving Statistical Machine Translation (SMT) quality for narrowly defined domains, such as data for a particular company or public entit...
William D. Lewis, Chris Wendt, David Bullock
LREC
2010
186views Education» more  LREC 2010»
13 years 9 months ago
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News
This paper presents the EPAC corpus which is composed by a set of 100 hours of conversational speech manually transcribed and by the outputs of automatic tools (automatic segmenta...
Yannick Estève, Thierry Bazillon, Jean-Yves...
LREC
2010
187views Education» more  LREC 2010»
13 years 9 months ago
Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents
We describe the compilation of a large corpus of French-Dutch sentence pairs from official Belgian documents which are available in the online version of the publication Belgisch ...
Tom Vanallemeersch
LREC
2010
148views Education» more  LREC 2010»
13 years 9 months ago
POS Multi-tagging Based on Combined Models
In the POS tagging task, there are two kinds of statistical models: one is generative model, such as the HMM, the others are discriminative models, such as the Maximum Entropy Mod...
Yan Zhao, Gertjan van Noord
LREC
2010
182views Education» more  LREC 2010»
13 years 9 months ago
Aligning FrameNet and WordNet based on Semantic Neighborhoods
This paper presents an algorithm for aligning FrameNet lexical units to WordNet synsets. Both, FrameNet and WordNet, are well-known as well as widely-used resources by the entire ...
Óscar Ferrández, Michael Ellsworth, ...
LREC
2010
148views Education» more  LREC 2010»
13 years 9 months ago
FipsRomanian: Towards a Romanian Version of the Fips Syntactic Parser
We describe work in progress on the development of a full syntactic parser for Romanian. This work is part of a larger project of multilingual extension of the Fips parser (Wehrli...
Violeta Seretan, Eric Wehrli, Luka Nerima, Gabriel...
LREC
2010
187views Education» more  LREC 2010»
13 years 9 months ago
FIDJI: Web Question-Answering at Quaero 2009
This paper presents the participation of FIDJI system to the Web Question-Answering evaluation campaign organized by Quaero in 2009. FIDJI is an open-domain question-answering sys...
Xavier Tannier, Véronique Moriceau
LREC
2010
203views Education» more  LREC 2010»
13 years 9 months ago
MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse
In this paper, we describe our experience with collecting and creating an annotated corpus of multi-party online conversations in a chat-room environment. This effort is part of a...
Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell...
LREC
2010
159views Education» more  LREC 2010»
13 years 9 months ago
Bilingual Lexicon Induction: Effortless Evaluation of Word Alignment Tools and Production of Resources for Improbable Language P
In this paper, we present a simple protocol to evaluate word aligners on bilingual lexicon induction tasks from parallel corpora. Rather than resorting to gold standards, it relie...
Adrien Lardilleux, Julien Gosme, Yves Lepage