Sciweavers

LREC
2010
Bulgarian National Corpus Project
The paper presents the Bulgarian National Corpus project (BulNC), a large-scale, representative corpus of Bulgarian that is available online. The BulNC is also a monolingual general corpus,...
Svetla Koeva, Diana Blagoeva, Siya Kolkovska
LREC
2010
Maskkot - An Entity-centric Annotation Platform
The Semantic Web faces the important challenge of maintaining its promise of a real world-wide graph of interconnected resources. Unfortunately, while URIs almost guarantee a dir...
Armando Stellato, Heiko Stoermer, Stefano Bortoli,...
LREC
2010
Word Boundaries in French: Evidence from Large Speech Corpora
The goal of this paper is to investigate French word segmentation strategies using phonemic and lexical transcriptions as well as prosodic and part-of-speech annotations. Average ...
Rena Nemoto, Martine Adda-Decker, Jacques Durand
LREC
2010
A Comprehensive Resource to Evaluate Complex Open Domain Question Answering
We describe two corpora of question and answer pairs collected for complex, open-domain Question Answering (QA) to enable answer classification and re-ranking experiments. We deli...
Silvia Quarteroni, Alessandro Moschitti
LREC
2010
Analysing Temporally Annotated Corpora with CAVaT
We present CAVaT, a tool that performs Corpus Analysis and Validation for TimeML. CAVaT is an open source, modular checking utility for statistical analysis of features specific t...
Leon Derczynski, Robert J. Gaizauskas
LREC
2010
Recent Developments in the National Corpus of Polish
The aim of the paper is to present recent -- as of March 2010 -- developments in the construction of the National Corpus of Polish (NKJP). The NKJP project was launched at the ver...
Adam Przepiórkowski, Rafal L. Górski...
LREC
2010
Spontal: A Swedish Spontaneous Dialogue Corpus of Audio, Video and Motion Capture
We present the Spontal database of spontaneous Swedish dialogues. 120 dialogues of at least 30 minutes each have been captured in high-quality audio, high-resolution video and wit...
Jens Edlund, Jonas Beskow, Kjell Elenius, Kahl Hel...
LREC
2010
NameDat: A Database of English Proper Names Spoken by Native Norwegians
This paper describes the design and collection of NameDat, a database containing English proper names spoken by native Norwegians. The database was designed to cover the typical a...
Line Adde, Torbjørn Svendsen
LREC
2010
Fine-Grained Geographical Relation Extraction from Wikipedia
In this paper, we present work on enhancing the basic data resource of a context-aware system. First, we introduce a supervised approach to extracting geographical relations on a ...
André Blessing, Hinrich Schütze