LREC
2010
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese
In this paper we bring to light a novel intersection between corpus linguistics and behavioral data that can be employed as an evaluation metric for resources for low-density lang...
Jerid Francom, Amy LaCross, Adam Ussishkin
LREC
2010
Wizard of Oz Experiments for a Companion Dialogue System: Eliciting Companionable Conversation
Within the EU-funded COMPANIONS project, we are working to evaluate new collaborative conversational models of dialogue. Such an evaluation requires us to benchmark approaches to ...
Nick Webb, David Benyon, Jay Bradley, Preben Hanse...
LREC
2010
The Brandeis Annotation Tool
The Brandeis Annotation Tool is a web-based text annotation tool that is centered around the notions of layered annotation and task decomposition. It allows annotations to refer t...
Marc Verhagen
LREC
2010
Integration of Linguistic Markup into Semantic Models of Folk Narratives: The Fairy Tale Use Case
Propp's influential structural analysis of fairy tales created a powerful schema for representing storylines in terms of character functions, which is directly exploitable fo...
Piroska Lendvai, Thierry Declerck, Sándor D...
LREC
2010
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9
CzEng 0.9 is the third release of a large parallel corpus of Czech and English. For the current release, CzEng was extended by a significant amount of text from various types of so...
Ondřej Bojar, Adam Liška, Zdeněk Žabokrtský
LREC
2010
Comparison of Spectral Properties of Read, Prepared and Casual Speech in French
In this paper, we investigate the acoustic properties of phonemes in three speaking styles: read speech, prepared speech and spontaneous speech. Our aim is to better understand wh...
Jean-Luc Rouas, Mayumi Beppu, Martine Adda-Decker
LREC
2010
Handling of Missing Values in Lexical Acquisition
We propose a strategy to reduce the impact of the sparse data problem in the tasks of lexical information acquisition based on the observation of linguistic cues. It justifies tha...
Núria Bel
LREC
2010
Annotation Time Stamps - Temporal Metadata from the Linguistic Annotation Process
We describe the re-annotation of selected types of named entities (persons, organizations, locations) from the MUC7 corpus. The focus of this annotation initiative is on recording...
Katrin Tomanek, Udo Hahn
LREC
2010
Testing Semantic Similarity Measures for Extracting Synonyms from a Corpus
The definition of lexical semantic similarity measures has been the subject of much work for many years. In this article, we focus more specifically on distributional semantic...
Olivier Ferret
LREC
2010
Corpus-based Semantics of Concession: Where do Expectations Come from?
In this paper, we discuss our analysis and resulting new annotations of Penn Discourse Treebank (PDTB) data tagged as Concession. Concession arises whenever one of the two argumen...
Livio Robaldo, Eleni Miltsakaki, Alessia Bianchini