Sciweavers

LREC
2008
The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database
Princeton WordNet (WN.Pr) lexical database has motivated efficient compilations of bulky relational lexicons since its inception in the 1980s.
Bento Carlos Dias-da-Silva, Ariani Di Felippo, Mar...
LREC
2008
Centering Theory for Evaluation of Coherence in Computer-Aided Summaries
This paper investigates a new evaluation method for assessing the coherence of computer-aided summaries, justified by the inappropriacy of existing evaluation methods for this tas...
Laura Hasler
LREC
2008
A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora
Laughter is an intrinsic component of human-human interaction, and current automatic speech understanding paradigms stand to gain significantly from its detection and modeling. In...
Susanne Burger, Kornel Laskowski, Matthias Wö...
LREC
2008
CzEng 0.7: Parallel Corpus with Community-Supplied Translations
This paper describes CzEng 0.7, a new release of Czech-English parallel corpus freely available for research and educational purposes. We provide basic statistics of the corpus an...
Ondřej Bojar, Miroslav Janíček, Zdeněk Žabo...
LREC
2008
Question Answering on Speech Transcriptions: the QAST evaluation in CLEF
This paper reports on the QAST track of CLEF aiming to evaluate Question Answering on Speech Transcriptions. Accessing information in spoken documents provides additional challeng...
Lori Lamel, Sophie Rosset, Christelle Ayache, Djam...
LREC
2008
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
In recent years, language resources acquired from the Web have been released, and these data improve the performance of applications in several NLP tasks. Although the language resource...
Keiji Shinzato, Daisuke Kawahara, Chikara Hashimot...
LREC
2008
Targeting Chinese Nominal Compounds in Corpora
For compounding languages, a great part of the topical semantics is conveyed via nominal compounds. Various applications of natural language processing can profit from explicit ac...
Weiruo Qu, Christoph Ringlstetter, Randy Goebel
LREC
2008
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary
Recently, collaboratively constructed resources such as Wikipedia and Wiktionary have been discovered as valuable lexical semantic knowledge bases with a high potential in diverse...
Torsten Zesch, Christof Müller, Iryna Gurevyc...
LREC
2008
A LAF/GrAF based Encoding Scheme for underspecified Representations of syntactic Annotations
Data models and encoding formats for syntactically annotated text corpora need to deal with syntactic ambiguity; underspecified representations are particularly well suited for th...
Manuel Kountz, Ulrich Heid, Kerstin Eckart
LREC
2008
I saw TREE trees in the park: How to Correct Real-Word Spelling Mistakes
This paper presents a context sensitive spell checking system that uses mixed trigram models, and introduces a new empirically grounded method for building confusion sets. The pro...
Davide Fossati, Barbara Di Eugenio