Sciweavers

LREC
2008
117views Education» more  LREC 2008»
13 years 9 months ago
Swedish-Turkish Parallel Treebank
In this paper, we describe our work on building a parallel treebank for a less studied and typologically dissimilar language pair, namely Swedish and Turkish. The treebank is a ba...
Beáta Megyesi, Bengt Dahlqvist, Eva Petters...
LREC
2008
69views Education» more  LREC 2008»
13 years 9 months ago
A Multi-Lingual Dictionary of Dirty Words
We present a multi-lingual dictionary of dirty words. We have collected about 3,200 dirty words in several languages and built a database of these. The language with the most word...
Jonas Sjöbergh, Kenji Araki
LREC
2008
112views Education» more  LREC 2008»
13 years 9 months ago
The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database
Princeton WordNet (WN.Pr) lexical database has motivated efficient compilations of bulky relational lexicons since its inception in the 1980
Bento Carlos Dias-da-Silva, Ariani Di Felippo, Mar...
LREC
2008
141views Education» more  LREC 2008»
13 years 9 months ago
Centering Theory for Evaluation of Coherence in Computer-Aided Summaries
This paper investigates a new evaluation method for assessing the coherence of computer-aided summaries, justified by the inappropriacy of existing evaluation methods for this tas...
Laura Hasler
LREC
2008
112views Education» more  LREC 2008»
13 years 9 months ago
A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora
Laughter is an intrinsic component of human-human interaction, and current automatic speech understanding paradigms stand to gain significantly from its detection and modeling. In...
Susanne Burger, Kornel Laskowski, Matthias Wö...
LREC
2008
104views Education» more  LREC 2008»
13 years 9 months ago
CzEng 0.7: Parallel Corpus with Community-Supplied Translations
This paper describes CzEng 0.7, a new release of Czech-English parallel corpus freely available for research and educational purposes. We provide basic statistics of the corpus an...
Ondrej Bojar, Miroslav Janícek, Zdenek Zabo...
LREC
2008
83views Education» more  LREC 2008»
13 years 9 months ago
Question Answering on Speech Transcriptions: the QAST evaluation in CLEF
This paper reports on the QAST track of CLEF aiming to evaluate Question Answering on Speech Transcriptions. Accessing information in spoken documents provides additional challeng...
Lori Lamel, Sophie Rosset, Christelle Ayache, Djam...
LREC
2008
169views Education» more  LREC 2008»
13 years 9 months ago
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
In recent years, language resources acquired from the Web are released, and these data improve the performance of applications in several NLP tasks. Although the language resource...
Keiji Shinzato, Daisuke Kawahara, Chikara Hashimot...
LREC
2008
93views Education» more  LREC 2008»
13 years 9 months ago
Targeting Chinese Nominal Compounds in Corpora
For compounding languages, a great part of the topical semantics is conveyed via nominal compounds. Various applications of natural language processing can profit from explicit ac...
Weiruo Qu, Christoph Ringlstetter, Randy Goebel
LREC
2008
115views Education» more  LREC 2008»
13 years 9 months ago
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary
Recently, collaboratively constructed resources such as Wikipedia and Wiktionary have been discovered as valuable lexical semantic knowledge bases with a high potential in diverse...
Torsten Zesch, Christof Müller, Iryna Gurevyc...