LREC
2010
eXtended WordFrameNet
This paper presents a novel automatic approach to partially integrate FrameNet and WordNet. In that way we expect to extend FrameNet coverage, to enrich WordNet with frame semanti...
Egoitz Laparra, German Rigau
Learning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica
In this paper we present preliminary work conducted on semi-automatic induction of inflectional paradigms from non-annotated corpora using the open-source tool Linguistica (Goldsm...
Helena Blancafort
The DesPho-APaDy Project: Developing an Acoustic-phonetic Characterization of Dysarthric Speech in French
This paper presents the rationale, objectives and advances of an on-going project (the DesPho-APaDy project funded by the French National Agency of Research) which aims to provide...
Cécile Fougeron, Lise Crevier-Buchman, Cori...
Constructing the CODA Corpus: A Parallel Corpus of Monologues and Expository Dialogues
We describe the construction of the CODA corpus, a parallel corpus of monologues and expository dialogues. The dialogue part of the corpus consists of expository, i.e., informatio...
Svetlana Stoyanchev, Paul Piwek
Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus
In the Low Countries, a major reference corpus for written Dutch is currently being built. In this paper, we discuss the interplay between data acquisition and data processing dur...
Martin Reynaert, Nelleke Oostdijk, Orphée D...
Efficient Minimal Perfect Hash Language Models
The recent availability of large collections of text such as the Google 1T 5-gram corpus (Brants and Franz, 2006) and the Gigaword corpus of newswire (Graff, 2003) has made it po...
David Guthrie, Mark Hepple, Wei Liu
Building a Generative Lexicon for Romanian
In this paper we present ongoing research: the construction and annotation of a Romanian Generative Lexicon (RoGL). Our system follows the specifications of the CLIPS project for ...
Anca Dinu
The Indiana "Cooperative Remote Search Task" (CReST) Corpus
This paper introduces a novel corpus of natural language dialogues obtained from humans performing a cooperative, remote, search task (CReST) as it occurs naturally in a variety o...
Kathleen M. Eberhard, Hannele Nicholson, Sandra K...
Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora
An increasing demand for new language resources of recent EU members and acceding countries has in turn initiated the development of different language tools and resources, such ...
Sanja Seljan, Marko Tadic, Zeljko Agic, Jan Snajde...
The Kachna L1/L2 Picture Replication Corpus
This paper presents the Kachna Corpus of Spontaneous Speech, in which ten Czech and ten Norwegian speakers were recorded both in their native language and in English. The dialogue...
Helena Spilková, Daniel Brenner, Anton Ö...