Sciweavers

LREC
2010
Number or Nuance: Which Factors Restrict Reliable Word Sense Annotation?
Susan Windisch Brown, Travis Rood, Martha Palmer
LREC
2010
Towards the Annotation of Named Entities in the National Corpus of Polish
We present the named entity annotation task within the on-going project of the National Corpus of Polish. To the best of our knowledge, this is the first attempt at a large-scale ...
Agata Savary, Jakub Waszczuk, Adam Przepiór...
LREC
2010
The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals
Investigating differences in linguistic usage between individuals who have suffered brain injury (hereafter patients) and those who haven't can yield a number of benefits. It...
Caroline Williams, Andrew Thwaites, Paula Buttery,...
LREC
2010
A Game-based Approach to Transcribing Images of Text
We present a methodology that takes as input scanned documents of typed or hand-written text, and produces transcriptions of the text as output. Instead of using OCR technology, t...
Khalil Dahab, Anja Belz
LREC
2010
Constructing a Broad-coverage Lexicon for Text Mining in the Patent Domain
Nelleke Oostdijk, Suzan Verberne, Cornelis Koster
LREC
2010
Cooperation for Arabic Language Resources and Tools - The MEDAR Project
The paper describes some of the work carried out within the European funded project MEDAR. The project has three streams of activity: the technical stream, the cooperation stream ...
Bente Maegaard, Mohamed Attia, Khalid Choukri, Oli...
LREC
2010
A Morphologically-Analyzed CHILDES Corpus of Hebrew
We present a corpus of transcribed spoken Hebrew that forms an integral part of a comprehensive data system that has been developed to suit the specific needs and interests of chi...
Bracha Nir, Brian MacWhinney, Shuly Wintner
LREC
2010
The POETICON Corpus: Capturing Language Use and Sensorimotor Experience in Everyday Interaction
Natural language use, acquisition, and understanding usually take place in multisensory and multimedia communication environments. Therefore, for one to model language in its int...
Katerina Pastra, Christian Wallraven, Michael Schu...
LREC
2010
A Typology of Near-Identity Relations for Coreference (NIDENT)
The task of coreference resolution requires people or systems to decide when two referring expressions refer to the 'same' entity or event. In real text, this is often a diff...
Marta Recasens, Eduard H. Hovy, Maria Antòn...
LREC
2010
Building an Italian FrameNet through Semi-automatic Corpus Analysis
In this paper, we outline the methodology we adopted to develop a FrameNet for Italian. The main element of novelty with respect to the original FrameNet is represented by the fac...
Alessandro Lenci, Martina Johnson, Gabriella Lapes...