Sciweavers

LREC
2010
442views Education» more  LREC 2010»
13 years 9 months ago
Medefaidrin: Resources Documenting the Birth and Death Language Life-cycle
Language resources are typically defined and created for application in speech technology contexts, but the documentation of languages which are unlikely ever to be provided with ...
Dafydd Gibbon, Moses Ekpenyong, Eno-Abasi Urua
LREC
2010
130views Education» more  LREC 2010»
13 years 9 months ago
ELAN as Flexible Annotation Framework for Sound and Image Processing Detectors
Annotation of digital recordings in humanities research still is, to a large extend, a process that is performed manually. This paper describes the first pattern recognition based...
Eric Auer, Albert Russel, Han Sloetjes, Peter Witt...
LREC
2010
135views Education» more  LREC 2010»
13 years 9 months ago
A Tool for Linking Stems and Conceptual Fragments to Enhance word Access
Electronic dictionaries offer many possibilities unavailable in paper dictionaries to view, display or access information. However, even these resources fall short when it comes t...
Nuria Gala, Véronique Rey, Michael Zock
LREC
2010
149views Education» more  LREC 2010»
13 years 9 months ago
DutchParl. The Parliamentary Documents in Dutch
A corpus called DutchParl is created which aims to contain all digitally available parliamentary documents written in the Dutch language. The first version of DutchParl contains d...
Maarten Marx, Anne Schuth
LREC
2010
113views Education» more  LREC 2010»
13 years 9 months ago
The Design of Syntactic Annotation Levels in the National Corpus of Polish
This paper presents the procedure of the syntactic annotation of the National Corpus of Polish. Syntactic annotation consists here of shallow parsing and manual post-editing of th...
Katarzyna Glowinska, Adam Przepiórkowski
LREC
2010
177views Education» more  LREC 2010»
13 years 9 months ago
IndoWordNet
India is a multilingual country where machine translation and cross lingual search are highly relevant problems. These problems require large resources- like wordnets and lexicons...
Pushpak Bhattacharyya
LREC
2010
155views Education» more  LREC 2010»
13 years 9 months ago
Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation
Manual text annotation is a resource-consuming endeavor necessary for NLP systems when they target new tasks or domains for which there are no existing annotated corpora. Distribu...
Emilia Apostolova, Sean Neilan, Gary An, Noriko To...
LREC
2010
146views Education» more  LREC 2010»
13 years 9 months ago
From XML to XML: The Why and How of Making the Biodiversity Literature Accessible to Researchers
We present the ABLE document collection, which consists of a set of annotated volumes of the Bulletin of the British Museum (Natural History). These were developed during our ongo...
Alistair Willis, David King, David Morse, Anton Di...
LREC
2010
153views Education» more  LREC 2010»
13 years 9 months ago
Homographic Ideogram Understanding Using Contextual Dynamic Network
Conventional methods for disambiguation problems have been using statistical methods with co-occurrence of words in their contexts. It seems that human-beings assign appropriate w...
Jun Okamoto, Shun Ishizaki
LREC
2010
208views Education» more  LREC 2010»
13 years 9 months ago
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features
We report about tools for the extraction of German multiword expressions (MWEs) from text corpora; we extract word pairs, but also longer MWEs of different patterns, e.g. verb-nou...
Marion Weller, Ulrich Heid