Sciweavers

910 search results - page 4 / 182
» Standardization of Speech Corpus
Sort
View
LREC
2010
171views Education» more  LREC 2010»
13 years 9 months ago
Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal)
In this paper, we report on the design of a part-of-speech-tagset for Wolof and on the creation of a semi-automatically annotated gold standard. The main motivation for this resou...
Cheikh M. Bamba Dione, Jonas Kuhn, Sina Zarrie&szl...
LREC
2010
147views Education» more  LREC 2010»
13 years 9 months ago
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch
This paper reports on the annotation of a corpus of 1 million words with four semantic annotation layers, including named entities, coreference relations, semantic roles and spati...
Ineke Schuurman, Véronique Hoste, Paola Mon...
NIPS
2003
13 years 8 months ago
Unsupervised Context Sensitive Language Acquisition from a Large Corpus
We describe a pattern acquisition algorithm that learns, in an unsupervised fashion, a streamlined representation of linguistic structures from a plain natural-language corpus. Th...
Zach Solan, David Horn, Eytan Ruppin, Shimon Edelm...
TAL
2004
Springer
14 years 24 days ago
Unsupervised Training of a Finite-State Sliding-Window Part-of-Speech Tagger
A simple, robust sliding-window part-of-speech tagger is presented and a method is given to estimate its parameters from an untagged corpus. Its performance is compared to a standa...
Enrique Sánchez Villamil, Mikel L. Forcada,...
LREC
2008
225views Education» more  LREC 2008»
13 years 9 months ago
The MoveOn Motorcycle Speech Corpus
A speech and noise corpus dealing with the extreme conditions of the motorcycle environment is developed within the MoveOn project. Speech utterances in British English are record...
Thomas Winkler, Theodoros Kostoulas, Richard Adder...