Sciweavers

» Standardization of Speech Corpus
LREC
2010
13 years 10 months ago
Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank
The Quranic Arabic Dependency Treebank (QADT) is part of the Quranic Arabic Corpus (http://corpus.quran.com), an online linguistic resource organized by the University of Leeds, a...
Kais Dukes, Eric Atwell, Abdul-Baquee M. Sharaf
LREC
2010
13 years 10 months ago
Bilingual Lexicon Induction: Effortless Evaluation of Word Alignment Tools and Production of Resources for Improbable Language P
In this paper, we present a simple protocol to evaluate word aligners on bilingual lexicon induction tasks from parallel corpora. Rather than resorting to gold standards, it relie...
Adrien Lardilleux, Julien Gosme, Yves Lepage
KES
2010
Springer
13 years 7 months ago
W-kmeans: Clustering News Articles Using WordNet
Document clustering is a powerful technique that has been widely used for organizing data into smaller and manageable information kernels. Several approaches have been proposed...
Christos Bouras, Vassilis Tsogkas
COST
2009
Springer
14 years 3 months ago
Audiovisual Tools for Phonetic and Articulatory Visualization in Computer-Aided Pronunciation Training
This paper reviews interactive methods for improving the phonetic competence of subjects in the case of second language learning as well as in the case of speech therapy for subjec...
Bernd J. Kröger, Peter Birkholz, Rüdiger...
ICASSP
2008
IEEE
14 years 3 months ago
Combination of strongly and weakly constrained recognizers for reliable detection of OOVS
This paper addresses the detection of OOV segments in the output of a large vocabulary continuous speech recognition (LVCSR) system. First, standard confidence measures based on fr...
Lukas Burget, Petr Schwarz, Pavel Matejka, Mirko H...