Sciweavers

63 search results - page 9 / 13
ICASSP
2009
IEEE
14 years 2 months ago
Lattice-based MLLR for speaker recognition
Maximum-Likelihood Linear Regression (MLLR) transform coefficients have been shown to be useful features for text-independent speaker recognition systems. These use MLLR coefficients ...
Marc Ferras, Claude Barras, Jean-Luc Gauvain
ICML
2008
IEEE
14 years 8 months ago
Modified MMI/MPE: a direct evaluation of the margin in speech recognition
In this paper we show how common speech recognition training criteria such as the Minimum Phone Error criterion or the Maximum Mutual Information criterion can be extended to inco...
Georg Heigold, Hermann Ney, Ralf Schlüter, Th...
ICASSP
2008
IEEE
14 years 2 months ago
Phonetic pronunciations for Arabic speech-to-text systems
In this paper two aspects of generating and using phonetic Arabic dictionaries are described. First, the use of single pronunciation acoustic models in the context of Arabic large...
Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Phi...
ICASSP
2011
IEEE
12 years 11 months ago
A study of an irrelevant variability normalization based discriminative training approach for LVCSR
This paper presents a discriminative training (DT) approach to irrelevant variability normalization (IVN) based training of feature transforms and hidden Markov models for large v...
Yu Zhang, Jian Xu, Zhi-Jie Yan, Qiang Huo
ICASSP
2011
IEEE
12 years 11 months ago
Whole word discriminative point process models
This paper introduces a discriminative extension to whole-word point process modeling techniques. Meant to circumvent the strong independence assumptions of their generative prede...
Aren Jansen