Sciweavers

45 search results - page 3 / 9
» WAPUSK20 - A Database for Robust Audiovisual Speech Recognit...
Sort
View
NOLISP
2005
Springer
14 years 15 days ago
Third-Order Moments of Filtered Speech Signals for Robust Speech Recognition
Novel speech features calculated from third-order statistics of subband-filtered speech signals are introduced and studied for robust speech recognition. These features have the p...
Kevin M. Indrebo, Richard J. Povinelli, Michael T....
ICIP
2003
IEEE
14 years 8 months ago
On automatic annotation of meeting databases
In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and m...
Daniel Gatica-Perez, Hervé Bourlard, Iain M...
ACII
2005
Springer
14 years 14 days ago
Pronunciation Learning and Foreign Accent Reduction by an Audiovisual Feedback System
Abstract. Global integration and migration force people to learn additional languages. With respect to major languages, the acquisition is already initiated at primary school but a...
Oliver Jokisch, Uwe Koloska, Diane Hirschfeld, R&u...
CVPR
2012
IEEE
11 years 9 months ago
Robust Boltzmann Machines for recognition and denoising
While Boltzmann Machines have been successful at unsupervised learning and density modeling of images and speech data, they can be very sensitive to noise in the data. In this pap...
Yichuan Tang, Ruslan Salakhutdinov, Geoffrey E. Hi...
NOLISP
2007
Springer
14 years 1 months ago
A Hybrid Genetic-Neural Front-End Extension for Robust Speech Recognition over Telephone Lines
This paper presents a hybrid technique combining the Karhonen-Loeve Transform (KLT), the Multilayer Perceptron (MLP) and Genetic Algorithms (GAs) to obtain less-variant Mel-freque...
Sid-Ahmed Selouani, Habib Hamam, Douglas D. O'Shau...