Sciweavers

36 search results - page 4 / 8
» Phoneme Recognition with Staged Neural Networks
Sort
View
ACSC
2004
IEEE
13 years 11 months ago
Sensor Fusion Weighting Measures in Audio-Visual Speech Recognition
Audio-Visual Speech Recognition (AVSR) uses vision to enhance speech recognition but also introduces the problem of how to join (or fuse) these two signals together. Mainstream re...
Trent W. Lewis, David M. W. Powers
NIPS
1994
13 years 8 months ago
SARDNET: A Self-Organizing Feature Map for Sequences
A self-organizing neural network for sequence classification called SARDNET is described and analyzed experimentally. SARDNET extends the Kohonen Feature Map architecture with act...
Daniel L. James, Risto Miikkulainen
ISNN
2011
Springer
12 years 10 months ago
Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents
Abstract. Systems for keyword and non-linguistic vocalization detection in conversational agent applications need to be robust with respect to background noise and different speak...
Martin Wöllmer, Erik Marchi, Stefano Squartin...
ICASSP
2011
IEEE
12 years 11 months ago
A multi-stream ASR framework for BLSTM modeling of conversational speech
We propose a novel multi-stream framework for continuous conversational speech recognition which employs bidirectional Long Short-Term Memory (BLSTM) networks for phoneme predicti...
Martin Wöllmer, Florian Eyben, Björn Sch...
MICAI
2000
Springer
13 years 10 months ago
Verification of Correct Pronunciation of Mexican Spanish Using Speech Technology
This paper presents a new method for the verification of the correct pronunciation of spoken words. This process is based on speech recognition technology. It can be particularly ...
Ingrid Kirschning, Nancy Aguas