The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations and other body motions, such as those of the head, convey additional information...
Iain Matthews, Timothy F. Cootes, J. Andrew Bangha...
Visual information has been shown to improve the performance of speech recognition systems in noisy acoustic environments. However, most audio-visual speech recognizers rely on a ...
Inspired by recent findings on the similarities between the primary auditory and visual cortices, we propose a neural network for speech recognition based on a hierarchical feedforw...
Xavier Domont, Martin Heckmann, Heiko Wersing, Fra...
The present study proposes an inter-speaker audiovisual synchronization method to reduce the speaker dependency of our direct speech-to-animation conversion system. Our aim is to...
We propose a novel multi-stream framework for continuous conversational speech recognition which employs bidirectional Long Short-Term Memory (BLSTM) networks for phoneme predicti...
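The core idea behind the bidirectional networks mentioned in this last abstract is that each speech frame is encoded with both left (past) and right (future) context. As a minimal sketch, the snippet below uses a plain tanh RNN rather than an LSTM, and all dimensions and parameter names are illustrative assumptions, not taken from the cited system:

```python
import numpy as np

def rnn_pass(x, W_in, W_rec, b, reverse=False):
    """Run a simple tanh RNN over a sequence, optionally right-to-left."""
    T = x.shape[0]
    H = b.shape[0]
    h = np.zeros(H)
    states = np.zeros((T, H))
    steps = range(T - 1, -1, -1) if reverse else range(T)
    for t in steps:
        # Note: a real BLSTM would use gated LSTM cells here instead of tanh.
        h = np.tanh(x[t] @ W_in + h @ W_rec + b)
        states[t] = h
    return states

def bidirectional_features(x, params_fwd, params_bwd):
    """Concatenate forward and backward hidden states per frame, so each
    frame's feature vector reflects both past and future context."""
    fwd = rnn_pass(x, *params_fwd, reverse=False)
    bwd = rnn_pass(x, *params_bwd, reverse=True)
    return np.concatenate([fwd, bwd], axis=1)

rng = np.random.default_rng(0)
T, D, H = 12, 8, 16  # frames, input dim (e.g. acoustic features), hidden units
x = rng.standard_normal((T, D))
params = lambda: (rng.standard_normal((D, H)) * 0.1,   # input weights
                  rng.standard_normal((H, H)) * 0.1,   # recurrent weights
                  np.zeros(H))                         # bias
feats = bidirectional_features(x, params(), params())
print(feats.shape)  # (12, 32): one 2H-dim context vector per frame
```

In a multi-stream recognizer, per-frame outputs like these from separate audio and visual streams would then be combined before decoding.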