Sciweavers

123 search results - page 15 / 25
» Improving Acoustic Models with Captioned Multimedia Speech
Sort
View
ICPR
2006
IEEE
14 years 8 months ago
Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks
This paper investigates the problem of incorporating auxiliary information (e.g. pitch) for speech recognition using dynamic Bayesian networks (DBNs). Previous works usually model...
Hui Lin 0001, Zhijian Ou
COLING
2008
13 years 9 months ago
Towards Incremental End-of-Utterance Detection in Dialogue Systems
We define the task of incremental or 0lag utterance segmentation, that is, the task of segmenting an ongoing speech recognition stream into utterance units, and present first resu...
Michaela Atterer, Timo Baumann, David Schlangen
ICASSP
2009
IEEE
14 years 2 months ago
A speech fragment approach to localising multiple speakers in reverberant environments
Sound source localisation cues are severely degraded when multiple acoustic sources are active in the presence of reverberation. We present a binaural system for localising simult...
Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon...
ICMCS
2006
IEEE
117views Multimedia» more  ICMCS 2006»
14 years 1 months ago
Data Hiding for Speech Bandwidth Extension and its Hardware Implementation
Most of the current speech transmission systems are only able to deliver speech signals in a narrow frequency band. This narrowband speech is characterized by a thin and muffled ...
Fan Wu, Siyue Chen, Henry Leung
PAMI
2002
98views more  PAMI 2002»
13 years 7 months ago
Extraction of Visual Features for Lipreading
The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations and other body motion, such as those of the head, convey additional information...
Iain Matthews, Timothy F. Cootes, J. Andrew Bangha...