Sciweavers

41 search results - page 6 / 9
» Speaker Independent Audio-Visual Speech Recognition
ICASSP
2010
IEEE
13 years 7 months ago
Jointly recognizing multi-speaker conversations
We suggest an approach to speech recognition where multiple sides of a conversation in a dialog or meeting are processed and decoded jointly rather than independently. We moreover...
Gang Ji, Jeff Bilmes
ICPR
2010
IEEE
13 years 10 months ago
Crossmodal Matching of Speakers Using Lip and Voice Features in Temporally Non-Overlapping Audio and Video Streams
Person identification using audio (speech) and visual (facial appearance, static or dynamic) modalities, either independently or jointly, is a thoroughly investigated problem in pa...
Anindya Roy, Sebastien Marcel
CSL
2002
Springer
13 years 7 months ago
Transformation streams and the HMM error model
The most popular model used in automatic speech recognition is the hidden Markov model (HMM). Though good performance has been obtained with such models there are well known limit...
M. J. F. Gales
ICMCS
2000
IEEE
14 years 1 day ago
Towards a Multimodal Meeting Record
Face-to-face meetings usually encompass several modalities including speech, gesture, handwriting, and person identification. Recognition and integration of each of these modalit...
Ralph Gross, Michael Bett, Hua Yu, Xiaojin Zhu, Yu...
TSD
2004
Springer
14 years 1 months ago
Dynamic Unit Selection for Very Low Bit Rate Coding at 500 bits/sec
This paper presents a new unit selection process for Very Low Bit Rate speech encoding around 500 bits/sec. The encoding is based on speech recognition and speech synthesis technol...
Marc Padellini, François Capman, Geneviè...