Sciweavers

» Improving Acoustic Models with Captioned Multimedia Speech
INTERSPEECH
2010
13 years 2 months ago
Semi-automated update of automatic transcription system for the Japanese national congress
Update of acoustic and language models is vital to maintain performance of automatic speech recognition (ASR) systems. To alleviate efforts for updating models, we propose a "...
Yuya Akita, Masato Mimura, Graham Neubig, Tatsuya ...
ICASSP
2011
IEEE
12 years 11 months ago
Speaker and noise factorisation on the AURORA4 task
For many realistic scenarios, there are multiple factors that affect the clean speech signal. In this work approaches to handling two such factors, speaker and background noise di...
Yongqiang Wang, Mark J. F. Gales
ICMCS
2009
IEEE
13 years 5 months ago
Speech control in surgery: A field analysis and strategies
This work introduces a robot driven camera controlled by speech. The SIMIS database of 20 recordings of real life surgical operations serves as basis for analyses and noise modell...
Björn Schuller, Salman Can, Hubertus Feussner...
ICIP
2003
IEEE
14 years 9 months ago
Audio-visual speaker identification using coupled hidden Markov models
In this paper, we investigate the use of the coupled hidden Markov models (CHMM) for the task of audio-visual text dependent speaker identification. Our system determines the iden...
Tieyan Fu, Xiao Xing Liu, Lu Hong Liang, Xiaobo Pi...
INTERSPEECH
2010
13 years 2 months ago
Language model cross adaptation for LVCSR system combination
State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often combine outputs from multiple subsystems developed at different sites. Cross system adaptatio...
Xunying Liu, Mark J. F. Gales, Philip C. Woodland