Sciweavers

184 search results - page 36 / 37
» Introduction of the Speaking Rate in the Model of Speech Rec...
Sort
View
INTERSPEECH
2010
13 years 2 months ago
Augmentation of adaptation data
Linear regression based speaker adaptation approaches can improve Automatic Speech Recognition (ASR) accuracy significantly for a target speaker. However, when the available adapt...
Ravichander Vipperla, Steve Renals, Joe Frankel
ICASSP
2008
IEEE
14 years 2 months ago
Phonetic pronunciations for arabic speech-to-text systems
In this paper two aspects of generating and using phonetic Arabic dictionaries are described. First, the use of single pronunciation acoustic models in the context of Arabic large...
Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Phi...
MM
2006
ACM
151views Multimedia» more  MM 2006»
14 years 1 months ago
News video search with fuzzy event clustering using high-level features
Precise automated video search is gaining in importance as the amount of multimedia information is increasing at exponential rates. One of the drawbacks that make video retrieval ...
Shi-Yong Neo, Yantao Zheng, Tat-Seng Chua, Qi Tian
DSMML
2004
Springer
14 years 1 months ago
Multi Channel Sequence Processing
Abstract. This paper summarizes some of the current research challenges arising from multi-channel sequence processing. Indeed, multiple real life applications involve simultaneous...
Samy Bengio, Hervé Bourlard
ICASSP
2010
IEEE
13 years 7 months ago
Semantic confidence calibration for spoken dialog applications
The success of spoken dialog applications depends strongly on the quality of the semantic confidence measure that determines the selection of the dialog strategy. However, the sem...
Dong Yu, Li Deng