Sciweavers

2047 search results - page 86 / 410
» The limits of speech recognition
Sort
View
MIR
2003
ACM
161views Multimedia» more  MIR 2003»
14 years 2 months ago
Highlight scene extraction in real time from baseball live video
This paper proposes a method to automatically extract highlight scenes from sports (baseball) live video in real time and to allow users to retrieve them. For this purpose, sophis...
Yasuo Ariki, Masahito Kumano, Kiyoshi Tsukada
ICASSP
2009
IEEE
14 years 3 months ago
Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion m
This paper considers a method for speech emotion recognition by a max-margin framework incorporating a loss function based on a well-known model called the Watson and Tellegen’s...
Sungrack Yun, Chang D. Yoo
NAACL
2010
13 years 6 months ago
Cheap, Fast and Good Enough: Automatic Speech Recognition with Non-Expert Transcription
Deploying an automatic speech recognition system with reasonable performance requires expensive and time-consuming in-domain transcription. Previous work demonstrated that non-pro...
Scott Novotney, Chris Callison-Burch
INTERSPEECH
2010
13 years 3 months ago
Augmentation of adaptation data
Linear regression based speaker adaptation approaches can improve Automatic Speech Recognition (ASR) accuracy significantly for a target speaker. However, when the available adapt...
Ravichander Vipperla, Steve Renals, Joe Frankel
ICASSP
2008
IEEE
14 years 3 months ago
Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition
The fusion of information from heterogenous sensors is crucial to the effectiveness of a multimodal system. Noise affect the sensors of different modalities independently. A good ...
Shankar T. Shivappa, Bhaskar D. Rao, Mohan M. Triv...