Sciweavers

2047 search results - page 16 / 410
» The limits of speech recognition
Sort
View
ISMIR
2004
Springer
130views Music» more  ISMIR 2004»
14 years 1 months ago
Speech-Recognition Interfaces for Music Information Retrieval: 'Speech Completion' and 'Speech Spotter'
This paper describes music information retrieval (MIR) systems featuring automatic speech recognition. Although various interfaces for MIR have been proposed, speech-recognition i...
Masataka Goto, Katunobu Itou, Koji Kitayama, Tetsu...
INTERSPEECH
2010
13 years 3 months ago
Unsupervised discovery and training of maximally dissimilar cluster models
One of the difficult problems of acoustic modeling for Automatic Speech Recognition (ASR) is how to adequately model the wide variety of acoustic conditions which may be present i...
Françoise Beaufays, Vincent Vanhoucke, Bria...
INTERSPEECH
2010
13 years 3 months ago
On the relation of Bayes risk, word error, and word posteriors in ASR
In automatic speech recognition, we are faced with a wellknown inconsistency: Bayes decision rule is usually used to minimize sentence (word sequence) error, whereas in practice w...
Ralf Schlüter, Markus Nußbaum-Thom, Her...
CHI
2006
ACM
14 years 9 months ago
Error correction of voicemail transcripts in SCANMail
Despite its widespread use, voicemail presents numerous usability challenges: People must listen to messages in their entirety, they cannot search by keywords, and audio files do ...
Moira Burke, Brian Amento, Philip L. Isenhour
TSD
2004
Springer
14 years 1 months ago
Multimodal Phoneme Recognition of Meeting Data
This paper describes experiments in automatic recognition of context-independent phoneme strings from meeting data using audiovisual features. Visual features are known to improve ...
Petr Motlícek, Jan Cernocký