Sciweavers

581 search results - page 36 / 117
» A hierarchical point process model for speech recognition
Sort
View
ISMIR
2000
Springer
168views Music» more  ISMIR 2000»
14 years 9 days ago
Mel Frequency Cepstral Coefficients for Music Modeling
We examine in some detail Mel Frequency Cepstral Coefficients (MFCCs) - the dominant features used for speech recognition - and investigate their applicability to modeling music. ...
Beth Logan
MIR
2003
ACM
161views Multimedia» more  MIR 2003»
14 years 2 months ago
Highlight scene extraction in real time from baseball live video
This paper proposes a method to automatically extract highlight scenes from sports (baseball) live video in real time and to allow users to retrieve them. For this purpose, sophis...
Yasuo Ariki, Masahito Kumano, Kiyoshi Tsukada
BIOADIT
2004
Springer
14 years 14 days ago
Biologically Plausible Speech Recognition with LSTM Neural Nets
Abstract. Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) are local in space and time and closely related to a biological model of memory in the prefrontal cortex. N...
Alex Graves, Douglas Eck, Nicole Beringer, Jü...
ACL
2001
13 years 10 months ago
Practical Issues in Compiling Typed Unification Grammars for Speech Recognition
Current alternatives for language modeling are statistical techniques based on large amounts of training data, and hand-crafted context-free or finite-state grammars that are diff...
John Dowding, Beth Ann Hockey, Jean Mark Gawron, C...
ICASSP
2011
IEEE
13 years 14 days ago
A conditional model for triggering understanding actions in a speech understanding system
A conditional model is introduced for triggering understanding actions that correct errors of frame hypothesization and composition. Experimental evidence is provided using the Fr...
Frédéric Duvert, Renato de Mori