Sciweavers

417 search results - page 28 / 84
» Signal processing tools for speech recognition
Sort
View
ICASSP
2011
IEEE
12 years 11 months ago
Structured precision modelling with Cholesky Basis Superposition for speech recognition
Structured precision modelling is an important approach to improve the intra-frame correlation modelling of the standard HMM, where Gaussian mixture model with diagonal covariance...
Lei Jia, Kai Yu, Bo Xu
ICASSP
2011
IEEE
12 years 11 months ago
Pronunciation variants generation using SMT-inspired approaches
Enriching a pronunciation dictionary with phonological variation is a challenging task, not yet solved despite several decades of research, in particular for speech-to-text transc...
Panagiota Karanasou, Lori Lamel
MIR
2003
ACM
161views Multimedia» more  MIR 2003»
14 years 27 days ago
Highlight scene extraction in real time from baseball live video
This paper proposes a method to automatically extract highlight scenes from sports (baseball) live video in real time and to allow users to retrieve them. For this purpose, sophis...
Yasuo Ariki, Masahito Kumano, Kiyoshi Tsukada
ICASSP
2009
IEEE
14 years 2 months ago
A flat direct model for speech recognition
We introduce a direct model for speech recognition that assumes an unstructured, i.e., flat text output. The flat model allows us to model arbitrary attributes and dependences o...
Georg Heigold, Geoffrey Zweig, Xiao Li, Patrick Ng...
TSD
2004
Springer
14 years 1 months ago
Dynamic Unit Selection for Very Low Bit Rate Coding at 500 bits/sec
This paper presents a new unit selection process for Very Low Bit Rate speech encoding around 500 bits/sec. The encoding is based on speech recognition and speech synthesis technol...
Marc Padellini, François Capman, Genevi&egr...