Sciweavers

193 search results - page 23 / 39
» To Separate Speech
Sort
View
NIPS
2007
13 years 10 months ago
A probabilistic model for generating realistic lip movements from speech
The present work aims to model the correspondence between facial motion and speech. The face and sound are modelled separately, with phonemes being the link between both. We propo...
Gwenn Englebienne, Tim Cootes, Magnus Rattray
ICA
2007
Springer
14 years 2 months ago
Discovering Convolutive Speech Phones Using Sparseness and Non-negativity
Discovering a representation that allows auditory data to be parsimoniously represented is useful for many machine learning and signal processing tasks. Such a representation can b...
Paul D. O'Grady, Barak A. Pearlmutter
ICASSP
2009
IEEE
14 years 3 months ago
COSINE - A corpus of multi-party COnversational Speech In Noisy Environments
We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (COSINE) corpus. The corpus is a set of multi-party con...
Alex Stupakov, Evan Hanusa, Jeff A. Bilmes, Dieter...
ISVC
2009
Springer
14 years 3 months ago
Speech-Driven Facial Animation Using a Shared Gaussian Process Latent Variable Model
Abstract. In this work, synthesis of facial animation is done by modelling the mapping between facial motion and speech using the shared Gaussian process latent variable model. Bot...
Salil Deena, Aphrodite Galata
ICPR
2008
IEEE
14 years 10 months ago
A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition
This work extends and improves a recently introduced (Dec. 2007) dynamic Bayesian network (DBN) based audio-visual automatic speech recognition (AVASR) system. That system models ...
Louis H. Terry, Aggelos K. Katsaggelos