Search Sciweavers | Sciweavers

135

NIPS
2007

134views Information Technology» more NIPS 2007»

A probabilistic model for generating realistic lip movements from speech

15 years 5 months ago

The present work aims to model the correspondence between facial motion and speech. The face and sound are modelled separately, with phonemes being the link between both. We propo...

Gwenn Englebienne, Tim Cootes, Magnus Rattray

claim paper

Read More »

164

click to vote

ICA
2007
Springer

121views Signal Processing» more ICA 2007»

Discovering Convolutive Speech Phones Using Sparseness and Non-negativity

15 years 10 months ago

Download ee.ucd.ie

Discovering a representation that allows auditory data to be parsimoniously represented is useful for many machine learning and signal processing tasks. Such a representation can b...

Paul D. O'Grady, Barak A. Pearlmutter

claim paper

Read More »

164

click to vote

ICASSP
2009
IEEE

166views Signal Processing» more ICASSP 2009»

COSINE - A corpus of multi-party COnversational Speech In Noisy Environments

15 years 11 months ago

Download ssli.ee.washington.edu

We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (COSINE) corpus. The corpus is a set of multi-party con...

Alex Stupakov, Evan Hanusa, Jeff A. Bilmes, Dieter...

claim paper

Read More »

162

click to vote

ISVC
2009
Springer

166views Applied Computing» more ISVC 2009»

Speech-Driven Facial Animation Using a Shared Gaussian Process Latent Variable Model

15 years 11 months ago

Download aig.cs.man.ac.uk

Abstract. In this work, synthesis of facial animation is done by modelling the mapping between facial motion and speech using the shared Gaussian process latent variable model. Bot...

Salil Deena, Aphrodite Galata

claim paper

Read More »

142

click to vote

ICPR
2008
IEEE

156views Computer Vision» more ICPR 2008»

A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition

16 years 5 months ago

Download ivpl.ece.northwestern.edu

This work extends and improves a recently introduced (Dec. 2007) dynamic Bayesian network (DBN) based audio-visual automatic speech recognition (AVASR) system. That system models ...

Louis H. Terry, Aggelos K. Katsaggelos

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers