A probabilistic multimodal approach for predicting listener backchannels

During face-to-face interactions, listeners use backchannel feedback such as head nods to signal to the speaker that the communication is working and that the speaker should continue. Predicting these backchannel opportunities is an important milestone for building engaging and natural virtual humans. In this paper we show how sequential probabilistic models (e.g., Hidden Markov Models or Conditional Random Fields) can automatically learn from a database of human-to-human interactions to predict listener backchannels using the speaker's multimodal output features (e.g., prosody, spoken words, and eye gaze). The main challenges addressed in this paper are automatic selection of the relevant features and optimal feature representation for probabilistic models. For prediction of visual backchannel cues (i.e., head nods), our prediction model shows a statistically significant improvement over a previously published approach based on hand-crafted rules.
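To make the idea of a sequential probabilistic model concrete, the following is a minimal sketch of Viterbi decoding in a two-state Hidden Markov Model that labels each frame of speaker behavior as a backchannel opportunity or not. It is not the authors' model: the state names, the three discretized observation symbols, and every probability are hypothetical values chosen for illustration, and the feature selection and feature encoding steps that the abstract highlights are not shown.

```python
import numpy as np

# Hypothetical two-state HMM. States, symbols, and probabilities are
# illustrative assumptions, not values taken from the paper.
STATES = ["no_backchannel", "backchannel"]
# Assumed discretization of the speaker's multimodal output per frame.
SYMBOLS = {"speaking": 0, "pause": 1, "gaze_at_listener": 2}

start = np.array([0.9, 0.1])            # P(state at frame 0)
trans = np.array([[0.8, 0.2],           # P(next state | current state)
                  [0.5, 0.5]])
emit = np.array([[0.7, 0.2, 0.1],       # P(symbol | state)
                 [0.1, 0.4, 0.5]])


def viterbi(obs, start, trans, emit):
    """Return the most likely hidden state index for each frame."""
    n_frames, n_states = len(obs), len(start)
    log_delta = np.full((n_frames, n_states), -np.inf)
    backptr = np.zeros((n_frames, n_states), dtype=int)

    log_delta[0] = np.log(start) + np.log(emit[:, obs[0]])
    for t in range(1, n_frames):
        # scores[i, j]: best log-probability of ending in state i at t-1
        # and then moving to state j at t.
        scores = log_delta[t - 1][:, None] + np.log(trans)
        backptr[t] = scores.argmax(axis=0)
        log_delta[t] = scores.max(axis=0) + np.log(emit[:, obs[t]])

    # Trace the best path backwards from the final frame.
    path = [int(log_delta[-1].argmax())]
    for t in range(n_frames - 1, 0, -1):
        path.append(int(backptr[t, path[-1]]))
    return path[::-1]


frames = ["speaking", "speaking", "pause", "gaze_at_listener", "speaking"]
obs = [SYMBOLS[f] for f in frames]
labels = [STATES[i] for i in viterbi(obs, start, trans, emit)]
```

In a learned system the three probability tables would be estimated from annotated human-to-human interactions rather than written by hand, and a Conditional Random Field would replace them with feature weights trained discriminatively.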

Louis-Philippe Morency, Iwan de Kok, Jonathan Gratch

AAMAS 2010 | Hidden Markov Model | Intelligent Agents | Probabilistic Models | Speaker Multimodal Output

Post Info

Added	08 Dec 2010
Updated	08 Dec 2010
Type	Conference
Year	2010
Where	AAMAS
Authors	Louis-Philippe Morency, Iwan de Kok, Jonathan Gratch

Sciweavers
