Inferring body pose using speech content

Untethered multimodal interfaces are more attractive than tethered ones because they are more natural and expressive for interaction. Such interfaces usually require robust vision-based body pose estimation and gesture recognition. In interfaces where a user is interacting with a computer using speech and arm gestures, the user’s spoken keywords can be recognized in conjunction with a hypothesis of body poses. This co-occurrence can reduce the number of body pose hypotheses for the vision-based tracker. In this paper we show that incorporating speech-based body pose constraints can increase the robustness and accuracy of vision-based tracking systems. Next, we describe an approach for gesture recognition. We show how Linear Discriminant Analysis (LDA) can be employed to estimate ‘good features’ that can be used in a standard HMM-based gesture recognition system. We show that, by applying our LDA scheme, recognition errors can be significantly reduced over a standard HMM-based te...
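The idea that a recognized keyword can narrow the set of body pose hypotheses can be illustrated with a small Bayesian reweighting sketch. This is not the authors' method, which the abstract does not specify; it only assumes a discrete set of pose hypotheses, a prior over them from the vision tracker, and a table of keyword likelihoods per pose. All names (`reweight_pose_hypotheses`, `prune`, the pose labels, the keywords) and all numbers are hypothetical.

```python
def reweight_pose_hypotheses(prior, keyword_likelihood, keyword):
    """Return P(pose | keyword) over a discrete pose set via Bayes' rule.

    prior:              {pose: P(pose)} from the vision tracker (assumed input)
    keyword_likelihood: {pose: {keyword: P(keyword | pose)}} (assumed table)
    """
    unnormalized = {
        pose: p * keyword_likelihood[pose].get(keyword, 0.0)
        for pose, p in prior.items()
    }
    total = sum(unnormalized.values())
    if total == 0.0:
        # The keyword co-occurs with no hypothesis: keep the prior unchanged.
        return dict(prior)
    return {pose: v / total for pose, v in unnormalized.items()}


def prune(posterior, threshold=0.05):
    """Drop hypotheses whose posterior probability is below the threshold."""
    return {pose: p for pose, p in posterior.items() if p >= threshold}


# Hypothetical example values, chosen only for illustration.
prior = {"arm_raised": 0.25, "arm_extended": 0.25, "arm_down": 0.5}
keyword_likelihood = {
    "arm_raised": {"up": 0.75, "there": 0.25},
    "arm_extended": {"up": 0.25, "there": 0.75},
    "arm_down": {"up": 0.0, "there": 0.0},
}

posterior = reweight_pose_hypotheses(prior, keyword_likelihood, "there")
survivors = prune(posterior)
print(posterior)  # arm_extended becomes the most probable hypothesis
print(sorted(survivors))  # arm_down is pruned
```

In this toy setting the keyword removes one of three hypotheses before the tracker evaluates them, which is the kind of reduction the abstract describes; a real system would work with continuous pose spaces and learned co-occurrence statistics.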

Sy Bor Wang, David Demirdjian

Body Poses | Gesture Recognition | ICMI 2005 | Vision-based Tracking System |

Post Info

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ICMI
Authors	Sy Bor Wang, David Demirdjian
