Sciweavers

26 search results - page 5 / 6
» A computational auditory scene analysis system for speech se...
ICMI
2005
Springer
14 years 18 days ago
Inferring body pose using speech content
Untethered multimodal interfaces are more attractive than tethered ones because they are more natural and expressive for interaction. Such interfaces usually require robust vision...
Sy Bor Wang, David Demirdjian
AAAI
2008
13 years 9 months ago
Unstructured Audio Classification for Environment Recognition
My thesis aims to contribute towards building autonomous agents that are able to understand their surrounding environment through the use of both audio and visual information. To ...
Selina Chu
ICDAR
2009
IEEE
14 years 1 month ago
Text-Tracking Wearable Camera System for the Blind
Disability of visual text reading has a huge impact on the quality of life for visually disabled people. One of the most anticipated devices is a wearable camera capable of findi...
Hideaki Goto, Makoto Tanaka
ATAL
2007
Springer
14 years 1 month ago
Towards using multiple cues for robust object recognition
A robot’s ability to assist humans in a variety of tasks, e.g. in search and rescue or in a household, heavily depends on the robot’s reliable recognition of the objects in th...
Sarah Aboutalib, Manuela M. Veloso
RIAO
2000
13 years 8 months ago
Speaker change detection using joint audio-visual statistics
In this paper, we present an approach for speaker change detection in broadcast video using joint audio-visual scene change statistics. Our experiments indicate that using joint a...
Giridharan Iyengar, Chalapathy Neti, Sankar Basu