Sciweavers

» Audio Segmentation and Speaker Localization in Meeting Video...
TSD
2004
Springer
14 years 27 days ago
Multimodal Phoneme Recognition of Meeting Data
This paper describes experiments in automatic recognition of context-independent phoneme strings from meeting data using audiovisual features. Visual features are known to improve ...
Petr Motlíček, Jan Černocký
ICMCS
2007
IEEE
14 years 1 months ago
Analysis, User Interface, and their Evaluation for Student Presentation Videos
In the domain of candidly-captured student presentation videos, we examine and evaluate approaches for multimodal analysis and indexing of audio and video. We apply visual segment...
Alexander Haubold, John R. Kender
ICMCS
2005
IEEE
14 years 1 months ago
Highlights extraction from sports video based on an audio-visual marker detection framework
We propose to use a visual object (e.g., the baseball catcher) detection algorithm to find local, semantic objects in video frames in addition to an audio classification algorit...
Ziyou Xiong, Regunathan Radhakrishnan, Ajay Divaka...
AINA
2007
IEEE
14 years 1 months ago
Speaker Tracking and Identifying Based on Indoor Localization System and Microphone Array
This paper presents a novel multimodal system to track the participants and identify the active speaker in the smart meeting room. Indoor localization system, Cicada, is used to o...
Xiaojie Chen, Yuanchun Shi, Wenfeng Jiang
ICIP
2003
IEEE
14 years 9 months ago
Audio-visual speaker tracking with importance particle filters
We present a probabilistic method for audio-visual (AV) speaker tracking, using an uncalibrated wide-angle camera and a microphone array. The algorithm fuses 2-D object shape and ...
Daniel Gatica-Perez, Guillaume Lathoud, Iain McCow...