Sciweavers

53 search results - page 1 / 11
ICPR
2006
IEEE
14 years 1 month ago
Audio Segmentation and Speaker Localization in Meeting Videos
Segmenting different individuals in a group meeting and their speech is an important first step for various tasks such as meeting transcription, automatic camera panning, multime...
Himanshu Vajaria, Tanmoy Islam, Sudeep Sarkar, Rav...
TCSV
2008
13 years 7 months ago
Exploring Co-Occurrence Between Speech and Body Movement for Audio-Guided Video Localization
This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
H. Vajaria, S. Sarkar, R. Kasturi
ICASSP
2010
IEEE
13 years 7 months ago
Low-latency online speaker tracking on the AMI Corpus of meeting conversations
Ambient Intelligence aims to create smart spaces providing services in a transparent and non-intrusive fashion, so context awareness and user adaptation are key issues. Speech can ...
Maider Zamalloa, Luis Javier Rodríguez-Fuen...
ICMCS
2005
IEEE
14 years 1 month ago
Using spatial cues for meeting speech segmentation
This work investigates the validity and accuracy of using spatial cues with Time-Delay Estimation (TDE) as a method of segmenting multichannel recorded speech by speaker location....
Eva Cheng, Jason Lukasiak, Ian S. Burnett, David S...
ICASSP
2009
IEEE
14 years 2 months ago
Multi-modal speaker diarization of real-world meetings using compressed-domain video features
Speaker diarization is originally defined as the task of determining “who spoke when” given an audio track and no other prior knowledge of any kind. The following article sho...
Gerald Friedland, Hayley Hung, Chuohao Yeo