Audio Segmentation and Speaker Localization in Meeting Videos

14 years 9 months ago

Download marathon.csee.usf.edu

Segmenting different individuals in a group meeting and their speech is an important ﬁrst step for various tasks such as meeting transcription, automatic camera panning, multimedia retrieval and monologue detection. In this effort, given a meeting room video, we attempt to segment individual person’s speech and localize them in the video, based on data from a single audio and video source. The segmentation method is driven by audio and enhanced by video cues. We used Bayesian Information Criterion (BIC) to segment the feature vector streams and graph spectral partitioning to cluster them. We compare our results with audio based segmentation method and our localization technique with the commonly used mutual information.

Himanshu Vajaria, Tanmoy Islam, Sudeep Sarkar, Rav

Real-time Traffic

Computer Vision | ICPR 2006 | Important ﬁrst Step | Meeting Room Video | Segmentation Method |

claim paper

Post Info
More Details (n/a)

Added	11 Jun 2010
Updated	11 Jun 2010
Type	Conference
Year	2006
Where	ICPR
Authors	Himanshu Vajaria, Tanmoy Islam, Sudeep Sarkar, Ravi Sankar, Rangachar Kasturi

Comments (0)

Sciweavers

Audio Segmentation and Speaker Localization in Meeting Videos

Computer Vision | ICPR 2006 | Important ﬁrst Step | Meeting Room Video | Segmentation Method |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers