Detection of Documentary Scene Changes by Audio-Visual Fusion

16 years 18 days ago

Download www.ifp.illinois.edu

The concept of a documentary scene was inferred from the audio-visual characteristics of certain documentary videos. It was observed that the amount of information from the visual component alone was not enough to convey a semantic context to most portions of these videos, but a joint observation of the visual component and the audio component conveyed a better semantic context. From the observations that we made on the video data, we generated an audio score and a visual score. We later generated a weighted audio-visual score within an interval and adaptively expanded or shrunk this interval until we found a local maximum score value. The video ultimately will be divided into a set of intervals that correspond to the documentary scenes in the video. After we obtained a set of documentary scenes, we made a check for any redundant detections.

Atulya Velivelli, Chong-Wah Ngo, Thomas S. Huang

Real-time Traffic

CIVR 2003 | Documentary Scenes | Semantic Context | Visual Component |

claim paper

» Detection and Location of People in Video Images Using Adaptive Fusion of Color and Edge I...

» Flux Tensor Constrained Geodesic Active Contours with Sensor Fusion for Persistent Object ...

Post Info
More Details (n/a)

Added	06 Jul 2010
Updated	06 Jul 2010
Type	Conference
Year	2003
Where	CIVR
Authors	Atulya Velivelli, Chong-Wah Ngo, Thomas S. Huang

Comments (0)

Sciweavers

Detection of Documentary Scene Changes by Audio-Visual Fusion

CIVR 2003 | Documentary Scenes | Semantic Context | Visual Component |

Explore & Download

Productivity Tools

Sciweavers