Automatic multi-modal dialogue scene indexing

16 years 9 months ago

Download www.eee.metu.edu.tr

An automatic algorithm for indexing dialogue scenes in multimedia content is proposed. The content is segmented into dialogue scenes using the state transitions of a hidden Markov model (HMM). Each shot is classified using both audio and visual information to determine the state/scene transitions for this model. Face detection and silence/speech/music classification are the basic tools which are utilized to index the scenes. While face information is extracted after applying some heuristics to skin-colored regions, audio analysis is achieved by examining signal energy, periodicity and zero crossing rate (ZCR) of the audio waveform. The simulation results show the possibility of automatically indexing the dialogues using the proposed algorithm.

A. Aydin Alatan

Real-time Traffic

Audio Waveform | Hidden Markov Model | ICIP 2001 | Image Processing | Indexing Dialogue Scenes |

claim paper

Post Info
More Details (n/a)

Added	25 Oct 2009
Updated	25 Oct 2009
Type	Conference
Year	2001
Where	ICIP
Authors	A. Aydin Alatan

Comments (0)

Sciweavers

Automatic multi-modal dialogue scene indexing

Audio Waveform | Hidden Markov Model | ICIP 2001 | Image Processing | Indexing Dialogue Scenes |

Explore & Download

Productivity Tools

Sciweavers