Coherent bag-of audio words model for efficient large-scale video copy detection

15 years 1 months ago

Download www.nlpr.ia.ac.cn

Current content-based video copy detection approaches mostly concentrate on the visual cues and neglect the audio information. In this paper, we attempt to tackle the video copy detection task resorting to audio information, which is equivalently important as well as visual information in multimedia processing. Firstly, inspired by bag-of visual words model, a bag-of audio words (BoA) representation is proposed to characterize each audio frame. Different from naive singlebased modeling audio retrieval approaches, BoA is a highlevel model due to its perceptual and semantical property. Within the BoA model, a coherency vocabulary indexing structure is adopted to achieve more efficient and effective indexing than single vocabulary of standard BoW model. The coherency vocabulary takes advantage of multiple audio features by computing co-occurrence of them across different feature spaces. By enforcing the tight coherency constraint across feature spaces, coherency vocabulary makes the BoA ...

Yang Liu, Wanlei Zhao, Chong-Wah Ngo, Changsheng X

Real-time Traffic

CIVR 2010 | Coherency Vocabulary | Image Analysis | Video Copy Detection | Visual |

claim paper

Post Info
More Details (n/a)

Added	13 May 2011
Updated	13 May 2011
Type	Journal
Year	2010
Where	CIVR
Authors	Yang Liu, Wanlei Zhao, Chong-Wah Ngo, Changsheng Xu, Hanqing Lu

Comments (0)

Sciweavers

Coherent bag-of audio words model for efficient large-scale video copy detection

CIVR 2010 | Coherency Vocabulary | Image Analysis | Video Copy Detection | Visual |

Explore & Download

Productivity Tools

Sciweavers