Sciweavers

57 search results - page 9 / 12
» A multimodal approach to music transcription
MM
2004
ACM
14 years 29 days ago
LyricAlly: automatic synchronization of acoustic musical signals and textual lyrics
We present a prototype that automatically aligns acoustic musical signals with their corresponding textual lyrics, in a manner similar to manually-aligned karaoke. We tackle this ...
Ye Wang, Min-Yen Kan, Tin Lay Nwe, Arun Shenoy, Ju...
RIAO
2000
13 years 9 months ago
Multimodal Meeting Tracker
Face-to-face meetings usually encompass several modalities including speech, gesture, handwriting, and person identification. Recognition and integration of each of these modaliti...
Michael Bett, Ralph Gross, Hua Yu, Xiaojin Zhu, Yu...
ICMCS
2008
IEEE
14 years 2 months ago
Automatic video annotation through search and mining
Conventional approaches to video annotation predominantly focus on supervised identification of a limited set of concepts, while unsupervised annotation with infinite vocabulary...
Emily Moxley, Tao Mei, Xian-Sheng Hua, Wei-Ying Ma...
RIAO
2000
13 years 9 months ago
Speaker change detection using joint audio-visual statistics
In this paper, we present an approach for speaker change detection in broadcast video using joint audio-visual scene change statistics. Our experiments indicate that using joint a...
Giridharan Iyengar, Chalapathy Neti, Sankar Basu
MLMI
2004
Springer
14 years 27 days ago
Multistream Dynamic Bayesian Network for Meeting Segmentation
This paper investigates the automatic analysis and segmentation of meetings. A meeting is analysed in terms of individual behaviours and group interactions, in order to decompose e...
Alfred Dielmann, Steve Renals