Sciweavers

931 search results - page 169 / 187
» Speech for Multimedia Information Retrieval
Sort
View
MM
2006
ACM
157views Multimedia» more  MM 2006»
14 years 1 months ago
Syllabic level automatic synchronization of music signals and text lyrics
We present a framework to synchronize pop music to corresponding text lyric. We refine line level alignment achievable by existing work to syllabic level by using a dynamic progra...
Denny Iskandar, Ye Wang, Min-Yen Kan, Haizhou Li
MM
2005
ACM
110views Multimedia» more  MM 2005»
14 years 1 months ago
Photo LOI: browsing multi-user photo collections
The number of digital photographs is growing beyond the abilities of individuals to easily manage and understand their own photo collections. Photo LOI (Level of Interest) is a te...
Rahul Nair, Nick Reid, Marc Davis
MM
2004
ACM
154views Multimedia» more  MM 2004»
14 years 1 months ago
PLSA-based image auto-annotation: constraining the latent space
We address the problem of unsupervised image auto-annotation with probabilistic latent space models. Unlike most previous works, which build latent space representations assuming ...
Florent Monay, Daniel Gatica-Perez
MM
2009
ACM
147views Multimedia» more  MM 2009»
14 years 20 days ago
Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior
While existing studies on YouTube’s massive user-generated video content have mostly focused on the analysis of videos, their characteristics, and network properties, little att...
Joan-Isaac Biel, Daniel Gatica-Perez
LREC
2010
237views Education» more  LREC 2010»
13 years 9 months ago
Entity Mention Detection using a Combination of Redundancy-Driven Classifiers
We present an experimental framework for Entity Mention Detection in which two different classifiers are combined to exploit Data Redundancy attained through the annotation of a l...
Silvana Marianela Bernaola Biggio, Manuela Speranz...