Sciweavers

931 search results - page 169 / 187
» Speech for Multimedia Information Retrieval
Sort
View
MM
2006
ACM
157views Multimedia» more  MM 2006»
15 years 11 months ago
Syllabic level automatic synchronization of music signals and text lyrics
We present a framework to synchronize pop music to corresponding text lyric. We refine line level alignment achievable by existing work to syllabic level by using a dynamic progra...
Denny Iskandar, Ye Wang, Min-Yen Kan, Haizhou Li
MM
2005
ACM
110views Multimedia» more  MM 2005»
15 years 11 months ago
Photo LOI: browsing multi-user photo collections
The number of digital photographs is growing beyond the abilities of individuals to easily manage and understand their own photo collections. Photo LOI (Level of Interest) is a te...
Rahul Nair, Nick Reid, Marc Davis
MM
2004
ACM
154views Multimedia» more  MM 2004»
15 years 11 months ago
PLSA-based image auto-annotation: constraining the latent space
We address the problem of unsupervised image auto-annotation with probabilistic latent space models. Unlike most previous works, which build latent space representations assuming ...
Florent Monay, Daniel Gatica-Perez
MM
2009
ACM
147views Multimedia» more  MM 2009»
15 years 10 months ago
Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior
While existing studies on YouTube’s massive user-generated video content have mostly focused on the analysis of videos, their characteristics, and network properties, little att...
Joan-Isaac Biel, Daniel Gatica-Perez
LREC
2010
237views Education» more  LREC 2010»
15 years 7 months ago
Entity Mention Detection using a Combination of Redundancy-Driven Classifiers
We present an experimental framework for Entity Mention Detection in which two different classifiers are combined to exploit Data Redundancy attained through the annotation of a l...
Silvana Marianela Bernaola Biggio, Manuela Speranz...