Feature representation for multimedia content is the key to the progress of many fundamental multimedia tasks. Although recent advances in deep feature learning offer a promising...
Hanwang Zhang, Xindi Shang, Huan-Bo Luan, Yang Yan...
Human action recognition from realistic videos plays a key role in multimedia event detection and understanding. In this paper, a novel Trajectory Based Covariance (TBC) descripto...
Images in social networks share different destinies: some are going to become popular while others are going to be completely unnoticed. In this paper we propose to use visual se...
Francesco Gelli, Tiberio Uricchio, Marco Bertini, ...
Deep learning has shown outstanding performance in various machine learning tasks. However, the deep complex model structure and massive training data make it expensive to train. ...
Perception of multimedia quality is shaped by a rich interplay between system, context and human factors. While system and context factors are widely researched, few studies consi...
Michael James Scott, Sharath Chandra Guntuku, Huan...
Topic models such as Latent Dirichlet Allocation (LDA) [3] have been extensively used for characterizing text collections according to the topics discussed in documents. Organizin...
Damiano Spina, Johanne R. Trippas, Lawrence Cavedo...
The fifth Audio-Visual Emotion Challenge and workshop AVEC 2015 was held in conjunction ACM Multimedia’15. Like the previous editions of AVEC, the workshop/challenge addresses ...
In recent years, deep networks have been successfully applied to model image concepts and achieved competitive performance on many data sets. In spite of impressive performance, t...
New technologies arise in a number of ways. They may come from advances in scientific research, through new combinations of existing technologies, or by simply imagining what migh...
Tony Dunnigan, John Doherty, Daniel Avrahami, Jaco...