Feature representation for multimedia content is key to progress on many fundamental multimedia tasks. Although recent advances in deep feature learning offer a promising route toward these tasks, their applicability is limited in domains where high-quality, large-scale training data are hard to obtain. In this paper, we propose a novel deep feature learning paradigm based on large, noisy, social image-tag collections, which can be acquired from the inexhaustible social multimedia content on the Web. Instead of learning features from high-quality image-label supervision, we propose to learn from image-word semantic relations by seeking a unified image-word embedding space in which pairwise feature similarities preserve the semantic relations of the original image-word pairs. We offer an easy-to-use implementation of the proposed paradigm, which is fast and readily integrated into any state-of-the-art deep architecture. Experiments on NUS-WIDE...
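
A minimal sketch of the unified image-word embedding idea described above, assuming a PyTorch setup: an image branch and a word-embedding branch are projected into one shared space, and a margin-based ranking loss pulls each image toward its observed (noisy) tag while pushing it away from other tags in the batch. All names here (ImageWordEmbedder, pairwise_ranking_loss, the dimensions, and the hinge loss itself) are illustrative assumptions, not the paper's exact model or objective.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ImageWordEmbedder(nn.Module):
    """Hypothetical two-branch model mapping images and words into one space."""
    def __init__(self, image_feat_dim, vocab_size, embed_dim=256):
        super().__init__()
        # Project pre-extracted image features into the shared space.
        self.image_proj = nn.Linear(image_feat_dim, embed_dim)
        # Learn one vector per tag/word in the same space.
        self.word_embed = nn.Embedding(vocab_size, embed_dim)

    def forward(self, image_feats, word_ids):
        img = F.normalize(self.image_proj(image_feats), dim=-1)
        wrd = F.normalize(self.word_embed(word_ids), dim=-1)
        return img, wrd

def pairwise_ranking_loss(img, wrd, margin=0.2):
    """Pull matched image-word pairs together; push mismatched pairs apart.

    sim[i, j] is the cosine similarity between image i and the tag observed
    with image j; the diagonal holds the true (noisy) image-tag pairs.
    """
    sim = img @ wrd.t()                      # (B, B) similarity matrix
    pos = sim.diag().unsqueeze(1)            # similarities of true pairs
    # Hinge loss over in-batch negatives (off-diagonal entries).
    loss = (margin - pos + sim).clamp(min=0)
    loss = loss - loss.diag().diag_embed()   # zero out the diagonal terms
    return loss.mean()

# Usage: one training step on a toy batch.
model = ImageWordEmbedder(image_feat_dim=2048, vocab_size=10000)
feats = torch.randn(8, 2048)                 # e.g. pooled CNN features
tags = torch.randint(0, 10000, (8,))         # one noisy tag per image
img, wrd = model(feats, tags)
loss = pairwise_ranking_loss(img, wrd)
loss.backward()
```

Because the word branch is just an embedding table and the loss uses only in-batch similarities, a sketch like this can sit on top of any backbone that produces image features, which is consistent with the abstract's claim that the paradigm integrates with existing deep architectures.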