Image2Emoji: Zero-shot Emoji Prediction for Visual Media

8 years 7 months ago

Download staff.fnwi.uva.nl

We present Image2Emoji, a multi-modal approach for generating emoji labels for an image in a zero-shot manner. Diﬀerent from existing zero-shot image-to-text approaches, we exploit both image and textual media to learn a semantic embedding for the new task of emoji prediction. We propose that the widespread adoption of emoji suggests a semantic universality which is well-suited for interaction with visual media. We quantify the eﬃcacy of our proposed model on the MSCOCO dataset, and demonstrate the value of visual, textual and multi-modal prediction of emoji. We conclude the paper with three examples of the application potential of emoji in the context of multimedia retrieval.

Spencer Cappallo, Thomas Mensink, Cees G. M. Snoek

Real-time Traffic

MM 2015 | Multimedia |

claim paper

Post Info
More Details (n/a)

Added	14 Apr 2016
Updated	14 Apr 2016
Type	Journal
Year	2015
Where	MM
Authors	Spencer Cappallo, Thomas Mensink, Cees G. M. Snoek

Comments (0)

Sciweavers

Image2Emoji: Zero-shot Emoji Prediction for Visual Media

MM 2015 | Multimedia |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers