In this demo, we focus on cross-modal (visual and textual) e-commerce search within the fashion domain. In particular, we demonstrate two tasks: 1) given a query image (without any a...
On touch-based devices such as smartphones and tablets, users are accustomed to browsing through lists and collections using flick gestures. For video navigation, however, mobile ...
Klaus Schoeffmann, Marco A. Hudelist, Bonifaz Kauf...
To provide comprehensive evaluation of interactive image segmentation algorithms, we propose an automatic scribble simulation approach. We first analyze the variety of scribbles l...
Millions of users tag multimedia content, generating a large vocabulary of tags. Some tags are frequent, while others are rarely used, following a long t...
Svetlana Kordumova, Jan C. van Gemert, Cees G. M. ...
Most current image retrieval approaches are based on the Bag-of-Words (BoW) model, which allows for representing an image efficiently and quickly. The efficiency of the BoW...
Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis ...
This paper introduces iAutoMotion, an autonomous video retrieval system that requires only minimal user input. It is based on the video retrieval engine IMOTION. iAutoMotion uses a...
Luca Rossetto, Ivan Giangreco, Claudiu Tanase, Hei...
Nonnegative Matrix Factorization (NMF) has received considerable attention due to its psychological and physiological interpretation of naturally occurring data whose rep...
Most concept recognition in visual multimedia is based on relatively simple concepts, things which are present in the image or video. These usually correspond to objects which can ...
Peng Wang, Lifeng Sun, Shiqiang Yang, Alan F. Smea...
Object proposal is used as a fundamental preprocessing step in various multimedia applications, detecting candidate object regions in images. In this paper, we propose a n...
In this demo, we present GrillCam, a mobile real-time eating action recognition system. It continuously recognizes the user’s eating action and estimates categories of ...